| ▲ | GaggiX 3 hours ago | |
This is not local but Gemini models can process very long videos and provide description with timestamps if asked for. https://ai.google.dev/gemini-api/docs/video-understanding#tr... | ||
| ▲ | embedding-shape 2 hours ago | parent [-] | |
Nor would it be describing things as they happen, but instead needing pre-processing, so in the end, very different :) | ||