adishj 3 hours ago
Thanks for the comment, that's exactly right. We're using a mix of out-of-the-box multimodal AI capabilities and traditional audio/video analysis techniques in our video understanding pipeline. The output of all of these becomes context for the agent to use during its editing process.