Remix clone Hacker News
new
|
show
|
ask
|
jobs
Github
▲
preetsojitra
9 hours ago
Meta's Perception Encoder Audio-Visual, its CLIP like but has three modality: Audio, Video and Text