| ▲ | janalsncm 4 hours ago | |
This model says it accepts video inputs. I asked it to transcribe a 5 second video of a digital water curtain which spelled “Boo Happy Halloween”, and it came back with “Happy” which wasn’t the first frame, but also is incomplete. This kind of test is good because it requires stitching together info from the whole video. | ||
| ▲ | aabhay 4 hours ago | parent [-] | |
It reads videos at 1fps by default. You have to set the video resolution to high in ai studio | ||