villgax 4 hours ago
That’s not the actual time if you run it; encoding and decoding are extra.
Lerc an hour ago
Nevertheless, it does seem that generation will fairly soon become fast enough to extend a video clip in real time, autoregressively, one second at a time. Integrated with a multimodal input model, you would be very close to an AI avatar that would be extremely compelling.