| ▲ | aakashks 2 days ago | |
The video compression is very cool. And the small tricks like binning the mouse movements. Wonder how much data is generalizable across different UIs? ie how good will the model be at using Figma if it’s never seen it before but has seen a lot of Photoshop | ||
| ▲ | nee1r 2 days ago | parent [-] | |
this is honestly an issue for the inverse dynamics (for app specific shortcuts etc.) but for general UI learning we still see promising eval trends | ||