Havoc 3 hours ago
Are world models built from the perspective of an observer inside the world, or from a zoomed-out view? Or, in gaming terms, do these models think FPS or RTS? Text models and pixel-grid vision models are easy to picture, but I'm struggling to wrap my head around what a world model "sees", so to speak.