| ▲ | ra7 6 hours ago | ||||||||||||||||
The novel aspect here seems to be 3D LiDAR output from 2D video using post-training. As far as I'm aware, no other video world models can do this. IMO, access to DeepMind and Google infra is a hugely understated advantage Waymo has that no other competitor can replicate. | |||||||||||||||||
| ▲ | codexb 4 hours ago | parent | next [-] | ||||||||||||||||
3d from moving 2d images has been a thing for decades. | |||||||||||||||||
| |||||||||||||||||
| ▲ | moffkalast 44 minutes ago | parent | prev [-] | ||||||||||||||||
It's not unheard of, there are a handful [0] of metric monodepth methods that output data that's not unlike a really inaccurate 3D lidar, though theirs certainly looks SOTA. | |||||||||||||||||