Remix.run Logo
MattCruikshank 7 hours ago

I know feelings about AI are mixed. But when AI can dream up gaussian splats in real time, from a prompt, and do refinement as you get closer to things... That's going to be pretty bonkers.

perching_aix 7 hours ago | parent | next [-]

That's kinda what NERFs are (neural radience fields). They actually preceeded this Gaussian story, with Gaussians coming in and outperforming them. Maybe they'll merge later for something even better, I don't know enough about them.

MattCruikshank 7 hours ago | parent | next [-]

Sure, but NERFs were trying to match your input photos and poses, not some arbitrary prompt, if I understand correctly.

Lerc 7 hours ago | parent [-]

Yes they are image generators. You want image generator generators.

A diffusion style process generating gausians instead of pixels. You could possibly do nerfs that way, but it would be effectively generating a trained network. If you managed to do that it would have broad application throughout the field of AI.

dpoloncsak 4 hours ago | parent [-]

What would this look like in practice? A net that outputs weights for a new net to use?

xigoi 4 hours ago | parent [-]

Couldn’t you “uncurry” such a process to have only a single network?

dpoloncsak 3 hours ago | parent [-]

Probably? I'm no expert, just a SysAdmin trying to keep up really... but in my head it's would look like a form of MoE that would gen the 'Expert' model on demand instead of having a variety baked in.

That's assuming you could even reasonably train a neural net to output viable weights, of course.

cubefox 6 hours ago | parent | prev [-]

NERFs have significantly higher image quality than 3D Gaussian Splatting or more recent similar techniques, though they are much slower to render.

thrownthatway 6 hours ago | parent [-]

This one month old video did a reasonable job of getting my entirely ignorant self relatively up to day on NERFs and Gaussian Splats:

https://youtu.be/X8yRlA7jqEQ

basch 4 hours ago | parent | prev | next [-]

I could see a kind of fun game / design tool / worldbuilding where you get a blurry world and you describe what you are seeing, and it comes into focus. The game world, mechanics, aesthetic, and playstyle build as you form your view. A sort of fog of war meets rorschach game.

corysama 2 hours ago | parent | prev | next [-]

We are currently at real-time video generation that can be converted to splats or meshes.

https://research.nvidia.com/labs/sil/projects/lyra2/

Lerc 7 hours ago | parent | prev | next [-]

This will be the future of a class of 3d Game. the prompt may not be text however.

An input of a kind of schematic representation of what the designer wants would be better. It may resemble a storyboard or a collection of organised notes that large projects tend to already use.

Fully generative could probably do some cool things, but people will still want to bring their peronal vision to life.

satvikpendem 6 hours ago | parent | next [-]

Curious, why wouldn't the future be a full world model like Google's Genie? It just renders every pixel so someone could still make their vision come to life via a prompt too.

Lerc 23 minutes ago | parent [-]

It could be done that way but you are spending parameters managing the fact that the output changes completely with a change in view position or orientation. A observer independent model only has to manage changes of things that are actually changing in the world.

Since you can view Gaussian splats from any POV you end up generating an output that is closer to the representation of the world instead of a projection that a single observer sees.

MattCruikshank 6 hours ago | parent | prev [-]

Yeah, when you describe that, I picture Wave Function Collapse to generate a map schematic... And then a text prompt, and some style photos the designers want it to match.

notdefio 7 hours ago | parent | prev [-]

This sounds like it could be a great concept for a future sequel to LSD: Dream Emulator

yard2010 7 hours ago | parent [-]

If I'm not mistaken that is the inspiration for one of Alt-J albums