| ▲ | Artgor 5 hours ago | |||||||
I'm cautiously waiting for feedback from the first users. Meta has produced a lot of great models (Llama), so maybe this is a comeback... but I'm cautious, as the jump in quality is almost too high. Also, I think people aren't used to the fact that using such models requires meta.ai or the Meta AI app. | ||||||||
| ▲ | solenoid0937 5 hours ago | |||||||
My Meta friends say it's benchmaxxed af | ||||||||
| ▲ | conradkay 5 hours ago | |||||||
It doesn't seem benchmaxxed: the ARC AGI 2 score is quite bad (42.5%, GPT 5.4 is 76.1%) and coding is okay. But maybe this is the best Meta can do even when benchmaxxing. The impressive part is multimodality, which is very plausible since other labs (especially Anthropic) focus less on it. | ||||||||
| ▲ | dbgrman 2 hours ago | |||||||
Given that Llama 4 mucked up its benchmark numbers, I’d take the Spark announcement with many grains of salt. | ||||||||