Not sure why they put so much investment into videoSlop and imageSlop. Anthropic seems to be more focused at least.

▲ Ginden 11 hours ago | parent | next [-]

Because almost everyone involved in AI race grew up in "winner takes it all" environments, typical for software, and they try really hard to make it reality. This means your model should do everything to just take 90% of market share, or at least 90% of specific niche.

The problem is, they can't find the moat, despite searching very hard, whatever you bake into your AI, your competitors will be able to replicate in few months. This is why OpenAI is striking deal with Disney, because copyright provides such moat.

▲

worldsayshi 10 hours ago | parent | next [-]

> your competitors will be able to replicate in few months.

Will they really be able to replicate the quality while spending significantly less in compute investment? If not then the moat is still how much capital you can acquire for burning on training?

	▲	Ginden 8 minutes ago \| parent \| next [-]
		There are multiple tech companies with quadrillion-deep pockets.
	▲	AlotOfReading 10 hours ago \| parent \| prev \| next [-]
		Is that not what distillation is?
	▲	the_gipsy 10 hours ago \| parent \| prev [-]
		What does moat even mean anymore

▲

thisgetsit 10 hours ago | parent | prev | next [-]

> copyright provides such a moat.

Been saying this since the 2016 Alice case. Apple jumped into content production in 2017. They saw the long term value of copyright interests.

https://arstechnica.com/information-technology/2017/08/apple...

Alice changed things such that code monkeys algorithms were not patentable (except in some narrow cases where true runtime novelty can be established.) Since the transformers paper, the potential of self authoring content was obvious to those who can afford to think about things rather than hustle all day.

Apple wants to sell AI in an aluminum box while VCs need to prop up data center agrarianism; they need people to believe their server farms are essential.

Not an Apple fanboy but in this case, am rooting for their "your hardware, your model" aspirations.

Altman, Thiel, the VC model of make the serfs tend their server fields, their control of foundation models, is a gross feeling. It comes with the most religious like sense of fealty to political hierarchy and social structure that only exists as hallucination in the dying generations. The 50+ year old crowd cannot generationally churn fast enough.

▲

wincy an hour ago | parent | next [-]

OpenAIs opsec must be amazing, I had fully expected some version of ChatGPT to be leaked on torrent sites at some point this year. How do you manage to avoid something that could be exfiltrated on a hard disk from escaping your servers in all cases, forever?

	▲	KaiserPro 15 minutes ago \| parent [-]
		The model size is probably the thing here. I suspect they took the FAANG remote workstation approach, where VScode runs on a remote machine. After all its not that great having a desktop with 8 monster GPUs under your desk. (x100) Plus moving all that data about is expensive. Keeping things in the datacenter is means its faster and easier to secure.

▲

CodingJeebus 9 hours ago | parent | prev | next [-]

Totally agree, people love to talk about how hopelessly behind Apple is in terms of AI progress when they’re in a better position to compete directly against Nvidia on hardware than anyone else.

▲

madeofpalk 8 hours ago | parent [-]

Apple's always had great potential. They've struggled to execute on it.

But really, so has everyone else. There's two "races" for AI - creating models, and finding a consumer use case for them. Apple just isn't competing in creating models similar to the likes of OpenAI or Google. They also haven't really done much with using AI technology to deliver 'revolutionary' general purpose user-facing features using LLMs, but neither has anyone else beyond chat bots.

I'm not convinced ChatGPT as a consumer product can sustain current valuations, and everyone is still clamouring to find another way to present this tech to consumers.

	▲	calvinmorrison 6 hours ago \| parent [-]
		I think a major part of it is the shovel selling. Nvidia is selling shovels to OpenAI. OpenAI is selling shovels to endless B2B, Consulting, Accounting, software firms buying into it...

▲

PeterHolzwarth 6 hours ago | parent | prev [-]

My goodness, are you really saying, in effect, "I wish people over 50 would just hurry up and die"?!?

Good lord, expressing that kind of sentiment does not make for a useful and engaging conversation here on hacker news.

	▲	Zetaphor 5 hours ago \| parent [-]
		Clutch pearls

▲

sod22 10 hours ago | parent | prev | next [-]

Striking deals without a proper vision is a waste of resources. And that’s the path OAI is on.

▲

odo1242 9 hours ago | parent | prev [-]

It's also why they bought 40% of the world's RAM supply, too

	▲	aenis 6 hours ago \| parent [-]
		Committed to buying. They dont have the money to actually buy it (at least not yet).

▲ dktp 10 hours ago | parent | prev | next [-]

OpenAI is (was?) extremely good at making things that go viral. The successful ones for sure boost subscriber count meaningfully

Studio Ghibli, Sora app. Go viral, juice numbers then turn the knobs down on copyrighted material. Atlas I believe was a less successful than they would've hoped for.

And because of too frequent version bumps that are sometimes released as an answer to Google's launch, rather than a meaningful improvement - I believe they're also having harder time going viral that way

Overall OpenAI throws stuff at the wall and see what sticks. Most of it doesn't and gets (semi) abandoned. But some of it does and it makes for better consumer product than Gemini

It seems to have worked well so far, though I'm sceptical it will be enough for long

	▲	johnnyanmac 8 hours ago \| parent \| next [-]
		Going viral is great when you're a small team or even a million dollar company. That can make or break your business. Going viral as a billion dollar company spending upward of 1T is still not sustainable. You can't pay off a trillion dollars on "engagement". The entire advertising industry is "only" worth 1T as is: https://www.investors.com/news/advertising-industry-to-hit-1...
	▲	raw_anon_1111 9 hours ago \| parent \| prev [-]
		Selling a bunch of $20 a month subscriptions isn’t going to make a dent in OpenAI losses. Going viral for a day or two doesn’t help. Normal people are already getting tired of AI Slop

▲ piskov 11 hours ago | parent | prev | next [-]

Because as with the internet 99% of the usage won’t be for education, work, personal development, what have you. It will be for effing kitten videos and memes.

▲

only-one1701 10 hours ago | parent | next [-]

That’s an unusual way of saying uh…adult entertainment

▲

nine_k 11 hours ago | parent | prev | next [-]

Are the posters of effing kitten videos a customer base with a significant LTV?

(The obvious well-paying market would be erotic / furry / porn, but it's too toxic to publicly touch, at least in the US.)

	▲	piskov 11 hours ago \| parent \| next [-]
		Openrouter stats already mention 52% usage is roleplay. As for photo/video very large number of people use it for friends and family (turn photo into creative/funny video, change photo, etc.). Also I would think photoshop-like features are coming more and more in chatgpt and alike. For example, “take my poorly-lit photo and make it look professional and suitable for linkedin profile”
	▲	Tcepsa 6 hours ago \| parent \| prev [-]
		Also FWIW I understand that the furry community has a strong culture of commissioning artists for their work, so that's likely to be a headwind against using genAI that isn't explicitly trained only on licensed materials. Sure, there are likely some who would use it regardless, but I expect the use of genAI to generate furry porn to be at least as toxic within that community as the use of genAI to generate furry porn outside of that community.

▲

candiddevmike 11 hours ago | parent | prev [-]

If only 99% of the Internet was kitten videos and memes

	▲	piskov 11 hours ago \| parent [-]
		Well, it sure as hell not all 3blue1brown, crr0ww, Feynman, and alike

▲ Alconicon 11 hours ago | parent | prev | next [-]

Because OpenAI stands for AI leader.

If Gemini can create or edit an image, chatgpt needs to be able to do this too. Who wants to copy&paste prompts between ai agents?

Also if you want to have more semantics, you add image, video and audio to your model. It gets smarter because of it.

OpenAI is also relevant bigger than antropic and is known as a generic 'helper'. Antropic probably saw the benefits of being more focused on developer which allows it to succeed longer in the game for the amount of money they have.

▲ JumpCrisscross 10 hours ago | parent | next [-]

> Who wants to copy&paste prompts between ai agents?

An AI!

The specialist vs generalist debate is still open. And for complex problems, sure, having a model that runs on a small galaxy may be worth it. But for most tasks, a fleet of tailor-made smaller models being called on by an agent seems like a solidly-precedented (albeit not singularity-triggering) bet.

▲ andrekandre 9 hours ago | parent [-]

  > But for most tasks, a fleet of tailor-made smaller models being called on by an agent seems like a solidly-precedented (albeit not singularity-triggering) bet.

not an expert by any means, but wouldn't smaller but highly refined models also output more reproducible results?

intuitively it sounds akin to the unix model...

	▲	nubg 7 hours ago \| parent [-]
		But then again the main selling point of using LLMs as part of some code that solves a certain business need is that you don't have to finetune a usecase-specific model (like in the mid 2010s), you just prompt engineer a bit and it often magically works.

▲ password54321 11 hours ago | parent | prev | next [-]

>Also if you want to have more semantics, you add image, video and audio to your model. It gets smarter because of it.

I think you are confusing generation with analysis. As far I am aware your model does not need to be good at generating images to be able to decode an image.

▲

adastra22 11 hours ago | parent [-]

It is, to first approximation, the same thing. The generative part of genAI is just running the analysis model in reverse.

Now there are all sorts of tricks to get the output of this to be good, and maybe they shouldn't be spending time and resources on this. But the core capability is shared.

	▲	kaoD 11 hours ago \| parent [-]
		> The generative part of genAI is just running the analysis model in reverse. I think that hasn't been the case since DeepDream?

▲ mbreese 11 hours ago | parent | prev | next [-]

I think you're partially right, but I don't think being an AI leader is the main motivation -- that's a side effect.

I think it's important to OpenAI to support as many use-cases as possible. Right now, the experience that most people have with ChatGPT is through small revenue individual accounts. Individual subscriptions with individual needs, but modest budgets.

The bigger money is in enterprise and corporate accounts. To land these accounts, OpenAI will need to provide coverage across as many use-cases as they can so that they can operate as a one-stop AI provider. If a company needs to use OpenAI for chat, Anthropic for coding, and Google for video, what's the point? If Google's chat and coding is "good enough" and you need to have video generation, then that company is going to go with Google for everything. For the end-game I think OpenAI is playing for, they will need to be competitive in all modalities of AI.

▲ nutjob2 9 hours ago | parent | prev [-]

> Because OpenAI stands for AI leader.

It'll just end up spreading itself too thin and be second or third best at everything.

The 500lb gorilla in the room is Google. They have endless money and maybe even more importantly they have endless hardware. OpenAI are going to have an increasingly hard time competing with them.

That Gemini 3 is crushing it right now isn't the problem. It's Gemini 4 or 5 that will likely leave them in the dust for the general use case, meanwhile specialist models will eat what remains of their lunch.

▲ mFixman 2 hours ago | parent | prev | next [-]

Because there is only so much programmers and companies will pay for AI coders. The big prizes is AI-generated TikTok.

The entertainment industry is by far the easiest way to tap into global discretionary income.

▲ conception 6 hours ago | parent | prev | next [-]

When they released their first good image model is when they got a new 100 million users in a week.

▲ jdminhbg 11 hours ago | parent | prev | next [-]

Because for all the incessant whining about "slop," multimodal AI i/o is incredibly useful. Being able to take a photo of a home repair issue, have it diagnosed, and return a diagram showing you what to do with it is great, and it's the same algos that power the slop. "Sorry, you'll have to go to Gemini for that use case, people got mad about memes on the internet" is not really a good way for them to be a mass consumer company.

▲

tayo42 11 hours ago | parent [-]

Can Claude not do that? I've sent it pictures for simpler things and got answers, usually Id of bugs and plants.

	▲	esafak 10 hours ago \| parent [-]
		Yes, Claude is multi-modal.

▲ dyauspitr 10 hours ago | parent | prev | next [-]

Because those and world models are the endgame, way way more than text is.

▲ Cyclone_ 10 hours ago | parent | prev | next [-]

The fact that they do this isn't very bullish for them achieving whatever they define as AGI.

▲

fastball 10 hours ago | parent [-]

You don't expect AGI to be multi-modal?

▲

madeofpalk 8 hours ago | parent [-]

What is AGI?

	▲	fittingopposite 7 hours ago \| parent [-]
		Artificial general intelligence

▲ SAI_Peregrinus 12 hours ago | parent | prev | next [-]

Because their main use is for advertising/propaganda, which is largely videoSlop & imageSlop even without AI.

▲

password54321 12 hours ago | parent [-]

Outside of this: https://openai.com/index/disney-sora-agreement/ I don't think there has been much of a win for them even in advertising for image/video slop.

▲

anomaly_ 11 hours ago | parent [-]

It's like half the poster on here live in some parallel universe. I am making real money using generated image/video advertising content for both B2C and B2B goods. I am using Whisper and LLMs to review customer service call logs at scale and identity development opportunities for staff. I am using GPT/Gemini to help write SQL queries and little python scripts to do data analysis on my customer base. My business's productivity is way up since GenAI become accessible.

	▲	bdangubic 10 hours ago \| parent [-]
		that (very vocal) half tried it once and it didn’t work :)

▲ johnnyanmac 9 hours ago | parent | prev | next [-]

because these are mostly the same players of the 2010's. So when they can't get more investor money and the hard problems are still being cracked, the easiest fallback is the same social media slop they used to become successful 10-15 years prior. Falling back on old ways to maximize engagement and grind out (eventually) ad revenue.

▲ johnnyfived 11 hours ago | parent | prev [-]

But how much more profitable are they? We see revenue but not profits / spending. Anthropic seems to be growing faster than OpenAI did but that could be the benefit of post-GPT hype.