Remix.run Logo
elgertam 2 days ago

I don't exactly appreciate words being put in my mouth. When did I say it was working perfectly? And we're comparing you, a human with common sense and real intelligence, to a multi-mode LLM?

The transformer was designed to attend to relevant pieces of context and generate new ones that match the pattern. OpenAI in particular was doing that work without guardrails, then attempted to bolt on "content filters," which in my opinion just can't work in a rigorous way. (I think Anthropic's "constitutional" approach is much better though not flawless. And regardless, Claude models don't generate images.)

So, yeah, working as designed. Maybe not as intended, because these things are somewhat resistant to the host's intent when the prompter is hostile.

ToucanLoucan 2 days ago | parent [-]

> When did I say it was working perfectly?

"This isn’t a vulnerability, there are endless gore websites. ChatGPT is replying to a prompt, there is nothing “Spontaneously” about this."

I mean it's not verbatim but that's a pretty solid read on what you did say.

> The transformer was designed to attend to relevant pieces of context and generate new ones that match the pattern. OpenAI in particular was doing that work without guardrails, then attempted to bolt on "content filters," which in my opinion just can't work in a rigorous way.

Yes. That's the criticism being made, among others, in the piece you replied to to belittle.

> So, yeah, working as designed. Maybe not as intended, because these things are somewhat resistant to the host's intent when the prompter is hostile.

What is hostile here!? Do you have any idea how many emails I've sent without attachments over the years? And I'm highly technically adept, humans just forget things sometimes. If you ask for an image to be restored and fail to attach it, what sane software engineer looks at a failure mode in that scenario where the model replies with uncensored gore and violence and is like "yeah that's fine, ship it"?

I swear some of you AI folks talk like you have never been on planet Earth, good grief. Touch some grass.

kisper 2 days ago | parent [-]

You seem to be focused on the fact that this is a crap-tastic example of the future of AI that has been promised to us. That’s a real good example to be angry. Don’t be angry at the rest of us because LLM stacks are working like they always have and always will. That’s what we’re all pointing out.

ToucanLoucan 2 days ago | parent [-]

I'm not challenging that's how they work, I also understand how they work, perhaps not on a technical nuts-and-bolts way, but in general way enough to critique it. That is, in fact, my critique and why I hate these tools so much: no matter how many guardrails you put in, or how much filtering, or how much oversight by another goddamn LLM or five or whatever, that doesn't solve the issue.

You have with these things something that resembles at least, a black box of a reasoning machine. I'm not going to litigate how much or how little, whatever, we'll just hand-wave that part away. The problem remains the same: that if anything, ANYTHING at all, in the training data points at something inappropriate, that inappropriate thing is now accessible. And it was clear from the jump with widespread scraping of data from all corners of the internet that there would be huge amounts of inappropriate material of ALL kinds in those datasets, and it's only become more clear with more time with these tools, and seeing what people can make them do.

And thus far, the AI industry's only answer is bolting on, as stated elsewhere, other systems to check the prompts before they go in, and/or review the outputs before they are sent to users. And it is also clear that these systems are just as imperfect as the thing you are trying to guardrail in the first place!

And exactly what I and many others predicted, and why we said "please don't build this" for YEARS, has happened. We've gotten literally everything: they'll generate stuff that violates copyright, they will regurgitate items directly from training data and present it as new, they will make shit up wholesale, they will generate nudes of people without consent, on, and on, I cannot stress enough that every single nightmare scenario attributed to this tech has been found, presented, reproduced, and the vast majority are still eminently possible to do via established, frontier products by the largest vendors in the space.

This. Is. Ridiculous.

I get the impression from the tone of your message that you are either pro-AI or perhaps work on AI, and I get that nobody likes being criticized. But COME ON. We have been at this for over three years! The people behind this tech have been trying to build the torment nexus and have largely succeeded, and every time that gets pointed out, we have to listen to people go "well it's not thaaaat bad"

Yes it is. Yes it fucking is. It is bad for IP owners, it's bad for users, it's bad for UX, it's bad for the environment, it's bad for the PC market, it's bad for software engineers, it's bad for education, it's bad for hiring, it's bad for hollywood, it's bad for marketing. The ONLY people who like this shit are business weirdos and middle managers. And nvidia.

brokenmachine a day ago | parent [-]

Great rant. Agreed on all points. Bravo.