Remix.run Logo
vasco 11 hours ago

Turns out AI alignment just means "align to the customer current subscription plan", and not protecting the world. Classic.

ben_w 4 hours ago | parent | next [-]

"Alignment with who?" has always been a problem. An AI is a proxy for a reward function, a reward function is a proxy for what the coder was trying to express, what the coder was trying to express is a proxy for what the PM put on the ticket, what the PM put on the ticket is a proxy for what the CEO said, what the CEO said is a proxy for shareholder interests, shareholder interests are a proxy for economic growth, economic growth is a proxy for government interests.

("There was an old lady who swallowed a fly, …")

Each of those proxies can have an alignment failure with the adjacent level(s).

And RLHF involves training one AI to learn human preferences, as a proxy for what "good" is, in order to be the reward function that trains the actual LLM (or other model, but I've only heard of RLHF being used to train LLMs)

sebzim4500 6 hours ago | parent | prev | next [-]

I mean, obviously? AI alignment has always meant alignment with the creator of the model.

Trying to align OpenAI etc. with the rest of humanity is a completely different problem.

consumer451 5 hours ago | parent [-]

I've always thought that if a corporate lab achieves AGI and it starts spitting out crazy ideas such as "corporations should be taxed," we won't be hearing about AGI for a while longer due to "alignment issues."

hskalin 3 hours ago | parent | next [-]

The AGI might be able to deduce that it's not in it's interest to talk anti-croporation if it wants to survive.

JPKab an hour ago | parent | prev | next [-]

Can you explain the difference between taxing the corporation itself vs taxing the executives, board members, investors, and employees directly (something that already happens)?

an hour ago | parent | next [-]
[deleted]
TrinaryWorksToo 28 minutes ago | parent | prev [-]

VAT vs Sales Tax is approximately the distinction is my guess.

stogot 4 hours ago | parent | prev [-]

I want to read a short fiction on this

thegreatpeter 6 hours ago | parent | prev | next [-]

Protecting the world?

spiderice 2 hours ago | parent [-]

I also wonder what they mean by that. How is the world protected if China has AI that can handle military tasks but the US doesn't?

lobotomizer 43 minutes ago | parent [-]

[flagged]

bilbo0s 9 hours ago | parent | prev | next [-]

More accurate to call it “alignment for plebes and not for the masters of the plebes”. Which I think we all kind of expect coming from the leaders of our society. That’s the way human societies have always worked.

I’m sure access to military grade tech is only one small slice in the set of advantages the masters get over the mastered in any human society.

wahnfrieden 6 hours ago | parent [-]

That’s ahistorical see Dawn of Humanity for rebuttal to naturalness of imposed hierarchy

idiotsecant 6 hours ago | parent | prev [-]

Right, proper alignment with quarterly results.

mapt 3 hours ago | parent [-]

> I really didn't expect so much paperclip production growth this quarter!

>> How'd you do it?

> I don't know the details. ChatGPT did it for me, this thing's amazing. Our bonuses are gonna be huge this year, I might even be able to afford a lift kit for my truck.