Remix.run Logo
InsideOutSanta 3 hours ago

I believe it, because it makes a kind of sense. Post-training has a huge impact on how well LLMs perform, and labeled data is what determines the effectiveness of post-training. This is why companies like Anthropic are so worried about distillation.

So if you have access to a large number of highly skilled people, and you really don't absolutely need them to do other things, why wouldn't you force data labeling tasks on them?

Facebook is also planning a 10% layoff, so this also works as encouragement for people to leave voluntarily.

(Before you downvote me, note that I'm not endorsing this or saying it's a good idea. I'm just saying that I believe it's true, because I can see how Facebook's leadership would think it's a good idea.)

layer8 3 hours ago | parent | next [-]

From the article:

> Forced data labeling with 4,500+ engineers is to generate high-quality RLHF

I doubt that you get high quality from forced reassignments where the now-data labelers don’t actually want to do that kind of work.

It’s crazy to think that Meta leadership believed that it makes sense.

xnx 3 hours ago | parent | prev | next [-]

Do the skills these people have overlap with the skills needed for a good data labeler? I'm guessing being a domain expert is most valuable as a data labeler.

HDThoreaun 3 hours ago | parent | prev [-]

Because you can just get rid of all those people and do the data labeling tasks for 1/4 the cost?

vanuatu 3 hours ago | parent [-]

unironically if those engineers were considered to be 'bloat' its better to have them label data because they are smarter and vetted

basically a soft layoff