Remix.run Logo
ratg13 4 days ago

I can understand training AIs on books, and even internet forums, but I can't help but think that training an AI on lots of dumb questions with probably an excessive amount of grammar and spelling errors will somehow make it smarter.

nrclark 4 days ago | parent | next [-]

Depends on how you’re using the data. There’s a pretty strong correctness signal in the user behavior.

Did they rephrase the question? Probably the first answer was wrong. Did the session end? Good chance the answer was acceptable. Did they ask follow-ups? What kind? Etc.

vb-8448 4 days ago | parent | next [-]

I'm used to doing the same task 4 or 5 times (different sessions, similar prompts), and most of the time the result is useless or completely wrong. Sometimes I go back and pick the first result, other time none of them, other time a mix of them. I'm wondering how can they extract value from this.

dudefeliciano 4 days ago | parent | prev [-]

> Did the session end? Good chance the answer was acceptable.

Or that the user just ragequit

victorbjorklund 4 days ago | parent | prev | next [-]

Doubt they feed everything in. They probably pick out a small subset of conversations for the training round.

mrweasel 4 days ago | parent | prev | next [-]

They train AI on Reddit and Stack Overflow questions, I can't see it getting any worse.

dahsameer 4 days ago | parent | prev [-]

> and even internet forums

i would consider internet forums also includes a lot of dumb questions

ratg13 4 days ago | parent | next [-]

Agree, but people generally take a small pause before saying stuff online.

In 'private', people are less ashamed of their ignorance, and also know they can say gibberish and the AI will figure it out.

timeon 4 days ago | parent | prev [-]

Like what?