DaveZale 3 days ago
Ummm... doesn't the AI have to scrape the data of those non-WEIRD cultures to work, then? What am I missing here? There are parts of the world where constant person-electronic connection isn't a thing. Is that your point?
psidium 3 days ago
I don’t have the data, but I assume the corpus available to train an LLM is mostly in English, written by Americans and their Western counterparts. If we’re training LLMs to sound like their training data, I imagine the responses have to match that worldview. My anecdote: before LLMs, I would default to searching Google in English instead of my own native language, simply because there was so much more helpful content to be found in English. And here I am producing novel sentences in English to respond to your message, further continuing the cycle in which English is the main language for searching and doing things.
YurgenJurgensen 3 days ago
“Fancy autocomplete better at completing documents similar to ones it has seen before” isn’t as headline-worthy.