| ▲ | postalrat 3 hours ago | ||||||||||||||||
Where do you think llms learned to write that way? | |||||||||||||||||
| ▲ | jlund-molfese 3 hours ago | parent | next [-] | ||||||||||||||||
You can also look at past posts by the same author (before LLM usage proliferated) if you’re curious. The project is still very cool, but it’s a little less enjoyable to read when everything sounds the same. It would be just as annoying for people to manually write in a corporate/marketing style, because humanity is what makes the small web interesting. | |||||||||||||||||
| |||||||||||||||||
| ▲ | tgv 3 hours ago | parent | prev | next [-] | ||||||||||||||||
Because their custom training data contains an emphasis on such verbiage. It doesn't come from the God-knows-how-many TB of web content the model is pre-trained on. There, such phrasing is only a drop in the sea. But the "yes, you're right" phrases, the em dash, etc., come from the later stage, for which content is created according to some (probably overprecise) guidelines. | |||||||||||||||||
| |||||||||||||||||
| ▲ | lelanthran 3 hours ago | parent | prev | next [-] | ||||||||||||||||
> Where do you think llms learned to write that way? Not from individual human content, that's for sure - maybe MLM marketing copy? Sleazy 4AM ads? I mean, every time this response comes up, I keep asking the person to point at something written prior to 2022 that gets 80%+ on the LLM detectors, and yet no one can find anything. Maybe you, postalrat, can find something written in this style that was published prior to 2022. | |||||||||||||||||
| |||||||||||||||||
| ▲ | alehlopeh 3 hours ago | parent | prev [-] | ||||||||||||||||
Marketing content. | |||||||||||||||||