gofreddygo 12 hours ago

there are 31 emdashes in that piece. the domain ends with _ai_

wood_spirit 12 hours ago | parent | next [-]

It’s a tangent but two points:

First, the reason LLMs learned to like em dashes is that they are common in the training corpus - they predate LLMs and are something LLMs learned, not invented?

Second, my work browser puts nice blue squiggles under everything I write into a textbox. I dutifully click through them and accept the rephrasing suggestions. I get a lot of em dashes. My blog posts and whitepapers and stuff are full of them and other “AI tells” - but I think they read better because of it.

jorisw 12 hours ago | parent | prev | next [-]

I use emdashes all the time. They're correct punctuation as opposed to a minus sign. They're easy to type too: opt-shift-minus. If they were such a huge giveaway without ever being used by humans, models would be trained by now not to use them as much.

The blog is about AI. So yeah the TLD is .ai

phainopepla2 10 hours ago | parent [-]

I've never seen writing created before the advent of LLMs that used emdashes in the same way and with the same frequency that LLMs regularly do. There's probably some out there but it would be a real outlier. LLMs overuse them to an absurd degree, putting them where most writers would put commas, occasionally semi-colons, or nothing at all.

I count 51 em-dashes on the page, which is extreme. They're also used in places where they don't really belong. It's very obviously LLM-generated, at least in part.

That said, it puzzles me why people don't prompt LLMs to change up the writing style a bit and remove some of the tells.

tiahura 12 hours ago | parent | prev [-]

I can't imagine why a system designed to reproduce the best writing styles would frequently use em dashes.