| ▲ | Ask HN: Why do LLMs use em dashes so often? | ||||||||||||||||
| 2 points by dwa3592 6 hours ago | 5 comments | |||||||||||||||||
Why is it that all of the LLMs by default use em dashes more than any other punctuation? Is it the versatility of em dashes? But even then how did LLMs decide that em dashes were the versatile punctuation marks. I have been reading aldaily for the last 11-12 years and I have not seen em dashes used that frequently. LLMs insert em dash in almost every long response. | |||||||||||||||||
| ▲ | dlcarrier 3 hours ago | parent | next [-] | ||||||||||||||||
It follows the same style as the training data, and academic papers, which use a lot of em dashes, are an excellent source of training data. | |||||||||||||||||
| ▲ | carlos_rpn 4 hours ago | parent | prev | next [-] | ||||||||||||||||
Maybe this will help: https://marcusolang.substack.com/p/im-kenyan-i-dont-write-li... And I think I saw some others similar articles and blog posts over the years, but the tl;dr; is that a large enough ammount of text used in their training contains em-dashes because a large ammount of writers use them. | |||||||||||||||||
| ▲ | alexavilov 6 hours ago | parent | prev [-] | ||||||||||||||||
it's a quick and almost sure AI created text tell :) | |||||||||||||||||
| |||||||||||||||||