applfanboysbgon 4 hours ago
Spellchecking is absolutely not a solved problem. I immediately disable spellchecking wherever it shows up, because every browser, device, and application has its own spellchecker for some godforsaken reason, and managing a separate dictionary in each one just to keep squigglies from being spammed over every piece of jargon, slang, and slightly atypical spelling is incredibly annoying. I don't know how effective LLMs are, but it's difficult to imagine they could be worse than the existing regime, which is embarrassingly bad given the decades it's been around.
quuxplusone 3 hours ago
An interesting idea I saw long ago in some book (I thought it was K&P's "Software Tools," or my second guess was K&R1, but neither of those panned out; a strong Mandela effect) was a whole-document spellchecker that works purely probabilistically, by histograms: you feed it a document, it tallies the trigraphs, and any trigraph that appears only rarely is flagged as a likely typo. This approach lets through unknown-but-realistic words like "antithematory" while flagging unrealistic words like "prisencolinensinainciusol" (because of its unlikely "ciu" and "ius" clusters) and "antthemaory" (because of "ntt" and "aor"). To make this approach work better, feed it a bunch of English text (or whatever language your document is in) before the document you really want to "spellcheck." Essentially this isn't a spell "checker" so much as a spell "linter": it looks for antipatterns statistically associated with bugs, and reports the patterns for further investigation. If anyone knows where this trigraph-based "spellchecker" was first presented, I'd love to find it again.
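
To make the trigraph idea concrete, here is a rough sketch in Python. It is my own guess at the shape of it, not the book's version: the names `tally` and `lint`, the lowercase a-z tokenizer, and the `threshold` cutoff for "appears only rarely" are all assumptions.

```python
import re
from collections import Counter


def words(text):
    """Split text into lowercase alphabetic words (assumed tokenizer)."""
    return re.findall(r"[a-z]+", text.lower())


def tally(text):
    """Count every three-letter run inside each word of the text."""
    counts = Counter()
    for w in words(text):
        counts.update(w[i:i + 3] for i in range(len(w) - 2))
    return counts


def lint(document, reference="", threshold=1):
    """Map each suspicious word in the document to its rare trigraphs.

    A trigraph is "rare" if it occurs at most `threshold` times across
    the reference text and the document combined. The cutoff is an
    assumption; the original idea only says "appears only rarely".
    """
    counts = tally(reference) + tally(document)
    flagged = {}
    for w in words(document):
        rare = [w[i:i + 3] for i in range(len(w) - 2)
                if counts[w[i:i + 3]] <= threshold]
        if rare:
            flagged.setdefault(w, rare)
    return flagged


if __name__ == "__main__":
    # Tiny stand-in for "a bunch of English text" fed in beforehand.
    reference = "anthem theme them then " * 3
    print(lint("the antthem theme", reference=reference))
```

With no reference text and a short document, nearly every trigraph is rare, so almost everything gets flagged; the priming corpus does most of the work. Words shorter than three letters have no trigraphs and are never flagged.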
PaulHoule 3 hours ago
Human copy editors are less than perfect too. I hired one copy editor who I could not trust to be the last person to touch a document before it went out. I had a friend who wrote an article for the New York Times: the article made a lot of sense before she submitted it, but after it was edited for length and style it definitely read like a New York Times piece and no longer completely made sense.