mannicken a day ago
Only Python, TypeScript and JavaScript? Well, there go my vibe-coded elisp scripts. I guess it's impossible (or really hard) to train a language-agnostic classifier. For reference, from your own link: https://www.span.app/introducing-span-detect-1
henryl a day ago
It's probably impossible to handle ALL languages without training on them specifically, but we're seeing good generalization. Ours is a single unified model rather than a separate model per language. We started out with language-specific models but found that the unified approach yielded slightly better results and was more efficient to train.
johnsillings a day ago
I'll let Henry elaborate here, but we think there's a chance that a truly language-agnostic classifier is possible. That being said, the next version of this will support a few more languages: Ruby, C#, and Java.