Remix.run Logo
Achterlangs 6 hours ago

It is not about the country but the language. Most llms have poor or no support for Dutch.

tgv 6 hours ago | parent | next [-]

Idk which models you refer to, but I tested a bunch recently, and they performed well on Dutch. Only the smallest, such as qwen 3.6 27B, made up words and switched languages.

dvdkon 3 hours ago | parent | next [-]

There would be a bunch of value in having, say, a good 30B-class model that used my local language as well as it does English. There's lots of cases, especially in the government sphere, where local processing is a requirement and frontier-level capabilities aren't required. Making those cheap to run seems like a fine goal.

throw310822 2 hours ago | parent [-]

Can you provide some examples of these use cases?

numeri 3 hours ago | parent | prev [-]

There's a large gap between making up words and an actually native text distribution. LLMs have a clear pattern, clear tells, a "feel" in English, and it's normally even more pronounced in non-English languages.

Lots of bias towards English sentence structure, idioms, etiquette, etc.

throw310822 2 hours ago | parent | prev [-]

I don't understand this. Even if that were true (and it isn't in my experience), a model that is trained on a Dutch corpus and arguably "knows Dutch well" but has the reasoning and comprehension abilities of a three year old is useless in any case. I'd rather use a model that can only speak English and put an automatic translator around it.