Remix.run Logo
vidarh 2 hours ago

The issue with German as well as Norwegian is that a space creates a semantically distinct structure, so it's not that they remove the space, but that one wasn't there in the first place, and some of those compounds then become important enough for the dictionary.

Absolutely not all - there's a near unbounded set of possible compounds.

In Norwegian, we in fact have a compound for the incorrect separation of compounds: "orddelingsfeil" (word separation error). Actually, we have two - technically it's "særskrivingsfeil" (separate writing error), but "orddelingsfeil" is more common... We take this seriously.

The problem is that while some are definitely wrong, others change meaning.

E.g. "en norsk lærer" means "a Norwegian teacher" but "en norsklærer" means "a teacher of the subject Norwegian". There's an infinite set of possible -lærer compounds: If you create a new subject then a teacher of that subject is a <subject>lærer. Obviously they can't all go in the dictionary.

Some other examples:

"Røyk fritt" means "smoke freely" while "røykfritt" means "smokefree". "Steke ovn", means "to fry an oven", while "stekeovn" means "oven". These two belong in the dictionary because they are so common and that though technically you can use "ovn" and "fri"/"fritt" to form a near infinite number of other common forms as well, in practice the number of common forms that use them is quite limited.

The key part is that most compounds in languages like German or Norwegian will only have one valid way of writing them. Add spaces, and you usually end up with something ungrammatical or with an entirely different meaning.

Whereas in English whether or not a word can be written with a space, with a hyphen, or combined much more often changes over time, and can differ in different places at different times, as the <separate words> -> <hyphenated> -> <compound> pipeline in English is slow and arbitrary and not necessarily reflecting a change in meaning.

michaeld123 2 hours ago | parent [-]

I just adapted your comments into the paragraph starting with "German". Hopefully accurately.