spiderfarmer 18 hours ago

These small models are very cheap for "good enough" translations. I just translated 6M comments on my platform with Gemma 32B and this model seems to be on par.

It's cheap enough that I'm currently doing a second pass where another model critiques and, if needed, rewrites the original translation.
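For anyone wondering what that second pass looks like, here is a rough sketch, not my actual pipeline. `translate` and `critique` are hypothetical callables wrapping whichever models you use; the only assumption is that the critic returns a rewrite when it objects to the draft and `None` when the draft is fine.

```python
from typing import Callable, List, Optional

# Hypothetical interfaces: wrap your own model calls in these shapes.
Translator = Callable[[str], str]             # source text -> draft translation
Critic = Callable[[str, str], Optional[str]]  # (source, draft) -> rewrite, or None if draft is fine


def two_pass_translate(source: str, translate: Translator, critique: Critic) -> str:
    """First pass translates; second pass lets another model replace the draft."""
    draft = translate(source)
    rewrite = critique(source, draft)
    # Keep the draft unless the critic produced a non-empty rewrite.
    if rewrite is None or not rewrite.strip():
        return draft
    return rewrite


def translate_all(comments: List[str], translate: Translator, critique: Critic) -> List[str]:
    """Apply the two-pass scheme to every comment, preserving order."""
    return [two_pass_translate(c, translate, critique) for c in comments]
```

Keeping the critic as a separate callable makes it easy to swap in a different (or smaller) model for the second pass and compare results.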

deaux 18 hours ago

To English, I assume, for casual perusal? I ask before people unfamiliar with this topic start thinking small models are decent at translating between arbitrary language pairs. They're poor at translating into the overwhelming majority of languages, and I wouldn't recommend them for anything user-facing.

larodi 18 hours ago

Is the second pass run with the same Gemma? Would the 12B perhaps perform similarly?