Remix.run Logo
anilgulecha 7 hours ago

I think the industry is moving to English as the programming language, and specifications-context-tdd as the framework for building software.

Many find it distasteful, and many finding liberating. I think it's broadly correlates with how they feel about expressing themselves in english vs say C++.

As a side question, is there anyone who's using LLMs primarily in non-english mode to program? I suspect there's quite a few people using mandarin, but can someone share first-hand account.

SwiftyBug 6 hours ago | parent | next [-]

I wonder how well Mandarin works for LLM-based programming. On one hand, it's very token efficient as Mandarin script is very dense in meaning. On the other, I suppose this can increase ambiguity.

pyonpyon 6 hours ago | parent | prev | next [-]

I'm using it 50% English (personal projects)/50% Polish (workplace; reasons being agents.md / team is not that english proficient) and honestly I haven't seen much difference in the output/ambiguity.

Polish prompts tend to be shorter due to the language having a lot of verb forms/conjugations, the only "bad" thing for me is that when it's saying "it broke" it tends to use uncanny / blunt words that make me sometimes laugh.

thedevilslawyer 6 hours ago | parent [-]

Interesting. Some questions: Would you say polish is more dense or less dense than english? It's interesting to hear that code quality is not suffering but the response text is sillier or blunter. Any other descrepenacies compared to English?

pyonpyon 6 hours ago | parent [-]

I would say it certainly can be more dense but even if it's more dense, the tokenizers count it as more. Last time I checked in OpenAI tokenizer for my agents.md it ate 30/40%~ more tokens than the English version at roughly 1:1 meaning.

pjmlp 6 hours ago | parent | prev | next [-]

I agree, and those are still too focused on code generation for specific languages are fighting the last war.

It is the revenge of UML modeling.

Eventually it will get good enough that what comes out of agent work, is a matter of formal specification.

Assuming that code is actually needed and cannot be achieved as pure agent orchestration workflows.

_woland 5 hours ago | parent | prev [-]

I'm using it in english / albanian. Not much difference really. Impressive.