Remix.run Logo
fine_tune 5 days ago

I was going to argue "LLM's need code samples to-do well on languages and if we are honest C# is a language mostly held in private repo's" but Github's 2024 report[0] says its the 5th most used language (I'm to lazy to check if this report includes private repo's but I'll assume it doesn't).

So kinda neat to see this paper!

[0]https://github.blog/news-insights/octoverse/octoverse-2024/#...

CuriouslyC 5 days ago | parent | next [-]

The big labs are almost certainly using compiler/repl output for generated code as an oracle for RL. I doubt they have C# in the mix.

tomjakubowski 5 days ago | parent [-]

Why do you doubt that? It's a widely used language. And there is even an open source C# REPL.

5 days ago | parent | next [-]
[deleted]
CuriouslyC 5 days ago | parent | prev [-]

Because RL time is expensive and I don't think the languages which are more popular than C# have such high performance that it's worth bumping their batches for C#.

stingraycharles 5 days ago | parent [-]

But C# is a typical enterprise language which has people who are willing to pay a lot of money for AI.

We’re just guessing and the fact of the matter is that we don’t know what inputs they use for their models.

yieldcrv 5 days ago | parent | prev [-]

5th most used language based on private repos that the group making the report has the exclusive direct access to seeing

I don't see that contradicting your assumption

BoorishBears 5 days ago | parent [-]

"In this year’s Octoverse report, we study how public and open source activity on GitHub..."