| ▲ | theshrike79 7 hours ago | |||||||||||||||||||||||||
Should be possible with optimised models, just drop all "generic" stuff and focus on coding performance. There's no reason for a coding model to contain all of ao3 and wikipedia =) | ||||||||||||||||||||||||||
| ▲ | jstummbillig 5 hours ago | parent | next [-] | |||||||||||||||||||||||||
There is: It works (even if we can't explain why right now). If we knew how to create a SOTA coding model by just putting coding stuff in there, that is how we would build SOTA coding models. | ||||||||||||||||||||||||||
| ▲ | noveltyaccount 6 hours ago | parent | prev | next [-] | |||||||||||||||||||||||||
I think I like coding models that know a lot about the world. They can disambiguate my requirements and build better products. | ||||||||||||||||||||||||||
| ||||||||||||||||||||||||||
| ▲ | moffkalast 5 hours ago | parent | prev | next [-] | |||||||||||||||||||||||||
That's what Meta thought initially too, training codellama and chat llama separately, and then they realized they're idiots and that adding the other half of data vastly improves both models. As long as it's quality data, more of it doesn't do harm. Besides, programming is far from just knowing how to autocomplete syntax, you need a model that's proficient in the fields that the automation is placed in, otherwise they'll be no help in actually automating it. | ||||||||||||||||||||||||||
| ||||||||||||||||||||||||||
| ▲ | MarsIronPI 6 hours ago | parent | prev | next [-] | |||||||||||||||||||||||||
But... but... I need my coding model to be able to write fanfiction in the comments... | ||||||||||||||||||||||||||
| ▲ | wongarsu an hour ago | parent | prev [-] | |||||||||||||||||||||||||
Now I wonder how strong the correlation between coding performance and ao3 knowledge is in human programmers. Maybe we are on to something here /s | ||||||||||||||||||||||||||