| ▲ | solarkraft 5 hours ago | ||||||||||||||||||||||
Smells like hyperbole. A lot of people making such claims don’t seem to have continued real world experience with these models or seem to have very weird standards for what they consider usable. Up until relatively recently, while people had already long been making these claims, it came with the asterisks of „oh, but you can’t practically use more than a few K tokens of context“. | |||||||||||||||||||||||
| ▲ | derekp7 4 hours ago | parent | next [-] | ||||||||||||||||||||||
"Create a single page web app scientific RPN calculator" Qwen 3.5 122b/a10b (at q3 using unsloth's dynamic quant) is so far the first model I've tried locally that gets a really usable RPN calculator app. Other models (even larger ones that I can run on my Strix Halo box) tend to either not implement the stack right, have non-functional operation buttons, or most commonly the keypad looks like a Picasso painting (i.e., the 10-key pad portion has buttons missing or mapped all over the keypad area). This seems like such as simple test, but I even just tried it in chatgpt (whatever model they serve up when you don't log in), and it didn't even have any numerical input buttons. Claude Sonet 4.6 did get it correct too, but that is the only other model I've used that gets this question right. | |||||||||||||||||||||||
| |||||||||||||||||||||||
| ▲ | tempest_ 5 hours ago | parent | prev | next [-] | ||||||||||||||||||||||
Qwen3-Coder-30B-A3B-Instruct is good I think for in line IDE integration or operating on small functions or library code but I dont think you will get too far with one shot feature implementation that people are currently doing with Claude or whatever. | |||||||||||||||||||||||
| |||||||||||||||||||||||
| ▲ | __mharrison__ 4 hours ago | parent | prev [-] | ||||||||||||||||||||||
I used the 35b model to create a polars implementation of PCA (no sklearn or imports other than math and polars). In less than 10 minutes I had the code. This is impressive to me considering how poorly all models were with polars until very recently. (They always hallucinated pandas code.) | |||||||||||||||||||||||