| ▲ | ronbenton 2 days ago |
| I am used to seeing technical papers from IEEE, but this is an opinion piece? I mean, there is some anecdata and one test case presented to a few different models, but nothing more. I am not necessarily saying the conclusions are wrong, just that they are not really substantiated in any way. |
|
| ▲ | wavemode a day ago | parent | next [-] |
| To be fair, it's very rare that articles praising the power of AI coding assistants are ever substantiated, either. In the end, everyone is kind of just sharing their own experiences. You'll only know whether they work for you by trying it yourself. |
| |
| ▲ | mrguyorama a day ago | parent | prev | next [-] | | > You'll only know whether they work for you by trying it yourself. But at the same time, even this doesn't really work. The lucky gambler thinks lottery tickets are a good investment. That does not mean they are. I've found very very limited value from these things, but they work alright in those rather constrained circumstances. | |
| ▲ | franktankbank a day ago | parent | prev [-] | | And for the most part you can't try it out without feeding the training machine, for free at best. | | |
| ▲ | Leynos a day ago | parent | next [-] | | Codex and Claude Code allow you to opt out of model training. Perhaps you don't believe OpenAI and Anthropic when they say this, but it is a requirement upon which most enterprise contracts are predicated. | |
| ▲ | pc86 a day ago | parent | prev [-] | | Are there a lot of products or services you can try out without using the product or service? | | |
|
|
|
| ▲ | esafak a day ago | parent | prev | next [-] |
| This is the Spectrum magazine; the lighter fare. https://en.wikipedia.org/wiki/IEEE_Spectrum |
|
| ▲ | troyvit a day ago | parent | prev | next [-] |
| Yeah I saw the ieee.org domain and was expecting a much more rigorous post. |
| |
| ▲ | ronbenton a day ago | parent [-] | | This may be a situation where Hacker News' shorthand of omitting the subdomain is not good. spectrum.ieee.org appears to be more of a newsletter or editorial part of the website, but you wouldn't know that's what this was just based on the HN tag. | | |
| ▲ | preommr a day ago | parent | next [-] | | I've been on this site for over a decade now and didn't know this. That's a genuinely baffling decision given how different content across subdomains can be. | |
| ▲ | badc0ffee a day ago | parent | prev | next [-] | | Maybe an exception could be made here, like HN does for medium.com. | |
| ▲ | bee_rider a day ago | parent | prev [-] | | On the other hand, “ieee spectrum” is directly at the top of the page, then “guest article.” | | |
| ▲ | ronbenton a day ago | parent [-] | | Well, as much as I'm sure HN is a special place ;) , it is well documented that a lot of people on the internet just read the headlines | | |
|
|
|
|
| ▲ | causal a day ago | parent | prev | next [-] |
| And the example given was specific to OpenAI models, yet the title is a blanket statement. I agree with the author that GPT-5 models are much more fixated on solving exactly the problem given and not as good at taking a step back and thinking about the big picture. The author also needs to take a step back and realize other providers still do this just fine. |
| |
| ▲ | wavemode a day ago | parent [-] | | He tests several Claude versions as well | | |
| ▲ | causal a day ago | parent [-] | | Ah you're right, scrolled past that - the most salient contrast in the chart is still just GPT-5 vs GPT-4, and it feels easy to contrive such results by pinning one model's response as "ideal" and making that a benchmark for everything else. |
|
|
|
| ▲ | verdverm a day ago | parent | prev [-] |
| And they are using OpenAI models, who haven't had a successful training run since Ilya left; GPT 5x is built on GPT 4x, not from scratch, aiui. I'm having a blast with gemini-3-flash and a custom Copilot replacement extension. It's much more capable than Copilot ever was with any model for me, plus a personalized dx with deep insights into my usage and what the agentic system is doing under the hood. |
| |
| ▲ | RugnirViking 17 hours ago | parent [-] | | Can you talk a little more about your replacement extension? I get Copilot from my workplace and I'd love to know what I can do with it. I've been trying to build some containerized stuff with Copilot CLI, but I'm worried I have to give it a little more permissions than I'm comfortable with around git etc. |
|