| ▲ | b--l 7 hours ago |
| Codex is one of the most infamous examples of slopware. Just having the window unhidden on my mac will cause it to use 100% of the GPU displaying the spinner message. THE SPINNER MESSAGE CAUSES 100% GPU USAGE ON AN MBP M5!! So any time you're waiting on the model (which is 90% of the time), your fans will be blasting (careful, don't use it on battery). The issue is on github and close to 6 months old. Probably since the release of vibe coded junk. I would literally fix it myself but it's closed source for whatever reason. There are many discussions about which model is better, or if vibe coding is even possible. I point you to the extent of what one of the most well funded, money flush, well staffed model making companies can do with vibe coding. To me a screwup this bad (where the CEO has already made it clear they're now "focussing on coding") indicates that there's something truly broken in the company. No one on polymarket expects them to have a leading model any time soon for example. It's a tragedy. The world needs competition to anthropic. |
|
| ▲ | jofzar 6 hours ago | parent | next [-] |
| > Codex is one of the most infamous examples of slopware Woah, let's not forget Claude code is right there |
| |
| ▲ | sambcui a minute ago | parent | next [-] | | I don’t know if you can resonate, but I feel like the Vibe Coded codex and Claude Code desktop apps are iterating way faster than they should be. | |
| ▲ | me551ah 2 hours ago | parent | prev | next [-] | | Claude is also weird for being the only coding assistant that for some reason doesn't support AGENTS.md. Codex, Amp, Cursor all of them support it and read from it, but not claude which forces it's users to use CLAUDE.md instead. The issue is the higest voted issue on their gitlab repo: https://github.com/anthropics/claude-code/issues/6235 | | |
| ▲ | ValentineC 2 hours ago | parent | next [-] | | My CLAUDE.md is just: @AGENTS.md
And Claude processes it just fine.(I see that it's a common workaround, and there's a comment in the above link saying just this: https://github.com/anthropics/claude-code/issues/6235#issuec...) It's a hassle having to add it to every repo that I use Claude with though, and I often use other models and harnesses too for the more trivial tasks. | |
| ▲ | chorkpop an hour ago | parent | prev | next [-] | | CLAUDE.md has been incredibly successful for them advertising wise. I wouldn’t expect them to admit AGENTS.md exists anytime soon. | |
| ▲ | anon373839 2 hours ago | parent | prev | next [-] | | Gee, I wonder why that is. Do you think Anthropic’s Claude Code team are just trying to protect humanity somehow? Maybe our mortal brains just can’t comprehend the damage that supporting a non-Claude-branded standard might do... | |
| ▲ | hexsprite 21 minutes ago | parent | prev [-] | | I created a Claude Code plugin to load AGENTS.md. Uses symlinks but it’s better than no support. https://github.com/hexsprite/claude-agents-md |
| |
| ▲ | kokada 4 hours ago | parent | prev | next [-] | | Not that Claude Code is much better, I just hit this issue[1] because it seems setting DO_NOT_TRACK=1 seems enough to get a really strange behavior in the newest versions of CC. [1]: https://github.com/anthropics/claude-code/issues/69238#issue... Edit: I think I misunderstood OP, they're saying that CC is even worse and not better than Codex CLI. | |
| ▲ | varjag 3 hours ago | parent | prev | next [-] | | Right, just yesterday I found my laptop kinda hot. And what do you think, it was good old Claude deciding to load a few cores with completely idling prompts. | | | |
| ▲ | mvATM99 5 hours ago | parent | prev | next [-] | | Yeah exactly. I'm not exactly building TUI's every day, but even i felt pain when i read that "small game engine" post | | |
| ▲ | matheusmoreira 3 hours ago | parent | next [-] | | At least game engines manage to render their frames properly. Claude Code sometimes eats entire paragraphs of text output, resulting in things such as numbered lists jumping from 2 to 4 out of nowhere. I'd just ask Claude to repeat himself at first but it happens so often that I actually made a little tool to dig up the output inside the session history and present it properly in a separate terminal. | |
| ▲ | TacticalCoder 3 hours ago | parent | prev [-] | | > I'm not exactly building TUI's every day, but even i felt pain when i read that "small game engine" post The bigger issue is they where somehow thinking it was "cool" and "advanced" while it's just a kludgy rube-goldbergy monstrous hack. Which is of course only semi-working: to me the model thinking what you see is what it outputs in the TUI is the deal-breaker for me. It's of course not working like that for they're apparently, in their "game engine", converting on the fly a headless browser to approximated characters to display in the terminal. So the model tells you he did output ASCII but people are copy/pasting (because, yes, at times you want to copy/paste) Unicode chars. Plenty of bug reports and pissed users. That's the bigger issue. The biggest issue is those thinking a 10 GB VM required to run a headless Electron browser and then fuxx0ring characters conversion is somehow an achievement. | | |
| |
| ▲ | iLoveOncall 3 hours ago | parent | prev [-] | | Surprisingly Kiro is fine (I work at Amazon but not at all on the Kiro team). I prefer it to anything else I've tried (except Amazon Q Developer in IntelliJ, but it's now deprecated). | | |
| ▲ | epistasis 32 minutes ago | parent [-] | | Kiro is surprisingly good, if the interface for saving and resuming was slightly more reliable, and there was the hope of remote sessions, I'd probably switch to it full time. I vastly prefer it to having to fight against buggy force-fed features like UltraPlan or whatever. |
|
|
|
| ▲ | r_lee 5 hours ago | parent | prev | next [-] |
| if we are at 10x with AI and near AGI or ASI, then how is it possible that these products (Codex, Claude Code CLI) are still such garbage? shouldn't this "agentic AI revolution" have long solved this already? no way they're over there saying "we are on it plz wait" or that "it's too much effort"? |
| |
| ▲ | igleria 4 hours ago | parent | next [-] | | This is the biggest elephant in the room I have seen in my decade+ career. At the same time, look how bad Apple is in software compared to its hardware... It's not an AI only problem, it's almost like software in general gets a free pass on being very unsafe or low quality because no one wants to face the same "profit reducing red tape" that civil engineers or similar face. | | |
| ▲ | CharlieDigital 4 hours ago | parent | next [-] | | Anthropic were the progenitors of the Model Context Protocol. Claude Code does not fully implement the client end of the protocol. A protocol; a literal pre-defined spec that an agent should be able to one-shot. Neither does Codex. Codex does not implement MCP Prompts. (I want Codex to implement MCP Prompts because then we have one central way to ship skills from a server). The fact that neither platform can implement a protocol given what is functionally infinite frontier model tokens really says a lot. I do not care what kind of random project some influencer can ship with a swarm of 1000 agents. If you cannot make the basics work, it is a farce. | | |
| ▲ | deathbob an hour ago | parent [-] | | It still boggles my mind that Anthropic would invent the MCP protocol but not fully implement it. Especially when fully implementing it (prompts, resources, tools) is easily done in harnesses that don’t ship with MCP but allow good extension / modification like Pi. Claude not being able to see its own usage or self invoke slash commands is also very frustrating. | | |
| |
| ▲ | forshaper 26 minutes ago | parent | prev [-] | | How much of all this is due to hardware improving, and software bloating enough to fill the capacity? |
| |
| ▲ | hombre_fatal 3 hours ago | parent | prev | next [-] | | Like anything, you have to decide between polish vs switch to any other task in the queue. If you choose too much from the latter, then polish suffers, yet that's a human thing. Also, Codex and Claude Code aren't as bad as people say. I think most of the noise is embellished by the "hah see? AI sucks" angle. It's kind of like how HNers would claim to your face that you can't actually build anything with Javascript and Node.js (JS just sucks too much), then they'd list off a few footguns that were supposed to demonstrate why. In other words, champing at the bit for JS to lead people to catastrophize issues that were pretty mediocre. | | |
| ▲ | geodel 5 minutes ago | parent | next [-] | | > yet that's a human thing. is this joke? Here we are talking about trillon dollar AI companies who claim AI can fix decade old bugs and create new compilers, OSs and what not. Are parallel agents working autonomously to fix issues as well as create new features not allowed at these companies? | |
| ▲ | coldtea 2 hours ago | parent | prev [-] | | >Like anything, you have to decide between polish vs switch to any other task in the queue Why do you "have to decide"? Let some agents go at both of those, isn't that what they claim people can just do? >Also, Codex and Claude Code aren't as bad as people say. I think most of the noise is embellished by the "hah see? AI sucks" angle. Why shouldn't it? They're not the ones making the extraordinary claims. | | |
| ▲ | hombre_fatal 2 minutes ago | parent [-] | | > Why do you "have to decide"? Let some agents go at both of those, isn't that what they claim people can just do? Because your code is still marching somewhere in tokens per second. You have to decide where they are allocated: polish or the next thing. Humans still are the ones prompting LLMs and deciding what is done. > isn't that what they claim? Why shouldn't it? They're not the ones making the extraordinary claims. Even if I grant that someone else makes excessive claims, why would that let you off the hook to make claims grounded in reality? |
|
| |
| ▲ | jeffybefffy519 3 hours ago | parent | prev | next [-] | | Because vibe coding is a toy… thats the secret. You can use it to accelerate development certainly, but that requires careful change->review cycles. The developer still needs to be in heavy control, versus vibe coding having an agent own the code base. | |
| ▲ | ValentineC an hour ago | parent | prev | next [-] | | The "AI revolution" feels like it's creating a bunch of ultra-smart AI models are scarily good at cracking most of human-created security (Mythos), but also happen to be careless snobs that just leave litter and mess in their wake. | |
| ▲ | mnicky 2 hours ago | parent | prev | next [-] | | If the code churn is high the investment to refactoring etc is less beneficial than may be obvious. I don't remember the details but I heard in some podcast that the code base of Claude Code changes so fast that any piece of code won't be there for long.. | | |
| ▲ | coldtea 2 hours ago | parent | next [-] | | In other words it's an ever moving vibe fest, with random bugs and misbehaviors each time they roll the dice... | | | |
| ▲ | an hour ago | parent | prev | next [-] | | [deleted] | |
| ▲ | tartoran an hour ago | parent | prev [-] | | If they respected their users they’d at least pin some versions that are more stable. |
| |
| ▲ | fg137 4 hours ago | parent | prev | next [-] | | You are asking too many good questions. | |
| ▲ | user43928 4 hours ago | parent | prev | next [-] | | The products generally work just fine on my MacBook. I have not encountered major issues in either the Claude Code CLI, the Codex Desktop app, or Claude Desktop app. They generally get the job done. I don't measure disk writes or analyze the GPU usage. | |
| ▲ | 3 hours ago | parent | prev | next [-] | | [deleted] | |
| ▲ | Zababa 3 hours ago | parent | prev [-] | | A simple explanation is that they are "good enough" for most people and they have better things to do. Even if tomorrow I was 100 times as productive, I still wouldn't have time to do literally everything and I would have to prioritize. | | |
| ▲ | coldtea 2 hours ago | parent [-] | | You might not. But the Claude Code team has ONE job. And they have full access to a platform that they advertise as "humanity-threat" level good, and claim that it can automate everything code related... | | |
| ▲ | Zababa an hour ago | parent [-] | | I think they have more than one job, they have to balance new features with improving the software itself. And Anthropic has to balance investing resources into Claude Code vs on infra or other things. Not that I'm happy with the current state of things, in fact I'm quite sad that improvements in capacity to do things doesn't translate into better quality. |
|
|
|
|
| ▲ | giancarlostoro an hour ago | parent | prev | next [-] |
| > It's a tragedy. The world needs competition to anthropic. I agree, though Sam Altman's company is the last option I'd want to replace Claude with. I would sooner exhaust every open model. |
|
| ▲ | CryZe an hour ago | parent | prev | next [-] |
| > THE SPINNER MESSAGE CAUSES 100% GPU USAGE ON AN MBP M5!! This seems to be a common Chromium problem across tons of software. GitHub has the same issue with its spinners, VSCode as well. |
|
| ▲ | nicce 5 hours ago | parent | prev | next [-] |
| Not only Codex, but I can't leave ChatGPT app in macOS open for few hours, because it will consume 60 gigabytes of RAM over time and crashes all the apps. Mindboggling. Or can't use Google's AI Studio in browser because it takes 100% CPU. Need to write own app for everything??? |
| |
| ▲ | veber-alex 3 hours ago | parent | next [-] | | ChatGPT works ok for me but Whatsapp consumes 1000% cpu after the mac wakes up after sleep. I swear a few years ago shit like this didn't happen on macOS. | | |
| ▲ | coldtea 2 hours ago | parent [-] | | A few years ago vibe-coded crap apps like that didn't exist on macOS. |
| |
| ▲ | porridgeraisin 5 hours ago | parent | prev [-] | | the damn chat.openai.com webapp lags a lot as well on long chats, typing takes so long. | | |
| ▲ | rsfern 2 hours ago | parent [-] | | In my experience the input field lags on short chats too, sometimes in the middle of writing the second or third prompt. Are they running some kind of prospective evaluation or something? | | |
| ▲ | nicce 2 hours ago | parent [-] | | When you are writing completely new prompt - it sends every character to server when writing and tries to make suggestions based on that. And keeps doing it in intervals in /prepare endpoint, during each prompt. So if you are working with something sensitive - don't write it to browser directly and edit it there. | | |
|
|
|
|
| ▲ | markdog12 2 hours ago | parent | prev | next [-] |
| This software has been terrible for me. Burns tokens like crazy, and fails. Most times I try to use the browser plugin, it just says it can't use the plugin. When it does work, it takes minutes to click a button. Unusable workflow. I ask to generate a png with an alpha channel. It can't. Instead, it outputs a chroma-keyed image, then generates a python script to remove chroma key (fails), then a js script (which also fails). Then my 5h allotment is up. It's frustrating because if it worked as they advertise, it'd be an amazing tool. |
| |
| ▲ | EMM_386 2 hours ago | parent [-] | | Although they can technically do it, I wouldn't be asking LLMs to generate binary files like PNG with alpha channels, no matter how simple that may seem. If it's easy enough to manually create one yourself, I would do that. The best way for LLMs to do this is likely to write a scratch program (which is what it seems to have reached for in the second half), write code (which they are good at) and have the library create the image. At some point it is just easier to handle such things yourself, and use them with text-based formats. |
|
|
| ▲ | xpct 5 hours ago | parent | prev | next [-] |
| Well thank you for your service. I thought about trying out Codex after the disaster that is Claude Code. I'll be fine without either one on my machine |
| |
| ▲ | jofzar 5 hours ago | parent | next [-] | | Imo codex is significantly better then Claude code for me ATM. | |
| ▲ | christophilus 3 hours ago | parent | prev | next [-] | | Codex is much better, which is to say, it’s only pretty bad. | |
| ▲ | comboy 5 hours ago | parent | prev [-] | | I mean, Codex CLI is really bad. But Claude's CLI is so much worse. Welcome to the world of tomorrow! |
|
|
| ▲ | an hour ago | parent | prev | next [-] |
| [deleted] |
|
| ▲ | ljlolel an hour ago | parent | prev | next [-] |
| Building an open source native swift version that doesn’t have that bug: https://github.com/Lore-Hex/Quillcode |
|
| ▲ | l33tman 6 hours ago | parent | prev | next [-] |
| This was fixed long ago, if I'm thinking of the same bug. It was stuck in an inf loop all the time the codex window was open. |
| |
| ▲ | cncjvu7 6 hours ago | parent [-] | | Nah it's still doing weird shit. Uninstalled that crapware last week. |
|
|
| ▲ | seviu 5 hours ago | parent | prev | next [-] |
| To be fair with Codex, you can use any harness you want with it. Access is not gatekeeper by a crappy full of slop electron app. So just move to PI, or whatever. Claude on the contrary, forces all plan users to use their horrible app, which, if you ever dared to use cowork, only once, will run a 2GB VM on app start, no f's given. at all. Not justifying it. But if you use the official Codex app, thats on you. If you use the official Claude app, it's because you are forced to. Sidenote unrelated to the post: since the Fable thing, and after serious thinking, I moved to open source models. I still have the basic OpenAI sub, but then easy lifting is now done elsewhere. |
| |
| ▲ | drdexebtjl 8 minutes ago | parent | next [-] | | I haven’t ever tried Cowork, and Claude Desktop shipped a 10 GB VM image on the tiny internal storage of my Macbook. No way to remove it without hacks like creating an empty, read-only file in its place. Having this slop installed and automatically updating is a liability. | |
| ▲ | coldtea 2 hours ago | parent | prev [-] | | >if you ever dared to use cowork, only once, will run a 2GB VM on app start, no f's given. at all. Of all the issues, this seems like the most tame. I mean, there are single Chrome tabs that can use 300MB or even 700MB. A 2GB VM for what is likely isolated local testing of scripts and commands or local lightweight first-level inference to help guide the main harness sounds reasonable. |
|
|
| ▲ | xenator 3 hours ago | parent | prev | next [-] |
| I have exactly the same problem with Time Machine spinner on macOS. It even doesn't rotate. Somewhere should be rare specialists with diploma who are capable of fixing such problems with waiting lists for years ahead. |
|
| ▲ | tengada1 2 hours ago | parent | prev | next [-] |
| I had the exact same frustration and switched to Pi and have had zero complaints |
|
| ▲ | hokkos 5 hours ago | parent | prev | next [-] |
| is it closed source ? i can see the rust code in repo contrary to the JS in claude code repo, are you mixing them up ? |
| |
| ▲ | nicce 5 hours ago | parent [-] | | Codex CLI is the main Rust code. There is Codex Desktop separately, using Electron and the same Codex CLI. |
|
|
| ▲ | NamlchakKhandro 2 hours ago | parent | prev | next [-] |
| Pi mono is the only true harness. Everything else is crap |
| |
|
| ▲ | jorl17 2 hours ago | parent | prev | next [-] |
| Claude code (desktop) and Codex (desktop) are both absolutely dogshit pieces of software. I can't pick which one is worse. I'd be sort of ashamed to say I actively worked on them, regardless of how they can empower people. Cursor's new UI is similarly terrible. They're all slowly getting better, but too slow for my taste. They are incredibly slow in unpredictable ways, eat up memory at an insane rate, and just feel like they were built with no regards for UX. Like they crammed together all the engineers with no idea of how to build a coherent and predictable UI and let them loose on the product without proper designers. The other day Codex (desktop) was eating up 70GB of RAM on my machine. What had I done? Literally nothing. I opened it and let it update once. Another one with Codex was when I had a specific conversation where no activity was happening and which would make the app spin up all of my CPU cores, rendering it barely usable. It would take seconds to react to anything or update the UI. The conversation wasn't even in focus!!! Restarting the app wouldn't help. After I archived it, it suddenly got better Claude Code Desktop used to be so, soo, soo slow and eat up so much RAM. It was unusable for anything other than playing around when I first tried it. It also didn't communicate any of what it would do. Using it was like living in a world with no affordances, constantly afraid of interacting with them and being faced with some sort of destructive action. Still, it has definitely been improving in terms of the UI experience. Cursor's new agents mode suffers from similar issues. Obscenely slow, hogging CPU without anything going on, breaking with existing UX patterns (some of them already well implemented in their other, more polished, previous version), confusing buttons and labels which don't explain what to do and that sometimes do destructive operations on your code. My favorite cursor absurdity is that if you use their workflow to create a worktree and the worktree setup script fails, the following happens: 1. The agent has no idea that it failed, let alone have any logs of the failure 2. Often you yourself don't get access to the logs of what failed in that script. Don't ask me, half the time it just says it failed with no further logs. 3. When you do get the logs, you cannot copy them in ANY way. You can't even select them. I have had to resort to taking a screenshot to do OCR on it I've also had cursor repeatedly have concurrency/race condition bugs when creating multiple worktrees in parallel. I have 5 tasks, I spin them up all together so they can create 5 worktrees and they crash with random internal cursor errors. Wasn't the point of this abhorrent new UI you've stuffed me with to enable parallelism? It's like people aren't even testing the shit they ship. Which I guess they aren't. I'm a big believer in AI and think it is changing the world and will continue to do so, but I almost get offended at how bad these products for which I am paying (sometimes quite a lot!) are. There's "move fast and break stuff" and then there's "build crap to call stuff". |
| |
|
| ▲ | stellamariesays an hour ago | parent | prev | next [-] |
| [flagged] |
|
| ▲ | iluvcommunism 14 minutes ago | parent | prev | next [-] |
| [dead] |
|
| ▲ | Trialog an hour ago | parent | prev | next [-] |
| [dead] |
|
| ▲ | energy123 4 hours ago | parent | prev [-] |
| Let me guess, there's also a bug where they train on all our data? |
| |
| ▲ | varjag 3 hours ago | parent [-] | | They don't need to. You pay them for the privilege to do black box reinforcement learning already. |
|