Remix.run Logo
MichaelNolan a day ago

> 95% of Uber engineers now use AI tools monthly with 70% of committed code originating from AI.

Well, that’s to be expected when using AI tools becomes relevant in your performance evaluation.

fidotron a day ago | parent | next [-]

It's actually incredible the extent to which non devs imposing KPIs on devs underestimate how badly this will get gamed, whether it's AIs, PR/line counting or whatever.

darth_avocado a day ago | parent | next [-]

Gaming is one thing, fundamentally not understanding how engineering works will lead to shittier outcomes and cost the company in ways the management will never understand.

Management in the age of AI is falling for the doorman fallacy wrt engineering. If lines of code were the most valuable aspect of software engineering, my front end JavaScript intern would’ve been the most valuable person in the company. https://www.jaakkoj.com/concepts/doorman-fallacy

hnthrow0287345 a day ago | parent [-]

>Gaming is one thing, fundamentally not understanding how engineering works will lead to shittier outcomes and cost the company in ways the management will never understand.

That means nothing to them: they jump ship and find another job just like devs do. The whole industry has been musical chairs for a while.

joshuastuden a day ago | parent | prev | next [-]

Exactly. At Cerebras I know of several people who burn tokens on completely USELESS tasks (randomly changing pixels in an image) just to keep them high up on the token leaderboard.

I suspect the other tokenboard leaders are doing the same. They made the metric "token usage" (which is just a proxy for LOC) so that's what they're gonna get.

ambicapter a day ago | parent | prev | next [-]

Someone at my job uses AI tools to reformat his code...

SatvikBeri a day ago | parent | next [-]

I actually do this, but that's mostly because our team reviewed all the existing autoformatters for the relatively obscure language we use, and either really hated the formatting or found that they actually introduced errors!

i_love_retros a day ago | parent | prev [-]

My coworker said he does that too. Also have coworkers using AI to run git commands. Nothing fancy either- just pull, push, merge etc

phainopepla2 a day ago | parent | next [-]

Do you mean they tell an LLM "push this code", or that the LLM runs the commands as part of a larger agentic workflow. The former is crazy but the latter makes sense.

i_love_retros 13 hours ago | parent [-]

Former. Pull main and merge into my branch, resolve conflicts, etc

(Yes I forgot, they have the agent resolve git conflicts!)

fragmede a day ago | parent | prev [-]

When Claude says "Shall I push it", it's way easier to just respond "yes" than it is to open a new terminal and run git push, and if you're being graded on how much AI tokens you use, saying yes looks even better for your metrics!

jimbokun a day ago | parent | prev | next [-]

I think PRs is pretty good, IF

1. you sample a few to see that they are actually meaningful,

2. they go to prod and are validated without having to roll back.

Still needs to be managed. But it should be much easier for a manager to catch an engineer gaming PRs than something like AI use or lines of code.

dieortin a day ago | parent [-]

It’s very easy to split changes in more PRs than needed to boost the number.

Nuzzerino a day ago | parent | prev [-]

Easily fixable with another KPI to measure the gaming itself :P

p_stuart82 a day ago | parent | prev | next [-]

yeah and once the KPI is "how much AI did you use" instead of "what did you ship," the budget blowout writes itself. people will game the number.

miltonlost a day ago | parent | prev | next [-]

When managers and VPs all say, you must use AI or else you will not work here, then yes, people will use it.

Sherveen a day ago | parent | prev [-]

I don't understand this critique. (1) Did you previously think you weren't getting paid for doing what a company wants you to do, aka what THEY thought was productive? (2) Do you think all this AI generated code is useless?

Edit: y'all are some whiney folk, ain't ya?

RHSeeger a day ago | parent | next [-]

I think the point was that, when you make a metric goal of "you must use AI this much", then people will use AI even in ways that isn't adding to productivity.

arcanemachiner a day ago | parent | prev | next [-]

To answer your second question: Yes, much of it is worse than useless. The tools need guidance to produce useful output. If you use it poorly, you will get garbage output that may do more harm than good.

And your response does not address the point being made in the comment you replied to: Many people are being evaluated by how many tokens they burn, which is about as good a metric as lines of code written.

miyoji a day ago | parent | prev | next [-]

1) I think if the company I work for spends too much effort on things that aren't going to make money, they won't be able to pay me anymore, no matter what they "think" is productive. That's not how executives at companies like this make decisions, though.

2) Mostly, yes.

misterbwong a day ago | parent | prev | next [-]

I think parent is saying "% of code being generated by AI" is not a generally good, direct metric for business value. It's akin to the "we are pushing SO MUCH CODE" phase of early ai marketing.

If we're trying to measure the value of adopting tool, it's probably better to measure the ROI of that tool rather than the usage % of that tool, especially when usage is basically mandated.

To directly answer your questions:

1. You're being paid to create value for the business, which "doing what they think is productive" is a proxy for. You're not being paid to use a tool a high % of the time.

2. I doesn't seem like parent even commented on the quality of the code generated. I think anyone that uses it regularly can agree that: a) the code is not useless and b) all generated code is not immediately production ready c ) AI generation of code is an accelerant for software development

txru a day ago | parent | prev | next [-]

Goodhart's Law isn't a problem immediately. If you want more code to be written, and the only feasible way to write it to goals is to heavily use AI, then you might run into the problems of AI-generated code, and an infrastructure that's poorly architected and much less understood than it would've been ten years ago.

bobsomers a day ago | parent | prev | next [-]

Not OP, but:

1. At my level, the company is not just paying me to do a task the way they want it done, they are paying for my experience to orchestrate the best way to do it. They want an outcome, and I'm responsible for figuring out how to get to that outcome with the right balance of cost, correctness, etc. But yes, the most dystopian reality is what you said.

2. It's not useless, but the AI generated code is absolutely lower quality than what I would have written myself, but there is no desire to clean it up. Companies have always had a disastrously bad understanding of technical debt and they finally have tool they can shove down developers throats that trades even more velocity for even less quality. They're going to take that trade every single time.

jcgrillo a day ago | parent | prev | next [-]

> (1) ...getting paid for doing what a company wants you to do...?

At my previous company, when the thing they thought they wanted me to do (which was not the thing they actually wanted... but whatever) diverged from my values I quit. You can just do things.

> (2) Do you think all this AI generated code is useless?

Almost universally, yes. Especially in organizations that historically haven't been particularly careful about hiring and have a huge number of young, inexperienced people. There are exceptions but they're rare enough that throwing that particular baby out with the bathwater isn't a big loss.

danaw a day ago | parent | prev | next [-]

you're missing their point; LLM use is often a part of your evaluation at some of these larger companies and they expect you to use them heavily or you will get a lashing

skydhash a day ago | parent | prev [-]

GP just saying that any metric will be gamed and if you have some costs that is associated to that, it will grow. Let’s say you set some metric that says the most productive dev are the ones that has the most files changes, you can soon expect every function and structure to be its own file. Same if you say that sales commision are based on how much time you spend calling, expect the phone bills to grow a lot.