lukev 4 hours ago

As has been said, actual evals are needed here.

Anecdotally, the worst and most common failure mode is when an agent starts spinning its wheels: it tries unproductively to fix some error, fails, iterates wildly, and eventually lands on a bullshit “solution” (if any).

In my experience, in TypeScript, these “spin out” situations are almost always type-related and often involve a lot of really horrible “any” casts.
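
To make the “any” cast problem concrete, here is a minimal sketch of the shortcut next to a checked alternative. The `User` shape and the function names are hypothetical, invented only for illustration; this is not code from any agent transcript.

```typescript
// Hypothetical shape, used only for illustration.
interface User {
  id: number;
  name: string;
}

// The shortcut: `any` silences the compiler, but nothing is checked at runtime.
function parseUserUnsafe(json: string): User {
  const data: any = JSON.parse(json);
  return data; // compiles even if `data` looks nothing like a User
}

// Type guard that narrows `unknown` to `User` by inspecting the value.
function isUser(value: unknown): value is User {
  if (typeof value !== "object" || value === null) return false;
  const record = value as Record<string, unknown>;
  return typeof record.id === "number" && typeof record.name === "string";
}

// The checked version: start from `unknown` and fail loudly on a bad payload.
function parseUser(json: string): User {
  const parsed: unknown = JSON.parse(json);
  if (!isUser(parsed)) {
    throw new Error("payload is not a User");
  }
  return parsed;
}
```

The unsafe version happily returns a “User” whose `id` is a string; the checked version throws at the boundary, which is where the error is cheapest to diagnose.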

resonious 3 hours ago | parent | next [-]

Right, I've noticed agents are very trigger happy with 'any'.

I have had a good time with Rust. It's not nearly as easy to skirt the type system in Rust, and I suspect the culture is also more disciplined when it comes to 'unwrap' and proper error management. I find I don't have to explicitly say "stop using unwrap" nearly as often as I have to say "stop using any".

smackeyacky 3 hours ago | parent | next [-]

Experienced devs coming into TypeScript are also trigger happy with 'any' until they work out what's going on. Especially if they've come from JavaScript.

rossjudson 35 minutes ago | parent | prev | next [-]

LLMs are minimizing energy to solve problems, and if they can convince the human to go away happy with 'any', so be it.

There's a fine line between gradient descent, pedantry, and mocking. I suspect we will learn more about it.

monkpit 32 minutes ago | parent | prev [-]

I’ve tried enforcing no-explicit-any just to have the agent disable the linter rule. I guess I didn’t say you couldn’t do that…
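
One way to catch that move, sketched under assumptions: scan the agent's changed files for inline ESLint disable comments before accepting them. `findLintSuppressions` is a hypothetical helper, not part of ESLint. It is purely text-based, so it can flag a directive that only appears inside a string literal, and it does not notice edits to the lint config file itself.

```typescript
// Hypothetical helper: returns the 1-based line numbers where an inline
// comment disables the given ESLint rule, or disables every rule.
function findLintSuppressions(source: string, rule: string): number[] {
  const hits: number[] = [];
  // Matches eslint-disable, eslint-disable-line and eslint-disable-next-line,
  // capturing the rule list that follows (up to a closing `*` or end of line).
  const directive = /eslint-disable(?:-next-line|-line)?\b([^*\n]*)/;

  source.split("\n").forEach((line, index) => {
    const match = directive.exec(line);
    if (!match) return;
    // Drop an optional "-- reason" suffix, then normalise whitespace.
    const listed = (match[1] ?? "").replace(/--.*$/, "").trim();
    const rules = listed.split(",").map((name) => name.trim());
    // A bare directive (empty rule list) disables every rule.
    if (listed === "" || rules.includes(rule)) {
      hits.push(index + 1);
    }
  });

  return hits;
}
```

A pre-commit or CI step could fail whenever this returns a non-empty list for `@typescript-eslint/no-explicit-any`. ESLint's own `--no-inline-config` flag is a blunter alternative that ignores inline directives altogether.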

energy123 3 hours ago | parent | prev | next [-]

The question can be asked two ways:

(1) Are current LLMs better at vibe coding typed languages, under some assumptions about user workflow?

(2) Are LLMs as a technology more suited to typed languages in principle, and should RL pipelines gravitate that way?

mewpmewp2 4 hours ago | parent | prev [-]

This is why I have a very specific ruleset and linting for my LLMs, not allowing 'any' at all, plus other quality checks.

monkpit 30 minutes ago | parent | next [-]

Until the agent disables the linter rule without you noticing!

Mtinie 3 hours ago | parent | prev [-]

Is this a shareable ruleset? I would completely understand if not but I’m interested in learning new ways to interact with my tools.