simonw 3 hours ago

They later said: https://twitter.com/TheAmolAvasare/status/204672549859272297...

> When we do land on something, if it affects existing subscribers you'll get plenty of notice before anything changes. Will hear it from us, not a screenshot on X or Reddit.

If you don't want things like this spreading through screenshots on X and Reddit, don't run "tests" like this in the first place!

(Also, "if it affects existing subscribers" is a cop-out: I need to know the pricing of Claude Code for NEW subscribers if I'm going to adopt it at a company with a growing team, recommend it to other people, write tutorials, etc.)

abtinf 3 hours ago | parent | next [-]

That tweet only makes things worse. On top of all their other nonsense recently, it actually convinced me to cancel my subscription.

I can't trust Anthropic to manage their products in a way that supports my workflow.

trueno 3 hours ago | parent | next [-]

pretty much none of these big providers are offering the guarantees needed to be taken seriously in workplaces right now. the technology itself isn't offering the deterministic guarantees that would warrant it in the workplace right now. problem is everyone's foot is just on the gas. even if your workplace isn't paying for it, people are just straight up rolling their own personal claude accounts to do work at orgs.

i've been trying to make the case all year that if we're going to let employees do shit with ai, let's try claude. in the past like.. 2-3 weeks all that goodwill has basically evaporated.

local inference needs to take off asap because all of these entities actually suck and i wouldn't trust a single sla with anthropic. they are not acting like a serious company right now, this is a joke.

kelsey98765431 3 hours ago | parent | prev [-]

I just cancelled before seeing this news. I was already pissed about constantly hitting limits on the $20-a-month plan and looking for alternatives, and this seals the deal. Bye bye!

anakaine 3 hours ago | parent [-]

I just paid for Pro for the first time 24 hours ago. It's been great, but the limits are crazy. It's nice not dealing with ChatGPT's sycophantic gaslighting, and not having random bugs.

That said, I seem to be caught in that 2% test if I open in a private tab. What nonsense. I wouldn't be paying for Claude if it weren't for its quality abilities, which necessarily include Claude Code.

minimaxir 3 hours ago | parent | prev | next [-]

A/B tests only work if the subjects don't realize they are in an A/B test.

abtinf 2 hours ago | parent | next [-]

Perhaps vibe coding the A/B testing engine isn't the best idea.

inetknght 2 hours ago | parent | prev [-]

Solution: don't A/B test your users.

A/B testing people without their informed consent is immoral, unethical, and should be illegal.

skeledrew 2 hours ago | parent | next [-]

To play devil's advocate, without A/B testing a lot of decisions would be made with insufficient relevant data, and lead to subpar results that negatively affect many people down the road.

wat10000 2 hours ago | parent [-]

A lot of decisions made with A/B testing are also made with insufficient relevant data, but it's less obvious since it's easy to think the A/B results cover everything.

shimman 2 hours ago | parent | prev | next [-]

Agreed, and I can't wait until they regulate this stuff out of existence. It's an absolutely hostile software technique that is deeply anti-human.

vehemenz 2 hours ago | parent | prev [-]

Depends entirely on the stakes and whether personal data is involved

inetknght 2 hours ago | parent [-]

> Depends entirely on the stakes and whether personal data is involved

Sure. Let me just A/B test whether or not you'll respond positively or negatively to having your news delivered via push notification or delayed by 10 minutes.

I'm sure you would appreciate being tested on without your consent, just so that I can make an extra quick buck at your expense. Nothing immoral or unethical about it.

pitched 2 hours ago | parent [-]

What do you think about slow rollouts for new features? Like, we think this new push notification system will be loved, but let’s ship to only 1% of users in case there’s a horrible unforeseen consequence like occasional 10-minute delays? Dashboard goes upside down -> revert, then work through logs to figure out what the hell went wrong.

sally_glance 3 hours ago | parent | prev [-]

Maybe a silly bet where the head of sales had 1-2 glasses of wine too many... "I bet they will still pay us 20 bucks/mo without CC! Don't believe me? I'm going to prove it!"