> “The AI hallucinated. I never asked it to do that.”

> That’s the defense. And here’s the problem: it’s often hard to refute with confidence.

Why is it necessary to refute it at all? It shouldn't matter, because whoever is producing the work product is responsible for it, no matter whether genAI was involved or not.

▲ nerdsniper 9 hours ago | parent | next [-]

The distinction some people are making is between copy/pasting text and agentic action. Generally, mistakes in "work product" (as in output from ChatGPT that the human then files with a court, etc.) are not forgiven, because if you signed the document, you own its content. Contrast that with some vendor-provided AI agent which simply takes an action on its own that a "reasonable person" would not have expected it to take. Often we forgive those kinds of software bloopers.

▲

Wobbles42 6 hours ago | parent | next [-]

"Agentic action" is just running a script. All that's different is now people are deploying scripts that they don't understand and can't predict the outcome of.

It's negligence, pure and simple. The only reason we're having this discussion is that a trillion dollars was spent writing said scripts.

▲

iterance 2 hours ago | parent | prev | next [-]

If I hire an engineer and that engineer authorizes an "agent" to take an action, if that "agentic action" then causes an incident, guess whose door I'm knocking on?

Engineers are accountable for the actions they authorize. Simple as that. The agent can do nothing unless the engineer says it can. If the engineer doesn't feel they have control over what the agent can or cannot do, under no circumstances should it be authorized. To do so would be alarmingly negligent.

This extends to products. If I buy a product from a vendor and that product behaves in an unexpected and harmful manner, I expect that vendor to own it. I don't expect error-free work, yet nevertheless "our AI behaved unexpectedly" is not a deflection, nor is it satisfactory when presented as a root cause.

▲

ori_b 9 hours ago | parent | prev | next [-]

If you put a brick on the accelerator of a car and hop out, you don't get to say "I wasn't even in the car when it hit the pedestrian".

▲

Shalomboy 9 hours ago | parent [-]

This is true for bricks, but it is not true if your dog starts up your car and hits a pedestrian. Collisions caused by non-human drivers are a fascinating edge case for the times we're in.

▲

jacquesm 7 hours ago | parent | next [-]

It is very much true for dogs in that case: (1) it is your dog (2) it is your car (3) it is your responsibility to make sure your car can not be started by your dog (4) the pedestrian has a reasonable expectation that a vehicle that is parked without a person in it has been made safe to the point that it will not suddenly start to move without an operator in it and dogs don't qualify.

You'd lose that lawsuit in a heartbeat.

▲

direwolf20 7 hours ago | parent [-]

what if your car was parked in a normal way that a reasonable person would not expect to be able to be started by a dog, but the dog did several things that no reasonable person would expect and started it anyway?

▲

jacquesm 7 hours ago | parent | next [-]

You can 'what if' this until the cows come home but you are responsible, period.

I don't know what kind of drivers education you get where you live but where I live and have lived one of the basic bits is that you know how to park and lock your vehicle safely and that includes removing the ignition key (assuming your car has one) and setting the parking brake. You aim the wheels at the kerb (if there is one) when you're on an incline. And if you're in a stick shift you set the gear to neutral (in some countries they will teach you to set the gear to 1st or reverse, for various reasons).

We also have road worthiness assessments that ensure that all these systems work as advertised. You could let a pack of dogs loose in my car in any external circumstance and they would not be able to move it, though I'd hate to clean up the interior afterwards.

▲

direwolf20 6 hours ago | parent [-]

I agree. The dog smashed the window, hot-wired the ignition, released the parking brake, shifted to drive, and turned the wheel towards the opposite side of the road where a mother was pushing a stroller, killing the baby. I know, crazy right, but I swear I'm not lying, the neighbor caught it on camera.

Who's liable?

I think this would be a freak accident. Nobody would be liable.

	▲	bigstrat2003 27 minutes ago | parent | next [-]
		Your analogy has long since ceased to have any illuminating power, because it involves things that are straight up impossible.
	▲	rdtsc 2 hours ago | parent | prev | next [-]
		Well at that point we might as well say it's gremlins that you summoned, so who knows, there are no laws about gremlins hot-wiring cars. If you summoned them, are they _your_ gremlins, or do they have their own agency. How guilty are you, really... At some point it becomes a bit silly to go into what-if scenarios, it helps to look at exact cases.
	▲	jacquesm 6 hours ago | parent | prev | next [-]
		> I agree. The dog smashed the window, hot-wired the ignition, released the parking brake, shifted to drive, and turned the wheel towards the opposite side of the road where a mother was pushing a stroller, killing the baby. I know, crazy right, but I swear I'm not lying, the neighbor caught it on camera.
		> Who's liable?
		You are. It's still your dog. If you replaced the dog with a child the case would be identical (but more plausible). This is really not as interesting as you think it is. The claim that you have a sentient dog is going to be laughed out of court, and your neighbor will be in the dock together with you for attempting to mislead the court with your AI-generated footage. See, two can play at that. When you make such ridiculously contrived examples, turnabout is fair play.
	▲	gamblor956 5 hours ago | parent | prev [-]
		You would not be guilty of a crime, because that requires intent. But you would be liable for civil damages, because that does not. There are multiple theories for which to establish liability, but most likely this would be treated as negligence.

▲

thatjoeoverthr 4 hours ago | parent | prev [-]

You're stretching it. It's more like if you train your dog to start the car and accelerate, open the door and turn your back.

Everything an AI does is downstream of deliberate, albeit imperfect, training.

You know this, you rig it all up and you let things happen.

▲

Terr_ 2 hours ago | parent | prev | next [-]

Being guilty != Being responsible

They correlate, but we must be careful not to mistake one for the other. The latter is a lower bar.

▲

b00ty4breakfast 5 hours ago | parent | prev | next [-]

I'm dubious, do you have any examples of this happening?

▲

victorbjorklund 9 hours ago | parent | prev | next [-]

I don’t know where you're from, but at least in Sweden you have strict liability for anything your dog does.

▲

ori_b 8 hours ago | parent | prev | next [-]

In the USA, at least, it seems pet owners are liable for any harm their pets do.

▲

cess11 8 hours ago | parent | prev | next [-]

Legally, in a lot of jurisdictions, a dog is just your property. What it does, you did, usually with presumed intent or strict liability.

▲

gowld 8 hours ago | parent [-]

What if you planted a bush that attracted a bat that bit a child?

	▲	Muromec 7 hours ago | parent | next [-]
		What if you have an email in your inbox warning you that 1) this specific bush attracts bats and 2) there were in fact bats seen near your bush and 3) bats were observed almost biting a child before. And you also have "how do I fuck up them kids by planting a bush that attracts bats" in your browser history. It's a spectrum you know.
	▲	dragonwriter 7 hours ago | parent | prev | next [-]
		Well, if it was a bush known to also attract children, it was on your property, and the child was in fact attracted by it and also on your property, and the presence of the bush created the danger of bat bites, the principle of “attractive nuisance” is in play.
	▲	b00ty4breakfast 5 hours ago | parent | prev [-]
		what if my auntie had wheels, would she be a wagon?

▲

freejazz 9 hours ago | parent | prev [-]

Prima facie negligence = liability

▲

observationist 9 hours ago | parent | prev | next [-]

To me, it's 100% clear - if your tool use is reckless or negligent and results in a crime, then you are guilty of that crime. "It's my robot, it wasn't me" isn't a compelling defense - if you can prove that it behaved significantly outside of your informed or contracted expectations, then maybe the AI platform or the Robot developer could be at fault. Given the current state of AI, though, I think it's not unreasonable to expect that any bot can go rogue, that huge and trivially accessible jailbreak risks exist, so there's no excuse for deploying an agent onto the public internet to do whatever it wants outside direct human supervision. If you're running moltbot or whatever, you're responsible for what happens, even if the AI decided the best way to get money was to hack the Federal Reserve and assign a trillion dollars to an account in your name. Or if Grok goes mechahitler and orders a singing telegram to Will Stancil's house, or something. These are tools; complex, complicated, unpredictable tools that need skillful and careful use.

There was a notorious dark web bot case where someone created a bot that autonomously went onto the dark web and purchased numerous illicit items.

https://wwwwwwwwwwwwwwwwwwwwwwwwwwwwwwwwwwwwwwwwww.bitnik.or...

They bought some ecstasy, a hungarian passport, and random other items from Agora.

>The day after they took down the exhibition showcasing the items their bot had bought, the Swiss police “arrested” the robot, seized the computer, and confiscated the items it had purchased. “It seems, the purpose of the confiscation is to impede an endangerment of third parties through the drugs exhibited, by destroying them,” someone from !Mediengruppe Bitnik wrote on their blog.

In April, however, the bot was released along with everything it had purchased, except the ecstasy, and the artists were cleared of any wrongdoing. But the arrest had many wondering just where the line gets drawn between human and computer culpability.

▲

dragonwriter 7 hours ago | parent | next [-]

> To me, it's 100% clear - if your tool use is reckless or negligent and results in a crime, then you are guilty of that crime.

For most crimes, this is circular, because whether a crime occurred depends on whether a person did the requisite act of the crime with the requisite mental state. A crime is not an objective thing independent of an actor that you can determine happened as a result of a tool and then conclude guilt for based on tool use.

And for many crimes, recklessness or negligence as mental states are not sufficient for the crime to have occurred.

	▲	rmunn 5 hours ago | parent [-]
		For negligence that results in the death of a human being, many legal systems make a distinction between negligent homicide and criminally negligent homicide. Where the line is drawn depends on a judgment call, but in general you're found criminally negligent if your actions are completely unreasonable. A good example might be this. In one case, a driver's brakes fail and he hits and kills a pedestrian crossing the street. It is found that he had not done proper maintenance on his brakes, and the failure was preventable. He's found liable in a civil case, because his negligence led to someone's death, but he's not found guilty of a crime, so he won't go to prison. A different driver was speeding, driving at highway speeds through a residential neighborhood. He turns a corner and can't stop in time to avoid hitting a pedestrian. He is found criminally negligent and goes to prison, because his actions were reckless and beyond what any reasonable person would do. The first case was ordinary negligence: still bad because it killed someone, but not so obviously stupid that the person should be in prison for it. The second case is criminal negligence, or in some legal systems it might be called "reckless disregard for human life". He didn't intend to kill anyone, but his actions were so blatantly stupid that he should go to prison for causing the pedestrian's death.

▲

b00ty4breakfast 9 hours ago | parent | prev | next [-]

That darknet bot one always confuses me. The artists/programmers/whatever specifically instructed the computer, through the bot, to perform actions that would likely result in breaking the law. It's not a side-effect of some other, legal action which they were trying to accomplish; its entire purpose was to purchase things on a marketplace known for hosting illegal goods and services.

If I build an autonomous robot that swings a hunk of steel on the end of a chain and then program it to travel to where people are likely to congregate and someone gets hit in the face, I would rightfully be held liable for that.

▲

cess11 8 hours ago | parent | prev [-]

"computer culpability"

That idea is really weird. Culpa (and dolus) in occidental law is a thing of the mind, what you understood or should have understood.

A database does not have a mind, and it is not a person. If it could have culpa, then you'd be liable for assault, perhaps murder, if you took it apart.

▲

Muromec 7 hours ago | parent | next [-]

>A database does not have a mind, and it is not a person. If it could have culpa, then you'd be liable for assault, perhaps murder, if you took it apart.

We as a society, for our own convenience, can choose to believe that an LLM does have a mind and can understand the results of its actions. The second part doesn't really follow. Can you even hurt an LLM in a way that is equivalent to murdering a person? Evicting it off my computer isn't necessarily a crime.

It would be good news if the answer was yes, because then we just need to find a convertor of camel amounts to dollar amounts and we are all good.

Can LLM perceive time in a way that allows imposing an equivalent of jail time? Is the LLM I'm running on my computer the same personality as the one running on yours and should I also shut down mine when yours acted up? Do we even need the punishment aspect of it and not just rehabilitation, repentance and retraining?

▲

Wobbles42 6 hours ago | parent [-]

The only hallucination here is the idea that a giant equation is a mind.

	▲	Muromec 6 hours ago | parent [-]
		It's only a hallucination if you are the only one seeing it. Otherwise the line between that, a social construct and a religious belief is a bit blurry.

▲

observationist 7 hours ago | parent | prev [-]

Yeah - I'm pretty sure, technically, that current AI isn't conscious in any meaningful way, and even the agentic scaffolding and systems put together lack any persistent, meaningful notion of "mind", especially in a legal sense. There are some newer architectures and experiments with the subjective modeling and "wiring" that I'd consider solid evidence of structural consciousness, but for now, AI is a tool. It also looks like we can make tools arbitrarily intelligent and competent, and we can extend the capabilities to superhuman time scales, so I think the law needs to come up with an explicit precedent for "This person is the user of the tool which did the bad thing" - it could be negligent, reckless, deliberate, or malicious, but I don't think there's any credibility to the idea that "the AI did it!"

At worst, you would confer liability to the platform, in the case of some sort of blatant misrepresentation of capabilities or features, but absolutely none of the products or models currently available withstand any rational scrutiny into whether they are conscious or not. They at most can undergo a "flash" of subjective experience, decoupled from any coherent sequence or persistent phenomenon.

We need research and legitimate, scientific, rational definitions for agency and consciousness and subjective experience, because there will come a point where such software becomes available, and it not only presents novel legal questions, but incredible moral and ethical questions as well. Accidentally oopsing a torment nexus into existence with residents possessed of superhuman capabilities sounds like a great way to spark off the first global interspecies war. Well, at least since the Great Emu War. If we lost to the emus, we'll have no chance against our digital offspring.

A good lawyer will probably get away with "the AI did it, it wasn't me!" before we get good AI law, though. It's too new and mysterious and opaque to normal people.

▲

kazinator 8 hours ago | parent | prev | next [-]

That's the same thing. You signed off on the agent doing things on your behalf; you are responsible.

If you gave a loaded gun to a five year old, would "five-year-old did it" be a valid excuse?

▲

Wobbles42 6 hours ago | parent [-]

If the five year old was a product resulting from trillions of dollars in investments, and the marketability of that product required people to be able to hand guns to that five year old without liability, then we would at least be having that discussion.

Purely organically of course.

	▲	Terr_ 2 hours ago | parent [-]
		> If the five year old was a product resulting from trillions of dollars in investments
		In a weird way, that's actually true. It's a highly- (soon to be fully-) autonomous giga-swarm of the most complicated nanobots in existence, the result of investments over hundreds of thousands of years. That said, we don't really get to choose which ones we own, although we do have input on their maintenance. :p

▲

niyikiza 8 hours ago | parent | prev | next [-]

> if you signed the document, you own its content. Versus some vendor-provided AI Agent which simply takes action on its own

Yeah, that's exactly the model I think we should adopt for AI agent tool calls as well: cryptographically signed, task-scoped "warrants" that remain traceable even across multi-agent delegation chains.

▲

embedding-shape 8 hours ago | parent | next [-]

Kind of like https://github.com/cursor/agent-trace but cryptographically signed?

> Agent Trace is an open specification for tracking AI-generated code. It provides a vendor-neutral format for recording AI contributions alongside human authorship in version-controlled codebases.

	▲	niyikiza 8 hours ago | parent [-]
		Similar space, different scope/approach. Tenuo warrants track who authorized what across delegation chains (human to agent, agent to sub-agent, sub-agent to tool) with cryptographic proof & PoP at each hop. Trace tracks provenance. Warrants track authorization flow. Both are open specs. I could see them complementing each other.

▲

Muromec 7 hours ago | parent | prev [-]

Why does it need cryptography even? If you gave the agent a token to interact with your bank account, then you gave it permission. If you want to limit the amount it is allowed to send and the list of recipients, put a filter that sits between the account and the agent and enforces it. If you want the money to be sent only based on an invoice, let the filter check that an invoice reference is provided by the agent. If you did neither and the platform that runs the agents didn't accept the liability, it's on you. Setting up filters and engineering prompts is on you too.

Now if you did all of that, but introduced a bug in implementing the filter, then you at least tried and weren't negligent, but it's still on you.
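
A minimal sketch of the kind of filter described above, for illustration only. The names `PaymentRequest`, `PaymentPolicy`, and `check_payment` are hypothetical and not taken from any real banking or agent API; the sketch assumes the filter sees every request before it reaches the account.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class PaymentRequest:
    """What the agent asks for (hypothetical shape)."""
    recipient: str
    amount_cents: int
    invoice_ref: Optional[str] = None


@dataclass(frozen=True)
class PaymentPolicy:
    """Limits chosen by the human, not by the agent (hypothetical shape)."""
    allowed_recipients: frozenset
    max_amount_cents: int
    require_invoice: bool = True


def check_payment(policy: PaymentPolicy, request: PaymentRequest) -> list:
    """Return every policy violation; an empty list means the request may pass."""
    violations = []
    if request.recipient not in policy.allowed_recipients:
        violations.append("recipient not on allowlist")
    if request.amount_cents <= 0 or request.amount_cents > policy.max_amount_cents:
        violations.append("amount outside allowed range")
    if policy.require_invoice and not request.invoice_ref:
        violations.append("missing invoice reference")
    return violations
```

The filter is deterministic and sits outside the model, so a prompt-injected agent can still ask for anything but cannot get past the limits the human configured.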

▲

niyikiza 6 hours ago | parent | next [-]

Tokens + filters work for single-agent, single-hop calls. Gets murky when orchestrators spawn sub-agents that spawn tools. Any one of them can hallucinate or get prompt-injected. We're building around signed authorization artifacts instead. Each delegation is scoped and signed, chains are verifiable end-to-end. Deterministic layer to constrain the non-deterministic nature of LLMs.

▲

Muromec 6 hours ago | parent [-]

>We're building around signed authorization artifacts instead. Each delegation is scoped and signed, chains are verifiable end-to-end. Deterministic layer to constrain the non-deterministic nature of LLMs.

Ah, I get it. So the token can be downscoped before it is passed on, like the pledge thing, so a sub-agent doesn't exceed the scope of its parent. I have a feeling that it's like cryptography in general -- you take one problem and reduce it to a key management problem.

In a more practical sense, if the non-deterministic layer decides what the reduced scope should be, all delegations can become "Allow: *" in the most pathological case, right? Or like play store, where a shady calculator app can have a permission to read your messages. Somebody has to review those and flag excessive grants.

	▲	niyikiza 5 hours ago | parent [-]
		Right, the non-deterministic layer can't be the one deciding scope. That's the human's job at the root. The LLM can request a narrower scope, but attenuation is monotonic and enforced cryptographically. You can't sign a delegation that exceeds what you were granted. TTL too: the warrant can't outlive its parent. So yes, key management. But the pathological "Allow: *" has to originate from a human who signed it. That's the receipt you're left holding. You're poking at the right edges though. UX for scope definition and revocation propagation are what we're working through now. We're building this at tenuo.dev if you want to dig in the spec or poke holes.
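
To make "attenuation is monotonic" concrete, here is a rough sketch, not Tenuo's actual format or API. It assumes a single shared HMAC key for brevity (a real design would presumably use per-holder asymmetric keys), and every name in it (`Warrant`, `issue_root`, `delegate`, `verify_chain`) is hypothetical.

```python
import hashlib
import hmac
import json
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Warrant:
    holder: str
    scope: frozenset          # set of allowed action names
    expires_at: float         # absolute expiry time (seconds)
    parent: Optional["Warrant"]
    signature: str


def _sign(key: bytes, holder: str, scope: frozenset, expires_at: float, parent_sig: str) -> str:
    # Bind the parent's signature into the payload so links cannot be reordered.
    payload = json.dumps([holder, sorted(scope), expires_at, parent_sig]).encode()
    return hmac.new(key, payload, hashlib.sha256).hexdigest()


def issue_root(key: bytes, holder: str, scope, expires_at: float) -> Warrant:
    scope = frozenset(scope)
    return Warrant(holder, scope, expires_at, None, _sign(key, holder, scope, expires_at, ""))


def delegate(key: bytes, parent: Warrant, holder: str, scope, expires_at: float) -> Warrant:
    scope = frozenset(scope)
    if not scope <= parent.scope:
        raise PermissionError("child scope exceeds parent scope")
    if expires_at > parent.expires_at:
        raise PermissionError("child would outlive parent")
    sig = _sign(key, holder, scope, expires_at, parent.signature)
    return Warrant(holder, scope, expires_at, parent, sig)


def verify_chain(key: bytes, warrant: Warrant, now: float) -> bool:
    """Walk from leaf to root, checking signature, expiry, and narrowing at each hop."""
    node = warrant
    while node is not None:
        parent_sig = node.parent.signature if node.parent else ""
        expected = _sign(key, node.holder, node.scope, node.expires_at, parent_sig)
        if not hmac.compare_digest(expected, node.signature):
            return False
        if now > node.expires_at:
            return False
        if node.parent is not None:
            if not node.scope <= node.parent.scope or node.expires_at > node.parent.expires_at:
                return False
        node = node.parent
    return True
```

The check that matters is in `verify_chain`: even if a delegator skipped `delegate` and hand-built a wider warrant, the verifier would reject it, so an "Allow: *" can only be valid if the root holder granted it.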

▲

Wobbles42 6 hours ago | parent | prev [-]

How can you give an agent a token without cryptography being involved?

	▲	Muromec 6 hours ago | parent [-]
		Not every access token is a (public) key or a signed object. It may be, but it doesn't have to be. It's not state of the art, but also not unheard of to use a pre-shared secret with no cryptography involved and to rely on presenting the secret itself with each request. Cookie sessions are often like that.
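
As a rough illustration of that cookie-session style: the token below is just an opaque random string that the server looks up in a table. Nothing is signed or verified; possession of the string is the whole proof. The names `create_session` and `lookup_session` are made up for this sketch, and the in-memory dict stands in for whatever session store a real server would use.

```python
import secrets
from typing import Optional

# Server-side table mapping opaque tokens to users (stand-in for a real session store).
_sessions = {}


def create_session(user: str) -> str:
    """Hand out an unguessable random token; it carries no claims and no signature."""
    token = secrets.token_urlsafe(32)
    _sessions[token] = user
    return token


def lookup_session(token: str) -> Optional[str]:
    """Whoever presents a known token is treated as that user."""
    return _sessions.get(token)
```

Because the token encodes nothing, it cannot be narrowed or delegated offline: any scoping has to live in the server-side record, which is the contrast with signed artifacts being drawn in this subthread.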

▲

jacquesm 7 hours ago | parent | prev | next [-]

If you signed the document you are responsible for its content, you are most likely not the owner of it.

▲

IG_Semmelweiss 4 hours ago | parent | prev [-]

Actually, things are heading in a good direction re:AI bloopers.

Courts of law have already found that AI interactions with customers are binding, even if said interactions are considered "bloopers" by the vendor[1]

[1] https://www.forbes.com/sites/marisagarcia/2024/02/19/what-ai...

▲ ibejoeb 9 hours ago | parent | prev | next [-]

Yeah. Legal will need to catch up to deal with some things, surely, but the basic principles for this particular scenario aren't that novel. If you're a professional and have an employee acting under your license, there's already liability. There is no warrant concept (not that I can think of right now, at least) that will obviate the need to check the work and carry professional liability insurance. There will always be negligence and bad actors.

The new and interesting part is that while we have incentives and deterrents to keep our human agents doing the right thing, there isn't really an analog to check the non-human agent. We don't have robot prison yet.

▲ godelski 4 hours ago | parent | prev | next [-]

  > It shouldn't matter, because whoever is producing the work product is responsible for it, no matter whether genAI was involved or not.

I hate to ask, but did you RTFA? Scrolling down ever so slightly (emphasis not my own)

  | *Who authorized this class of action, for which agent identity, under what constraints, for how long; and how did that authority flow?*
  | A common failure mode in agent incidents is not “we don’t know what happened,” but:
  | > We can’t produce a crisp artifact showing that a specific human explicitly authorized the scope that made this action possible.

They explicitly state that the problem is you don't know which human to point at.

▲ imiric 6 hours ago | parent | prev | next [-]

That's quickly becoming difficult to determine.

The workflow of starting dozens or hundreds of "agents" that work autonomously is starting to gain traction. The goal of people who work like this is to completely automate software development. At some point they want to be able to give the tool an arbitrary task, presumably one that benefits them in some way, and have it build, deploy, and use software to complete it. When millions of people are doing this, and the layers of indirection grow in complexity, how do you trace the result back to a human? Can we say that a human was really responsible for it?

Maybe this seems simple today, but the challenges this technology forces on society are numerous, and we're far from ready for it.

	▲	niyikiza 6 hours ago | parent | next [-]
		This is the problem we're working on. When orchestrators spawn sub-agents spawn tools, there's no artifact showing how authority flowed through the chain. Warrants are a primitive for this: signed authorization that attenuates at each hop. Each delegation is signed, scope can only narrow, and the full chain is verifiable at the end. Doesn't matter how many layers deep.
	▲	Wobbles42 6 hours ago | parent | prev [-]
		Translation: People want to use a tool and not be liable for the result. People not wanting to be liable for their actions is not new. AI hasn't changed anything here, it's just a new lame excuse.

▲ salawat 10 hours ago | parent | prev | next [-]

Except for the fact that that very accountability sink is relied on by senior management/CxO's the world over. The only difference is that before AI, it was the middle manager's fault. We didn't tell anyone to break the law. We just put in place incentive structures that require it, and play coy, then let anticipatory obedience do the rest. Bingo. Accountability severed. You can't prove I said it in a court of law, and skeevy shit gets done because some poor bloke down the ladder is afraid of getting fired if he doesn't pull out all the stops to meet productivity quotas.

AI is just better because no one can actually explain why the thing does what it does. Perfect management scapegoat without strict liability being made explicit in law.

▲

pixl97 8 hours ago | parent | next [-]

Hence why many life and death things require licencing and compliance, and tend to come with very long paper trails.

The software world has been very allergic to getting anywhere near the vicinity of a system like that.

	▲	salawat 8 hours ago | parent [-]
		Did I give the impression that the phenomenon was unique to software? Hell, Boeing was a shining example of the principle in action with 737 MAX. Don't get much more "people live and die by us, and we know it (but management set up the culture and incentives to make a deathtrap anyway)." No one to blame of course. These things just happen. Licensure alone doesn't solve all these ills. And for that matter, once regulatory capture happens, it has a tendency to make things worse due to consolidation pressure.

▲

Muromec 6 hours ago | parent | prev [-]

>AI is just better because no one can actually explain why the thing does what it does. Perfect management scapegoat without strict liability being made explicit in law.

AI is worse in that regard, because, although you can't explain why it does so, you can point a finger at it, say "we told you so" and provide the receipts of repeated warnings that the thing has a tendency of doing the things.

▲ doctorpangloss 9 hours ago | parent | prev | next [-]

Wait till you find out about “pedal confusion.”

▲ niyikiza 10 hours ago | parent | prev [-]

You're right, they should be responsible. The problem is proving it. "I asked it to summarize reports, it decided to email the competitor on its own" is hard to refute with current architectures.

And when sub-agents or third-party tools are involved, liability gets even murkier. Who's accountable when the action executed three hops away from the human? The article argues for receipts that make "I didn't authorize that" a verifiable claim

▲ bulatb 10 hours ago | parent | next [-]

There's nothing to prove. Responsibility means you accept the consequences for its actions, whatever they are. You own the benefit? You own the risk.

If you don't want to be responsible for what a tool that might do anything at all might do, don't use the tool.

The other option is admitting that you don't accept responsibility, not looking for a way to be "responsible" but not accountable.

▲

tossandthrow 10 hours ago | parent [-]

Sounds good in theory, doesn't work in reality.

Had it worked then we would have seen many more CEOs in prison.

▲

walt_grata 9 hours ago | parent | next [-]

There being a few edge cases where it doesn't work doesn't mean it doesn't work in the majority of cases, or that we shouldn't try to fix the edge cases.

▲

Muromec 7 hours ago | parent | prev | next [-]

CEOs are like cars and immigrants. Both kill people all the time, but we choose to believe they are net positive to society, look the other way and try to put symbolic band aids here and there.

The same may happen to AI or not. We can bite the bullet and say it's fine that it sometimes happens. We can ban the entire thing too if we feel the tradeoff is not worth it.

▲

direwolf20 7 hours ago | parent [-]

You're not doing any favors to your hirability with those first two sentences.

	▲	Muromec 6 hours ago | parent [-]
		The market is allmighty, but it's allmerciful as well, and thankfully, not allknowing.

▲

freejazz 9 hours ago | parent | prev | next [-]

This isn't a legal argument and these conversations are so tiring because everyone here is insistent upon drawing legal conclusions from these nonsense conversations.

▲

bulatb 9 hours ago | parent | prev | next [-]

We're talking about different things. To take responsibility is to volunteer to accept accountability without a fight.

In practice, almost everyone is held potentially or actually accountable for things they never had a choice in. Some are never held accountable for things they freely choose, because they have some way to dodge accountability.

The CEOs who don't accept accountability were lying when they said they were responsible.

▲

NoMoreNicksLeft 9 hours ago | parent | prev [-]

The veil of liability is built into statute, and it's no accident.

No such magic forcefield exists for you, though.

▲ LeifCarrotson 9 hours ago | parent | prev | next [-]

> "I asked it to summarize reports, it decided to email the competitor on its own" is hard to refute with current architectures.

No, it's trivial: "So you admit you uploaded confidential information to the unpredictable tool with wide capabilities?"

> Who's accountable when the action executed three hops away from the human?

The human is accountable.

▲

pixl97 8 hours ago | parent | next [-]

As the saying goes

----

A computer can never be held accountable

Therefore a computer must never make a management decision

	▲	direwolf20 7 hours ago | parent [-]
		That's when companies were accountable for their results and needed to push the accountability to a person to deter bad results. You couldn't let a computer make a decision because the computer can't be deterred by accountability. Now companies are all about doing bad all the time, they know they're doing it, and need to avoid any individual being accountable for it. Computers are the perfect tool to make decisions without obvious accountability.

▲

gowld 8 hours ago | parent | prev | next [-]

What if you carried a stack of papers between buildings on a windy day, and the papers blew away?

	▲	bigfishrunning 7 hours ago | parent [-]
		You should have put the papers in a briefcase or a bag. You are responsible.

▲

Muromec 6 hours ago | parent | prev [-]

>The human is accountable.

That's an orthodoxy. It holds for now (in theory and most of the time), but it's just an opinion, like a lot of other things.

Who is accountable when we have a recession or when people can't afford whatever we strongly believe should be affordable? The system, the government, the market, late stage capitalism or whatever. Not a person that actually goes to jail.

If the value proposition becomes attractive, we can choose to believe that the human is not in fact accountable here, but the electric shaitan is. We just didn't pray good enough, but did our best really. What else can we expect?

▲ phoe-krk 9 hours ago | parent | prev | next [-]

> "I asked it to summarize reports, it decided to email the competitor on its own" is hard to refute with current architectures.

If one decided to paint a school's interior with toxic paint, it's not "the paint poisoned them on its own", it's "someone chose to use a paint that can poison people".

Somebody was responsible for choosing to use a tool that has this class of risks and explicitly did not follow known and established protocol for securing against such risk. Consequences are that person's to bear - otherwise the concept of responsibility loses all value.

▲

Muromec 7 hours ago | parent | next [-]

>Somebody was responsible for choosing to use a tool that has this class of risks and explicitly did not follow known and established protocol for securing against such risk. Consequences are that person's to bear - otherwise the concept of responsibility loses all value.

What if I hire you (instead of an LLM) to summarize the reports and you decide to email the competitors? What if we work in an industry where you have to be sworn in with an oath to protect secrecy? What if I did (or didn't) check with the police about your previous deeds, but it's the first time you emailed competitors? What if you are a schizo that heard God's voice that told you to do so and it's the first episode you ever had?

	▲	phoe-krk an hour ago | parent [-]
		The difference is LLMs are known to regularly and commonly hallucinate as their main (and only) way of internal functioning. Human intelligence, empirically, is more than just a stochastic probability engine, therefore has different standards applied to it than whatever machine intelligence currently exists.

▲

im3w1l 9 hours ago | parent | prev [-]

> otherwise the concept of responsibility loses all value.

Frankly, I think that might be exactly where we end up going. Finding a responsible person to punish is just a tool we use to achieve good outcomes, and if scare tactics are no longer applicable to the way we work, it might be time to discard them.

▲

phoe-krk 9 hours ago | parent [-]

A brave new world that is post-truth, post-meaning, post-responsibility, and post-consequences. One where the AI's hallucinations eventually drag everyone with it and there's no other option but to hallucinate along.

It's scary that a nuclear exit starts looking like an enticing option when confronted with that.

	▲	direwolf20 7 hours ago | parent | next [-]
		I saw some people saying the internet, particularly brainrot social media, has made everyone mentally twelve years old. It feels like it could be true. Twelve-year-olds aren't capable of dealing with responsibility or consequence.
	▲	Muromec 7 hours ago | parent | prev | next [-]
		>A brave new world that is post-truth, post-meaning, post-responsibility, and post-consequences. One where the AI's hallucinations eventually drag everyone with it and there's no other option but to hallucinate along.

		That value proposition depends entirely on whether there is also an upside to all of that. Do you actually need truth, meaning, responsibility and consequences while you are tripping on acid? Do you even need to be alive and have a physical organic body for that? What if Ikari Gendo was actually right and everyone else are assholes who don't let him be with his wife.
	▲	im3w1l 8 hours ago | parent | prev [-]
		Ultimately the goal is to have a system that prevents mistakes as much as possible, and adapts and self-corrects when they do happen. Even with science we acknowledge that mistakes happen and people draw incorrect conclusions, but the goal is to make that a temporary state that is fixed as more information comes in. I'm not claiming to have all the answers about how to achieve that, but I am fairly certain punishment is not a necessary part of it.

▲ QuadmasterXLII 9 hours ago | parent | prev | next [-]

This doesn't seem conceptually different from running

    [ $[ $RANDOM % 6] = 0 ] && rm -rf / || echo "Click"

on your employer's production server, and the liability doesn't seem murky in either case

▲ staticassertion 9 hours ago | parent [-]

What if you wrote something more like:

    # terrible code, never use ty
    import os

    def cleanup(dir):
        # Interpolates an unvalidated path straight into a shell command.
        os.system(f"rm -rf {dir}")


    def main():
        work_dir = os.environ["WORK_DIR"]
        cleanup(work_dir)

and then due to a misconfiguration "$WORK_DIR" was truncated to be just "/"?

At what point is it negligent?

▲

direwolf20 9 hours ago | parent [-]

This is not hypothetical. Steam and Bumblebee did it.

▲

extraduder_ire 9 hours ago | parent | next [-]

That was the result of an additional space in the path passed to rm, IIRC.

Though rm /$TARGET where $TARGET is blank is a common enough footgun that --preserve-root exists and is the default.
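For what it's worth, the same kind of guard can be applied one level up, before anything gets deleted at all. Below is a minimal sketch, assuming a Python cleanup job; `safe_cleanup`, `UnsafePathError`, and the `allowed_root` parameter are hypothetical names made up for illustration, not something from Steam, Bumblebee, or any real tool. It refuses empty targets, the allowed root itself, and anything that resolves outside that root, and it never goes through a shell, so a stray space in the path cannot split into extra arguments.

```python
import shutil
import tempfile
from pathlib import Path


class UnsafePathError(ValueError):
    """Raised when a cleanup target fails validation (hypothetical name)."""


def safe_cleanup(target: str, allowed_root: str) -> Path:
    """Delete `target` only if it is a directory strictly inside `allowed_root`."""
    # A blank or whitespace-only variable is the classic "rm -rf /$TARGET" footgun.
    if not target or not target.strip():
        raise UnsafePathError("empty cleanup target")

    root = Path(allowed_root).resolve()
    # Allowing the filesystem root would make every path "inside" it.
    if root == Path(root.anchor):
        raise UnsafePathError("allowed_root must not be the filesystem root")

    resolved = Path(target).resolve()
    # Strictly inside: the root itself and anything outside it are refused.
    if resolved == root or root not in resolved.parents:
        raise UnsafePathError(f"{resolved} is not inside {root}")
    if not resolved.is_dir():
        raise UnsafePathError(f"{resolved} is not a directory")

    # No shell involved, so the path is a single argument even if it has spaces.
    shutil.rmtree(resolved)
    return resolved


if __name__ == "__main__":
    base = tempfile.mkdtemp()
    work = Path(base) / "work"
    work.mkdir()
    safe_cleanup(str(work), base)  # removes only the scratch directory
    for bad in ("", "/", base):
        try:
            safe_cleanup(bad, base)
        except UnsafePathError as err:
            print("refused:", err)
    shutil.rmtree(base)
```

This only narrows the blast radius. It does not settle who is responsible when the configuration is wrong, which is the question the thread is actually arguing about.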

	▲	niyikiza 8 hours ago | parent | next [-]
		You'd be surprised to see how often we're seeing those types of semantic attack vulnerabilities in Agent frameworks: https://niyikiza.com/posts/map-territory/
	▲	cyberax 7 hours ago | parent | prev [-]
		Even better, $TARGET might be "/home/user/documents and settings /bin"

▲

a_t48 8 hours ago | parent | prev [-]

Bungie, too, in a similar way.

▲ groby_b 9 hours ago | parent | prev | next [-]

"And when sub-agents or third-party tools are involved, liability gets even murkier."

It really doesn't. That falls straight on Governance, Risk, and Compliance. Ultimately, CISO, CFO, CEO are in the line of fire.

The article's argument happens in a vacuum of facts. The fact that a security engineer doesn't know that is depressing, but not surprising.

	▲	Muromec 6 hours ago | parent [-]
		>The fact that a security engineer doesn't know that is depressing, but not surprising.

		That's a very subtle guinea pig joke right there.

▲ freejazz 9 hours ago | parent | prev | next [-]

The burden of substantiating a defense is upon the defendant and no one else.

▲ groby_b 9 hours ago | parent | prev [-]

"Our tooling was defective" is not, in general, a defence against liability. Part of a company's obligations is to ensure all its processes stay within lawful lanes.

"Three months later [...] But the prompt history? Deleted. The original instruction? The analyst’s word against the logs."

One, the analyst's word does not override the logs; that's the point of logs. Two, it's fairly clear the author of the fine article has never worked close to finance. A three-month retention period for AI queries by an analyst is not an option.

SEC Rule 17a-4 & FINRA Rule 4511 have entered the chat.

▲

niyikiza 9 hours ago | parent [-]

Agree ... retention is mandatory. The article argues you should retain authorization artifacts, not just event logs. Logs show what happened. Warrants show who signed off on what.
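To make the log-versus-warrant distinction concrete, here is a rough sketch of what an authorization artifact could look like. It is an illustration under assumptions, not the article's implementation: `sign_warrant`, `verify_warrant`, and the field names are hypothetical, and it uses a shared-secret HMAC from the Python standard library for brevity. A shared secret only proves that someone holding the key signed, so a real system that needs to pin sign-off on one person would more likely use per-approver asymmetric signatures.

```python
import hashlib
import hmac
import json


def _canonical(body: dict) -> bytes:
    # Stable serialization so the same fields always produce the same bytes.
    return json.dumps(body, sort_keys=True).encode()


def sign_warrant(key: bytes, approver: str, tool: str, scope: dict) -> dict:
    """Record who approved which tool, under what scope, plus a signature."""
    body = {"approver": approver, "tool": tool, "scope": scope}
    sig = hmac.new(key, _canonical(body), hashlib.sha256).hexdigest()
    return {**body, "sig": sig}


def verify_warrant(key: bytes, warrant: dict) -> bool:
    """Check that the warrant's fields were not altered after signing."""
    body = {k: v for k, v in warrant.items() if k != "sig"}
    expected = hmac.new(key, _canonical(body), hashlib.sha256).hexdigest()
    return hmac.compare_digest(expected, str(warrant.get("sig", "")))


if __name__ == "__main__":
    key = b"demo-key-not-for-production"
    warrant = sign_warrant(key, "analyst@example.com", "summarize_reports",
                           {"recipients": "internal-only"})
    print(verify_warrant(key, warrant))
    print(verify_warrant(key, {**warrant, "tool": "send_email"}))
```

An event log would record that an email was sent; a retained record like this would show what the human actually approved, so a mismatch between the two is visible after the fact.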

	▲	groby_b 9 hours ago | parent [-]
		FFIEC guidance since '21: https://www.occ.gov/news-issuances/bulletins/2021/bulletin-2...