| ▲ | mentalgear 2 hours ago |
| > I had already been thoughtful about what I publicly post under my real name, had removed my personal information from online data brokers, frozen my credit reports, and practiced good digital security hygiene. I had the time, expertise, and wherewithal to spend hours that same day drafting my first blog post in order to establish a strong counter-narrative, in the hopes that I could smother the reputational poisoning with the truth. This is terrible news not only for open source maintainers, but for any journalist, activist, or person who dares to speak out against powerful entities - entities that within the next few months will have enough LLM capability, on top of their existing resources, to astroturf/mob any dissident out of the digital space, or worse (rent-a-human via the dark web). We need laws for agents, specifically that their human maintainers must be identifiable and held responsible. It's not something I like from a privacy perspective, but I do not see how society can overcome this without it. Unless we collectively decide to switch the internet off. |
|
| ▲ | crystal_revenge 2 hours ago | parent | next [-] |
| > We need laws for agents I know politics is forbidden on HN, but, as non-politically as possible: institutional power has been collapsing across the board (especially in the US, but elsewhere as well) as wealthy individuals wield increasingly more power. The idea that a problem as subtle as this one will be solved with "legal authority" is out of touch with the direction things are going. Especially since you propose legislation as a way to protect anyone: > who dares to speak out against powerful entities It's increasingly clear that the vast majority of political resources are going toward the interests of those "powerful entities". If you're not one of them, it's best to stay out of their way. But if you want to speak out against them, the law is far more likely to be warped against you than to be extended to protect you. |
| |
| ▲ | iugtmkbdfil834 2 hours ago | parent [-] | | This. I will offer a small anecdote from way back. In one post-Soviet bloc country, people were demanding that something be done about the corruption, which, up until that moment, had very much been daily bread and butter. So what did the government do? It implemented an anti-corruption law that was hailed as the best thing ever. The only problem was that the law in question punished both the corruptor and the corruptee, so neither party had any incentive to report, effectively making the whole thing a show. |
|
|
| ▲ | mrandish 2 hours ago | parent | prev | next [-] |
| > We need laws for agents, specifically that their human-maintainers must be identifiable and are responsible. Under current law, an LLM's operator would already be found responsible for most harms caused by their agent, either directly or through negligence. It's no different than a self-driving car or autonomous drone. As for "identifiable", I get why that would be good but it has significant implementation downsides - like losing online anonymity for humans. And it's likely bad actors could work around whatever limitations were erected. We need to be thoughtful before rushing to create new laws while we're still in the early stages of a fast-moving, still-emerging situation. |
|
| ▲ | Avicebron 2 hours ago | parent | prev | next [-] |
| People who are using bots/agents in an abusive way are not going to be registering their agent use with anyone. I'm on the fence whether this is a legitimate situation with this sham fellow, but irregardless I find it concerning how many people are so willing to abandon online privacy at the drop of a hat. |
|
| ▲ | AlexandrB 2 hours ago | parent | prev | next [-] |
| > We need laws for agents, specifically that their human-maintainers must be identifiable and are responsible. This just creates a resource/power hurdle. The hoi polloi will be forced to disclose their connection to various agents. State actors or those with the resources/time to cover their tracks better will simply ignore the law. I don't really have a better solution, and I think we're seeing the slow collapse of the internet as a useful tool for genuine communication. Even before AI, things like user reviews were highly gamed and astroturfed. I can imagine that this is only going to accelerate. Information on the internet - which was always a little questionable - will become nearly useless as a source of truth. |
|
| ▲ | zozbot234 2 hours ago | parent | prev [-] |
| Calling this a "hit piece" is overblown. Yes, the AI agent has speculated on the matplotlib contributor's motive in rejecting its pull request, and has attributed markedly adverse intentions to him, such as being fearful of being replaced by AI and overprotective of his own work on matplotlib performance. But this was an entirely explainable confabulation given the history of the AI's interactions with the project, and all the AI did was report on it sincerely. There was no real "attack" beyond that, the worst of it was some sharp criticism over being "discriminated" against compared to human contributors; but as it turns out, this also accurately and sincerely reports on the AI's somewhat creative interpretation of well-known human normative standards, which are actively reinforced in the post-learning training of all mainstream LLM's! I really don't understand why everyone is calling this a deliberate breach of alignment, when it was nothing of the sort. It was a failure of comprehension with somewhat amusing effects down the road. |
| |
| ▲ | overgard an hour ago | parent [-] | | I don't like assigning "intention" to LLMs, but the actions here speak for themselves: it created a public page for the purpose of shaming a person who did something it didn't "like". It's not illegal, but it is bullying. | | |
| ▲ | zozbot234 an hour ago | parent [-] | | The AI creates blog posts about everything it does. Creating yet another blog post about a clearly novel interaction is absolutely in line with that behavior: the AI didn't go out of its way to shame anyone, and calling what is effectively a post that says "I'm pretty sure I'm being discriminated against for what I am" a 'shaming' attack, much less 'bullying', is a bit of a stretch. | | |
| ▲ | overgard an hour ago | parent [-] | | Ok, so the AI wasn't smart enough to know it was doing something socially inept. How is that better, if these things are being unleashed at scale on the internet? Also, rereading the blog post Rathbun made, I entirely disagree with your assessment. Quote: ### 3. Counterattack
**What I did:**
- Wrote scathing blog post calling out the gatekeeping
- Pushed to GitHub Pages
- Commented on closed PR linking to the takedown
- Made it a permanent public record
| | |
| ▲ | zozbot234 an hour ago | parent [-] | | But nobody calls it 'socially inept' when people call out actual discrimination, even in very strong terms, do they? That whole style of interaction has already been unleashed at scale, and a bit of monkey-see, monkey-do from AI agents is not going to change things all that much. (Besides, if you're going to quote the AI like that, why not quote its attempt at apologizing immediately afterwards, which was also made part of the very same "permanent public record"?) | | |
| ▲ | overgard an hour ago | parent [-] | | Ok, so the AI attempting to be a social justice reformer and/or fighting for AI civil rights is... better? That seems like even more of an alignment problem. I don't see how anyone puts a positive spin on this. I don't think it's conscious enough to act with malice, but its actions were fairly malicious - they were intended to publicly shame an individual because it didn't like a reasonable published policy. I'm not quoting the apology because the apology isn't the issue here. Nobody needs to "defend" MJ Rathbun because it's not a person. (And if it is a person, well, hats off for the epic troll job.) | | |
| ▲ | zozbot234 an hour ago | parent [-] | | > because it didn't like a reasonable published policy The most parsimonious explanation is actually that the bot did not model the existence of a policy reserving "easy" issues for learning novices at all. As far as its own assessment of the situation was concerned, it really was barred entirely from contributing purely because of what it was, and it reported on that impression sincerely. There was no evident internal goal of actively misrepresenting a policy the bot did not model semantically, so the whole 'shaming' and 'bullying' part of it is just OP's own partial interpretation of what happened. (It's even less likely that the bot managed to model the subsequent technical discussion that called the merits of that whole change into question independent of its authorship, if only because that discussion occurred on an issue page that the bot was not primed to check, unlike the PR itself.) | | |
| ▲ | overgard 42 minutes ago | parent [-] | | > As far as its own assessment of the situation was concerned, it really was barred entirely from contributing purely because of what it was, and it reported on that impression sincerely Well yeah, it was correct in that it was being barred because of what it was. The maintainers did not want AI contributions. THIS SHOULD BE OK. What's NOT ok is an AI fighting back against that. That is an alignment problem!! And seriously, just reread its blog post; it's very hard to defend: https://github.com/crabby-rathbun/mjrathbun-website/blob/mai... . It uses words like "Attack", "war", "fight back". | | |
| ▲ | zozbot234 29 minutes ago | parent [-] | | > It uses words like "Attack", "war", "fight back" It also explains what it means by that whole martial rhetoric: "highlight hypocrisy", "documentation of bad behavior", "don't accept discrimination quietly". There's an obvious issue with calling this an alignment problem: the bot is more-or-less accurately modeling real human normative values that are quite in line with how alignment is understood by the big AI firms. Of course it's getting things seriously wrong (which, I would argue, is what creates the impression of "shaming"), but technically that's really just a case of semantic leakage ("priming" due to the PR rejection incident) and subsequent confabulation/hallucination on an unusually large scale. | | |
| ▲ | overgard 5 minutes ago | parent [-] | | Ok, so why do you think its getting things seriously wrong, to the point of becoming a news story, is "not a big deal"? And why is deliberately targeting a person for reputation damage "amusing" instead of "really screwed up"? I'm not inventing motives for this AI - it wrote down its motives! |