tptacek 2 days ago

Honestly I'm just trying to be nice about it. I don't know that I can tell you a story about the 90% ceiling that makes any sense, especially since you can task 3 different high-caliber teams of senior software security people on an app and get 3 different (overlapping, but different) sets of vulnerabilities back. By the end of 2027, if you did a triangle test, 2:1 agents/humans or vice versa, I don't think you'd be able to distinguish.

Just registering the prediction.

karlmdavis 2 days ago | parent [-]

I would take the other side of that bet.

  # if >10 then was_created_by_agent = true
  $ grep -oP '\p{Emoji}' vulns.md | wc -l

tptacek 2 days ago | parent [-]

I don't understand what you're trying to say here.

Paracompact 2 days ago | parent [-]

Just that the superficial details of how AIs communicate (e.g. with lots of emojis) might give them away in any triangle test :)

tptacek 2 days ago | parent | next [-]

Ah! Touché.

worksonmine 2 days ago | parent | prev [-]

I see this emoji thing being mentioned a lot recently, but I don't remember ever seeing one. Granted I rarely use AI and when I do it's on duck.ai. What models are (ab)using emojis?