Yeah it's a good call out. I think it's a (more) winnable battle though.

For both a keystroke based AI detector, and software designed to mimic human keystroke patterns, performance will be determined by the size of the dataset they have of genuine human keystroke patterns. The detector has an inherent leg-up in this, because it's constantly collecting more data through the use of the tool, whereas the mimic software doesn't have any built in loop to collect those inputs.

▲

lobsterthief 5 days ago | parent | next [-]

Interesting idea! Could someone use the software to train an LLM prompt that will get around it? By learning what passes and what doesn’t and then having the LLM train on that

	▲	enjeyw 5 days ago \| parent [-]
		Yeah this is something I'm a little worried about - right now it's not extremely difficult to just take an AI generated essay and then just tweak the essay until it passes. My first pass approximation is to make the assessment of whether the essay is AI generated or not accessible only to teachers. I may need to also rate-limit the checks, so people can't brute force it to gather data on what passess.

▲

Footprint0521 5 days ago | parent | prev [-]

I got burned by software like this, when I pasted in a essay I transcribed while driving through Whisper, and software like this thought I had pasted AI content lol

	▲	enjeyw 5 days ago \| parent [-]
		Lol! Technically they're not wrong about it being AI generated, just not at all in the way they meant.