Remix.run Logo
puppystench 3 hours ago

They ran a bounty on Kaggle last year but with $500k in payouts and with all results open and publishable.

https://www.kaggle.com/competitions/openai-gpt-oss-20b-red-t...

With only $25k in payouts and everything locked down under NDA, I can't imagine many people will participate. Well, other than those submitting mountains of LLM-generated junk.

Barbing 2 hours ago | parent | next [-]

> Well, other than those submitting mountains of LLM-generated junk.

Assuming somehow some of them use halfway decent models and prompts… They successfully pushed some of the token cost of their analysis work off on customers!

mpeg an hour ago | parent | prev | next [-]

I was surprised at the low bounty too, considering the resources of openai

Last year I won a similar prompt injection challenge ran by a crypto startup against the latest claude and gpt (at the time) and it was considerably more money, from an org with maybe $5-10m in funding.

That and the restrictive NDA kinda tells me they're not looking for serious bounty hunters, who would either want a lot more money or, alternatively, to be able to publish their work; seems like a marketing stunt.

p_stuart82 2 hours ago | parent | prev | next [-]

basically discount Kaggle. still get people poking at it, just none of the writeups or who-gets-paid drama.

dist-epoch 3 hours ago | parent | prev [-]

This model is much more powerful than gpt-oss-20b, notice how the contest was not even for the 120b model. Also, bio was not a subject.

stonogo 2 hours ago | parent [-]

The model is more powerful, so the bounty is 1/20th the size? More risk, less reward?

"Biorisk" seems to be a concept not only invented by OpenAI but exclusively taken seriously by them. I wonder if this program is less about finding actual risks than it is hopefully casting a wide net for someone to help them prove their model is relevant in this space.

ACCount37 2 hours ago | parent [-]

Not really. Anthropic has the "CBRN filter" on Opus series. It used to kill inquiries on anything that's remotely related to biotech. Seems to have gotten less aggressive lately?

I was reverse engineering a medical device back in 2025 and it was hard killing half my sessions.