PAndreew 3 hours ago

I think one partial solution could be to actually spin up a remote container with dummy data (which can easily be generated by an LLM) and test the claim. With agents, this can be done very quickly. Once the claim has been verified, it can be published along with the test configuration.

ray_v 2 hours ago | parent [-]

A partial solution, sure, but the problem is that you need a 100% complete solution here; otherwise it's still unsafe.