Remix clone Hacker News

new | show | ask | jobs Github

	▲	Akranazon 2 hours ago
		> they’re runtime rules that kick in when assertions or ordinality constraints are explicit So there a pre-defined list of rules - is it choosing which checks to care about from the set, or is there also a predefined binding between the task and the test? If it's the former, then you have to ensure that the checks are sufficiently generic that there's a useful test for the given situation. Is an AI doing the choosing, over which of the checks to run? If it's the ladder, I would assume that writing the tests would be the bottleneck, writing a test can be as flaky/time-consuming as implementing the actions by hand.
	▲	tonyww 2 hours ago \| parent [-]
		It’s mostly the former: there’s a small set of generic checks/primitives, and we choose which ones to apply per step. The binding between “task/step” and “what to verify” can come from either: the user (explicit assertions), or the planner/executor proposing a post-condition (e.g. “after clicking checkout, URL contains /checkout and a checkout button exists”). But the verifier itself is not an AI, by design it’s predicate-only