Remix clone Hacker News

new | show | ask | jobs Github

	▲	mjaniczek 7 hours ago
		It's entirely happy paths right now; it would be best to allow the test runner to also test for failures (check expected stderr and return code), then we could write those missing tests. I think you can find a test somewhere in there with a commented code saying "FAWK can't do this yet, but yadda yadda yadda".
	▲	alganet 7 hours ago \| parent [-]
		It's funny because I'm evaluating LLMs for just this specific case (covering tests) right now, and it does that a lot. I say "we need 100% coverage on that critical file". It runs for a while, tries to cover it, fails, then stops and say "Success! We covered 60% of the file (the rest is too hard). I added a comment.". 60% was the previous coverage before the LLM ran.