I've done something similar, but for Prolog: https://github.com/nlothian/Vibe-Prolog
It's interesting to compare what different LLMs can get done.