Ask it to take control of a browser using something like Playwright and use the UI itself like an end user would and evaluate whether it is a good experience.