It has the same score on https://lmarena.ai/leaderboard/webdev , but AFAIK the Air version is much smaller.
I've added results for GLM 4.5 and 4.5 Air.