This is a 2024 survey, so it predates Claude Code and is mostly measuring GPT-4o:
https://bfi.uchicago.edu/wp-content/uploads/2025/04/BFI_WP_2...