I think models from one year ago with proper harness should be easily beating humans at this task on average. Human CEOs decisions are worse than random chance.