New Agent Benchmark from Meta Super Intelligence Lab and Hugging Face (huggingface.co)
1 point by clmnt 8 hours ago | 1 comment
clmnt 8 hours ago
Introducing Gaia2, the follow-up to the agentic benchmark GAIA. It allows analysis of considerably more complex agent behaviors. Gaia2 is released with the open Meta Agents Research Environments (ARE) framework for running, debugging, and evaluating agents. ARE simulates complex, real-world-like conditions and can be customized to further study agent behaviors. The Gaia2 dataset is released under the CC BY 4.0 license, and ARE under the MIT license.