Immagine della notizia

MCP-Universe benchmark shows GPT-5 fails more than half of real-world orchestration tasks

Date: 2025-08-22 22:50:55

A new benchmark from Salesforce research evaluates model and agentic performance on real-life enterprise tasks.


Sources:

Click and go !

More From:

venturebeat.com