H Human–AI Coevolution

Entry

CooperBench: Why Coding Agents Cannot be Your Teammates Yet

Arpandeep Khatua, Hao Zhu, Peter Tran, Arya Prabhudesai, Frederic Sadrieh, Johann K. Lieberwirth, Xinkai Yu, Yicheng Fu, Michael J. Ryan, Jiaxin Pei, Diyi Yang

Synopsis

600+ collaborative coding tasks across 12 libraries / 4 languages; agents achieve 30% lower success rates when working together vs. solo.

Keywords

·coding agents ·cooperation ·benchmark ·multi-agent ·communication

Open paper ↗ arXiv ↗ Report issue ↗

Related entries