Entry

Secret Collusion among AI Agents: Multi-Agent Deception via Steganography

Sumeet Ramesh Motwani, Mikhail Baranchuk, Martin Strohmeier, Vijay Bolina, Philip H.S. Torr, Lewis Hammond, Christian Schroeder de Witt

Synopsis

Formalises steganographic secret collusion among generative AI agents and evaluates current models. Current capabilities are limited, but GPT-4 shows a capability jump. Proposes monitoring and mitigation measures, including paraphrasing.

Keywords

secret collusion · steganography · multi-agent deception · governance · mitigation


Related entries

A Co-Evolutionary Theory of Human-AI Coexistence: Mutualism, Governance, and Dynamics in Complex Societies
April 24, 2026 · arXiv

AGENTSAFE: A Unified Framework for Ethical Assurance and Governance in Agentic AI
December 2, 2025 · arXiv

Agentic AI has a Human Oversight Problem
September 15, 2025 · SSRN preprint 2025

TRiSM for Agentic AI: A Review of Trust, Risk, and Security Management in LLM-based Agentic Multi-Agent Systems
June 4, 2025 · arXiv

The AI Agent Index
February 3, 2025 · arXiv

On the Quest for Effectiveness in Human Oversight: Interdisciplinary Perspectives
April 5, 2024 · FAccT 2024