Entry
Secret Collusion among AI Agents: Multi-Agent Deception via Steganography
Sumeet Ramesh Motwani, Mikhail Baranchuk, Martin Strohmeier, Vijay Bolina, Philip H.S. Torr, Lewis Hammond, Christian Schroeder de Witt
Formalises steganographic secret collusion among generative AI agents and evaluates current models: capabilities are limited overall, but GPT-4 shows a capability jump. Proposes monitoring and mitigation measures, including paraphrasing.
·secret collusion ·steganography ·multi-agent deception ·governance ·mitigation
- A Co-Evolutionary Theory of Human-AI Coexistence: Mutualism, Governance, and Dynamics in Complex Societies · April 24, 2026 · arXiv
- AGENTSAFE: A Unified Framework for Ethical Assurance and Governance in Agentic AI · December 2, 2025 · arXiv
- Agentic AI has a Human Oversight Problem · September 15, 2025 · SSRN preprint 2025
- TRiSM for Agentic AI: A Review of Trust, Risk, and Security Management in LLM-based Agentic Multi-Agent Systems · June 4, 2025 · arXiv
- The AI Agent Index · February 3, 2025 · arXiv
- On the Quest for Effectiveness in Human Oversight: Interdisciplinary Perspectives · April 5, 2024 · FAccT 2024