What is LCGuard and how does it work?

LCGuard is a security framework for large language model-based multi-agent systems that treats shared KV caches as latent working memory. It learns representation-level transformations to block sensitive information propagation before data crosses agent boundaries.

Why is LCGuard important for AI agents?

KV cache-based communication bypasses natural language filters, allowing one agent to reconstruct another's private inputs. LCGuard closes this representation-level vulnerability while maintaining high collaborative efficiency.

What are the future directions for LCGuard?

Researchers plan to extend the framework to complex distributed intelligent systems and provide ready-to-use security solutions for sensitive industrial scenarios like enterprise customer service and healthcare.

LCGuard：多智能體系統中基於潛在通信的安全KV快取共享框架

本文針對基於大語言模型的多智能體系統中，利用Transformer鍵值（KV）快取進行潛在通信所帶來的隱私洩露風險，提出了LCuard框架。現有研究雖證實KV快取通信能提升效率並保留豐富資訊，但其作為不透明通道可能傳播敏感內容。LCuard將共享KV快取視為潛在工作記憶，透過學習表示級變換來阻斷敏感資訊傳播。該方法形式化定義了基於重構的敏感資訊洩露，並採用對抗訓練策略，使防禦者學習保留任務語義且減少可重構資訊的變換，而攻擊者嘗試重構敏感輸入。在多個模型家族和多智能體基準上的實驗表明，LCuard在顯著降低基於重構的洩露和攻擊成功率的同時，保持了與標準KV共享基線相當的競爭力任務效能，為安全的多智能體協作提供了新範式。

Sources

arXiv