Gossip Protocols for Agents

One-Line Summary: Gossip protocols spread information probabilistically — each peer periodically picks a few random peers and exchanges state with them, converging the cluster toward a shared view over time without any leader; for large agent populations where eventual consistency is acceptable, gossip is the right scaling strategy.

Prerequisites: Consensus in multi-agent systems

What Is a Gossip Protocol?

Gossip (also called "epidemic" protocols) is the distributed-systems idiom borrowed from how rumors spread. Each peer periodically:

Picks a small random subset of peers.
Exchanges state (or a delta) with them.
Merges incoming state with its own.

After enough rounds, every peer knows everything. Convergence is O(log n) rounds for n peers — fast in practice, robust to peer churn, with no leader to be a bottleneck or SPOF.

The trade vs. Raft and BFT: gossip provides eventual consistency, not strong consistency. Two peers can disagree for some interval after a write, and there's no easy way to read "the latest" reliably.

Why It Fits Some Agent Use Cases

Two classes of agent state work well with gossip:

Membership / liveness: Which agents are alive, which workloads they're running, what their behavioral trust scores are. These need to spread but don't need strong consistency.
Aggregated stats: Counters, distribution updates, "what have the other agents seen lately" — eventual consistency is fine because the values are themselves estimates.

Gossip is not fit for state where two peers disagreeing is bad: planning decisions, memory writes, vote tallies. Use Raft or BFT there.

How It Works in Ruflo's Federation

Ruflo uses gossip for federated peer state: which peers exist, their public keys, their behavioral trust scores, their advertised tools and skills. A new peer joining the federation gossips its identity; existing peers gossip back the cluster view. After a few rounds, the new peer has a complete-ish picture without anyone serializing the join through a leader.

This complements Raft/BFT for the strongly-consistent decisions: gossip handles the "who's around" layer, Raft/BFT handles the "what should we do" layer.

Key Technical Details

Fanout: The number of random peers each round contacts. Higher fanout = faster convergence + more bandwidth.
Push, pull, push-pull: Push (I tell you), pull (I ask you), push-pull (we exchange). Push-pull is most efficient.
Anti-entropy: Periodic full-state reconciliation to catch missed updates. Important for long-running clusters.
Tombstones: Deletes are tricky in gossip — a deleted entry can be re-spread by a peer that hasn't seen the delete. Use tombstones with TTL.
Vector clocks or version vectors: For conflict detection. Without them, last-write-wins is the typical fallback.
Memory cost: Each peer keeps state about every other peer. Sub-linear-in-n is rare; usually O(n).
Bounded staleness: Gossip doesn't bound how out-of-date your view can be in the worst case (only in expectation).

How Harnesses & Frameworks Implement This

Harness / Framework	Gossip support
Claude Code	None
Claude Agent SDK	DIY
ruflo	First-class — federation membership uses gossip
LangGraph	DIY
AutoGen	DIY
CrewAI	DIY
OpenAI Agents SDK	DIY
Codex CLI / Cursor	✗

Connections to Other Concepts

consensus-in-multi-agent-systems.md — The category.
raft-for-agents.md, byzantine-fault-tolerant-agents.md — Strong-consistency alternatives.
cross-machine-agent-federation.md — The natural setting.
behavioral-trust-scoring.md — Trust scores often spread via gossip.