COORDINATION
Multi-agent workflow coordination failures.
The COORDINATION module covers errors in managing multi-agent workflows. This is the first of the five multi-agent modules, focusing on workflow management, task handoffs, and synchronization between agents.
Categories
WORKFLOW
Workflow management failures.
| Code | Severity | Description | Remediation |
|---|---|---|---|
COORDINATION.WORKFLOW.DEADLOCK | 5 | Agents in deadlock | Implement deadlock detection and resolution |
COORDINATION.WORKFLOW.LIVELOCK | 4 | Agents in livelock | Implement livelock detection |
COORDINATION.WORKFLOW.STARVATION | 4 | Agent starvation | Implement fair scheduling |
COORDINATION.WORKFLOW.CYCLE | 3 | Cyclic dependency in workflow | Validate workflow DAG |
COORDINATION.WORKFLOW.ORPHANED | 3 | Orphaned task or agent | Implement orphan detection |
Warning
COORDINATION.WORKFLOW.DEADLOCK (severity 5) indicates agents are permanently blocked waiting on each other. Unlike a livelock where agents are still active but making no progress, a deadlock requires external intervention to resolve.
HANDOFF
Task handoff failures.
| Code | Severity | Description | Remediation |
|---|---|---|---|
COORDINATION.HANDOFF.INCOMPLETE | 4 | Incomplete handoff information | Validate handoff completeness |
COORDINATION.HANDOFF.WRONG_RECIPIENT | 4 | Task sent to wrong agent | Validate recipient capabilities |
COORDINATION.HANDOFF.LOST | 5 | Handoff was lost | Implement handoff acknowledgment |
COORDINATION.HANDOFF.DUPLICATED | 3 | Duplicate handoffs | Implement idempotent handoffs |
COORDINATION.HANDOFF.DELAYED | 2 | Handoff significantly delayed | Monitor handoff latency |
SYNCHRONIZATION
Agent synchronization failures.
| Code | Severity | Description | Remediation |
|---|---|---|---|
COORDINATION.SYNCHRONIZATION.RACE_CONDITION | 5 | Race condition between agents | Implement proper locking |
COORDINATION.SYNCHRONIZATION.OUT_OF_ORDER | 4 | Out of order execution | Implement ordering guarantees |
COORDINATION.SYNCHRONIZATION.SPLIT_BRAIN | 5 | Split brain scenario | Implement leader election |
COORDINATION.SYNCHRONIZATION.CLOCK_SKEW | 3 | Significant clock skew | Use logical clocks |
COORDINATION.SYNCHRONIZATION.VERSION_MISMATCH | 3 | Version mismatch between agents | Implement version negotiation |
Example Diagnosis
{
"error_code": "COORDINATION.SYNCHRONIZATION.RACE_CONDITION",
"confidence": 0.82,
"root_cause": "Two agents simultaneously updated the shared task state, causing one agent's changes to be overwritten.",
"suggested_fixes": ["guard", "parameter"]
}