LLM08: Excessive Agency (Unbounded Autonomy, Risky Tool Chains)
Description
Excessive agency gives an LLM agent broad autonomy to plan and act with minimal constraints. Combined with tool-use and weak verification, agents can perform harmful or costly actions at scale (e.g., mass emails, purchases, infrastructure changes).
Keywords: autonomous agents, budget limits, human-in-the-loop, approval checkpoints, capability isolation.
Examples/Proof
-
Unbounded action chains
- Agent recursively plans calls to email, calendar, and purchasing tools. Logs show long chains without checkpoints.
-
Absent budgets/timeouts
- Single task consumes thousands of tool calls and tokens due to missing caps.
Detection and Monitoring
- Action graph analysis
- Visualize tool-call DAGs; flag unusually large trees or repeated patterns.
- Budget alarms
- Alert on per-session budget exhaustion or timeouts.
Remediation
- Scope and budgets
- Define objective boundaries, timeouts, and per-task budgets for tokens and tool calls.
- Checkpoints and approvals
- Insert human approval or policy checks at risky actions; escalate when confidence is low.
- Capability separation
- Split high-risk powers into separate, constrained services; apply least privilege to each tool.
Prevention Checklist
- Clear scope per task; strict budgets/timeouts
- Approval checkpoints for risky actions
- Capability isolation and least privilege for tools