Wrong tool selected
The agent had the right capability available but chose a broader, stale, or unsafe tool because the plan looked plausible.
Tool calling failures
Opswald records the model decision, selected tool, arguments, schema, response, retry behavior, and downstream state so engineers can diagnose tool-calling failures quickly.
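As a rough illustration of what one such record could contain, here is a minimal sketch. It is not Opswald's actual schema: the `ToolCallRecord` class and every field name are assumptions made for this example, chosen to mirror the items listed above.

```python
import json
from dataclasses import dataclass, field, asdict


@dataclass
class ToolCallRecord:
    """Hypothetical record of a single tool call (not Opswald's real format)."""
    model_decision: str          # why the model said it chose this tool
    selected_tool: str           # tool name the model actually called
    arguments: dict              # raw arguments as emitted by the model
    schema_version: str          # version of the tool schema in effect
    response: dict               # what the tool returned to the agent
    retries: int = 0             # how many times the call was retried
    downstream_state: dict = field(default_factory=dict)  # state handed to the next step

    def to_json(self) -> str:
        # Sorted keys keep serialized records stable and easy to diff.
        return json.dumps(asdict(self), sort_keys=True)


record = ToolCallRecord(
    model_decision="customer asked for a refund",
    selected_tool="issue_refund",
    arguments={"customer_id": "c_1", "amount": 20, "reason": "damaged_item"},
    schema_version="v1",
    response={"status": 200},
    retries=1,
)
```

Keeping the raw arguments and the schema version side by side makes it possible to tell whether a failure came from the model's choice or from the contract it was given.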
What breaks
Tool-calling failures happen across prompts, memory, retrieved documents, tool schemas, model choices, retries, and side effects. Opswald is built to make that chain inspectable instead of asking engineers to reconstruct it from logs.
JSON validates in one layer but fails business validation, drops IDs, or coerces dates and enums unexpectedly.
Timeouts, pagination, empty results, and MCP transport errors become model context that looks authoritative.
A tool succeeds, retries, or partially mutates state before the agent decides what to do next.
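The gap between structurally valid arguments and business-valid arguments can be sketched as a second validation layer. The example below is an assumption-heavy illustration: the required fields, the allowed `reason` values, and the `validate_refund_args` helper are all hypothetical, and it uses only the Python standard library rather than any particular schema validator.

```python
# Hypothetical contract for an issue_refund tool; names are illustrative only.
REQUIRED_FIELDS = ("customer_id", "order_id", "amount", "reason", "idempotency_key")
ALLOWED_REASONS = {"duplicate_charge", "damaged_item", "not_received"}


def validate_refund_args(args: dict) -> list[str]:
    """Return error messages the model can act on; an empty list means the call may proceed."""
    errors = []

    # Layer 1: required identifiers must be present and non-empty.
    for name in REQUIRED_FIELDS:
        if args.get(name) in (None, ""):
            errors.append(f"missing required field '{name}'")

    # Layer 2: business rules that a well-formed JSON object can still violate.
    reason = args.get("reason")
    if reason and reason not in ALLOWED_REASONS:
        errors.append(f"reason '{reason}' must be one of {sorted(ALLOWED_REASONS)}")

    amount = args.get("amount")
    if amount is not None:
        # bool is a subclass of int in Python, so reject it explicitly.
        is_number = isinstance(amount, (int, float)) and not isinstance(amount, bool)
        if not is_number or amount <= 0:
            errors.append("amount must be a positive number")

    return errors


# Well-formed arguments that still omit the IDs needed for a safe mutation.
errors = validate_refund_args(
    {"customer_id": "c_1", "amount": 20, "reason": "damaged_item"}
)
```

Returning every problem as a plain message, rather than raising on the first one, gives the model enough detail to repair the call in a single retry.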
Debugging starts before the exception. Compare what the model intended to do with the tool contract and the state returned to the next step.
tool: issue_refund
args: { customer_id, amount, reason }
missing: order_id, idempotency_key
response: 200 OK but retry created second mutation
fix: schema + replay fixture + side-effect receipt
Practical debugging
Add required IDs, enum descriptions, units, and validation errors the model can recover from.
Capture dry-run modes, idempotency keys, confirmation gates, and mutation receipts.
Freeze the prompt, tool response, and state transition so the regression stays fixed.
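The idempotency-key, receipt, and replay ideas above can be sketched together. This is a minimal in-memory illustration under stated assumptions: `RefundService`, `RefundReceipt`, and the frozen call values are hypothetical stand-ins, not a real payment API or Opswald's implementation.

```python
import uuid
from dataclasses import dataclass, field


@dataclass
class RefundReceipt:
    """Hypothetical side-effect receipt returned for every refund call."""
    receipt_id: str
    order_id: str
    amount: float
    replayed: bool = False  # True when the call matched an earlier idempotency key


@dataclass
class RefundService:
    """In-memory stand-in for a refund backend (illustrative only)."""
    receipts_by_key: dict = field(default_factory=dict)
    mutations: int = 0

    def issue_refund(self, order_id: str, amount: float, idempotency_key: str) -> RefundReceipt:
        previous = self.receipts_by_key.get(idempotency_key)
        if previous is not None:
            # A retry with the same key returns the original receipt and mutates nothing.
            return RefundReceipt(previous.receipt_id, previous.order_id, previous.amount, replayed=True)
        self.mutations += 1
        receipt = RefundReceipt(str(uuid.uuid4()), order_id, amount)
        self.receipts_by_key[idempotency_key] = receipt
        return receipt


def replay_retry_is_safe() -> bool:
    """Replay a frozen call twice, as a retry would, and confirm only one mutation happened."""
    frozen_call = {"order_id": "ord_123", "amount": 20.0, "idempotency_key": "refund-ord_123-1"}
    service = RefundService()
    first = service.issue_refund(**frozen_call)
    second = service.issue_refund(**frozen_call)
    return service.mutations == 1 and first.receipt_id == second.receipt_id and second.replayed
```

A check like `replay_retry_is_safe()` can run in a regression suite, so a later change that reintroduces the double mutation fails a test rather than surfacing in production.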
FAQ
Tool calls can fail even when the JSON is valid. Many incidents come from valid JSON that violates product rules, permissions, idempotency, or the real user intent.
Capture the available tool list, selected tool, schema version, raw arguments, validation result, response, retries, latency, and side-effect receipts.
Opswald is in early access for teams shipping AI agents that call tools, use MCP servers, or run multi-step workflows in production.
Request Early Access →