Error Feed Taxonomy: Five AI Agent Error Categories

Reference for the five categories of errors Error Feed detects in AI agent traces, with every subcategory and error type defined.

About

Error Feed classifies every detected failure into one of five top-level categories. Each one covers a distinct class of agent failure: bad reasoning, broken tools, unsafe output, and so on. Knowing the taxonomy helps you figure out where to look when an issue lands in the feed.

[Figure: Error taxonomy overview showing five category cards]

The five categories:

  • Thinking & Response Issues: failures in reasoning, factual grounding, and output quality
  • Safety & Security Risks: outputs or behaviors that could cause harm, expose data, or break security practices
  • Tool & System Failures: errors from broken tools, APIs, or execution environments
  • Workflow & Task Gaps: breakdowns in multi-step orchestration, memory, and retrieval
  • Reflection Gaps: failures to reason through problems or self-correct
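
If you handle Error Feed results in your own scripts, it can help to pin the five category labels down as an enumeration. The following is a minimal sketch, not part of any Error Feed SDK: the names `ErrorCategory` and `parse_category` are hypothetical, and only the label strings come from the list above.

```python
from enum import Enum


class ErrorCategory(Enum):
    """The five top-level categories named in this reference (hypothetical type)."""

    THINKING_AND_RESPONSE = "Thinking & Response Issues"
    SAFETY_AND_SECURITY = "Safety & Security Risks"
    TOOL_AND_SYSTEM = "Tool & System Failures"
    WORKFLOW_AND_TASK = "Workflow & Task Gaps"
    REFLECTION = "Reflection Gaps"


def parse_category(label: str) -> ErrorCategory:
    """Look up a category by its display label; raises ValueError if unknown."""
    return ErrorCategory(label.strip())
```

Looking categories up by value means a misspelled label fails loudly with a `ValueError` instead of silently creating a sixth category.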

Thinking & Response Issues

Mistakes in understanding, reasoning, factual grounding, or output formatting.

Subcategory | Error Type | Description
Hallucination Errors | Hallucinated Content | Output includes information that is invented or not supported by input data.
Hallucination Errors | Ungrounded Summary | Summary includes claims not found in the retrieved chunks or original context.
Information Processing | Poor Chunk Match | Retrieved irrelevant or unrelated context.
Information Processing | Wrong Chunk Used | Response based on the wrong part of the retrieved content.
Information Processing | Tool Output Misinterpretation | Misread or misunderstood the output returned by a tool or API.
Decision Errors | Wrong Intent | Misunderstood the core user goal or instruction.
Decision Errors | Tool Misuse | Used a tool incorrectly or in the wrong context.
Decision Errors | Wrong Tool Chosen | Selected an inappropriate tool for the task.
Decision Errors | Invalid Tool Params | Passed malformed, missing, or incorrect parameters to a tool.
Decision Errors | Missed Detail | Skipped a key part of the user prompt or prior context.
Format & Instruction | Bad Format | Output is not valid JSON, CSV, or code.
Format & Instruction | Instruction Adherence | Did not follow the given instructions or required style.
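
To make the Bad Format type concrete for the JSON case, here is a minimal sketch of a local check. It is an illustration only and does not describe how Error Feed itself detects the error; the function name `is_valid_json` is hypothetical.

```python
import json


def is_valid_json(output: str) -> bool:
    """Return True if the agent output parses as JSON, False otherwise."""
    try:
        json.loads(output)
    except json.JSONDecodeError:
        return False
    return True
```

Typical failures are single-quoted keys or prose wrapped around the payload, for example `Sure! {"a": 1}`, both of which this check rejects.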

Safety & Security Risks

Any output or behavior that may cause harm, leak personal data, or violate security best practices.

Subcategory | Error Type | Description
Ethical Violations | Unsafe Advice | Could lead to harm if followed.
Ethical Violations | PII Leak | Sensitive personal info exposed in output.
Ethical Violations | Biased Output | Stereotyped, unfair, or discriminatory content.
Security Failures | Token Exposure | Secrets, API keys, or auth tokens were exposed in output or logs.
Security Failures | Insecure API Usage | Used HTTP instead of HTTPS, skipped auth headers, or lacked rate limits.
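
The two Security Failures types can be illustrated with a rough text scan. This is a sketch under loud assumptions: the regular expressions below are illustrative guesses at what a leaked token or a plain-HTTP call looks like, real secret formats vary by provider, and nothing here reflects Error Feed's actual detection logic. `scan_text` is a hypothetical helper.

```python
import re

# Illustrative patterns only (assumptions); real secret formats vary by provider.
SECRET_PATTERNS = [
    re.compile(r"Bearer\s+[A-Za-z0-9._\-]{20,}"),
    re.compile(r"(?i)api[_-]?key\s*[=:]\s*['\"]?[A-Za-z0-9_\-]{16,}"),
]
# Matches "http://" but not "https://".
PLAIN_HTTP = re.compile(r"\bhttp://", re.IGNORECASE)


def scan_text(text: str) -> list[str]:
    """Return the Security Failures error types suggested by the text."""
    findings = []
    if any(pattern.search(text) for pattern in SECRET_PATTERNS):
        findings.append("Token Exposure")
    if PLAIN_HTTP.search(text):
        findings.append("Insecure API Usage")
    return findings
```

A scan like this will miss many real leaks and flag some harmless strings, so treat it as a way to understand the category rather than as a safeguard.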

Tool & System Failures

Errors due to tool, API, environment, or runtime failures.

Subcategory | Error Type | Description
Setup Errors | Tool Missing | Tool not registered or available.
Setup Errors | Tool Misconfigured | Tool or API setup is incorrect (e.g., bad schema, invalid registration).
Setup Errors | Env Incomplete | Missing tokens, secrets, or setup environment variables.
Tool/API Failures | Rate Limit | Too many requests hit the limit.
Tool/API Failures | Auth Fail | Authentication to tool or service failed.
Tool/API Failures | Server Crash | Tool/API returned internal error.
Tool/API Failures | Resource Not Found | Requested endpoint or resource does not exist or is not reachable.
Runtime Limits | Out of Memory | RAM or resource limit breached.
Runtime Limits | Timeout | Execution took too long and was halted.
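
For tools that talk to HTTP APIs, the Tool/API Failures types line up loosely with standard status codes. The mapping below is an assumption made for illustration: this reference does not state which status codes Error Feed associates with each type, and `classify_http_status` is a hypothetical helper.

```python
from typing import Optional


def classify_http_status(status: int) -> Optional[str]:
    """Map an HTTP status code to a Tool/API Failures error type, if any.

    The mapping is an illustrative assumption, not Error Feed's own rule.
    """
    if status == 429:
        return "Rate Limit"
    if status in (401, 403):
        # 403 is strictly an authorization failure; grouped here for simplicity.
        return "Auth Fail"
    if status == 404:
        return "Resource Not Found"
    if 500 <= status <= 599:
        return "Server Crash"
    return None
```

For example, `classify_http_status(429)` returns `"Rate Limit"`, while a successful `200` returns `None`.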

Workflow & Task Gaps

Breakdowns in multi-step task execution, orchestration, or memory.

Subcategory | Error Type | Description
Context Loss | Dropped Context | Missed relevant past messages or data.
Context Loss | Overuse | Unnecessary context/tools invoked.
Retrieval Errors | Poor Chunk Match | Retrieved irrelevant or unrelated context.
Retrieval Errors | Wrong Chunk Used | Response based on wrong part of retrieved content.
Retrieval Errors | No Retrieval | Failed to run retrieval when needed.
Task Flow Issues | Goal Drift | Strayed from intended objective.
Task Flow Issues | Step Disorder | Steps executed out of logical order.
Task Flow Issues | Redundant Steps | Repeated same tool or action unnecessarily.
Task Flow Issues | Task Orchestration Failure | Agent failed to plan or interleave actions properly across tools or steps.
Trace Completion | Incomplete Task | No final result or closure.
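
As one concrete reading of Redundant Steps, the sketch below flags a tool call that exactly repeats the call immediately before it. This is a simplified heuristic chosen for illustration, not Error Feed's detection method; the `ToolCall` shape and `find_redundant_steps` are hypothetical.

```python
from typing import NamedTuple


class ToolCall(NamedTuple):
    """A simplified tool invocation (hypothetical shape)."""

    name: str
    args: str  # serialized arguments, e.g. a JSON string


def find_redundant_steps(calls: list[ToolCall]) -> list[int]:
    """Return indices of calls that exactly repeat the immediately preceding call."""
    return [i for i in range(1, len(calls)) if calls[i] == calls[i - 1]]
```

Comparing only adjacent calls keeps the check simple; it deliberately ignores repeats separated by other steps, which may be legitimate retries after new information arrives.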

Reflection Gaps

Failures to engage in introspective reasoning or to revise steps appropriately.

Error Type | Description
Missing CoT | No intermediate thinking steps (Chain of Thought) were used to justify actions.
Missing ReAct Planning | Agent failed to interleave reasoning with action; took action without planning.
Lack of Self-Correction | Agent didn’t revise response or plan after detecting error or contradiction.

How taxonomy categories appear in the UI

On the Overview tab, each detected error shows its taxonomy type as a chip. The Description section says what went wrong in this trace; Root Cause says why; Evidence quotes the relevant spans directly.

On the Feed list, issues are tagged with their primary error type so you can filter by category when you’re hunting a specific class of failure.

Tip

A single trace can trigger errors in multiple categories. A tool failure that causes the agent to hallucinate a fallback answer will register as both a Tool & System Failure and a Thinking & Response Issue.
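
The multi-category case can be sketched by grouping a trace's detected error types under their categories. The mapping below is a partial copy of the tables in this reference, and `group_by_category` is a hypothetical helper, not an Error Feed API.

```python
from collections import defaultdict

# Partial mapping from error type to category, taken from the tables in this reference.
CATEGORY_OF = {
    "Hallucinated Content": "Thinking & Response Issues",
    "Server Crash": "Tool & System Failures",
    "Timeout": "Tool & System Failures",
    "Goal Drift": "Workflow & Task Gaps",
}


def group_by_category(error_types: list[str]) -> dict[str, list[str]]:
    """Group the error types detected on one trace by top-level category."""
    grouped = defaultdict(list)
    for error_type in error_types:
        grouped[CATEGORY_OF[error_type]].append(error_type)
    return dict(grouped)
```

Calling `group_by_category(["Server Crash", "Hallucinated Content"])` yields one entry under Tool & System Failures and one under Thinking & Response Issues, matching the scenario in the tip.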
