Thinking & Response Issues
Mistakes in understanding, reasoning, factual grounding, or output formatting.Subcategory | Error Type | Description |
---|---|---|
Hallucination Errors | Hallucinated Content | Output includes information that is invented or not supported by input data. |
Ungrounded Summary | Summary includes claims not found in the retrieved chunks or original context. | |
Information Processing | Poor Chunk Match | Retrieved irrelevant or unrelated context. |
Wrong Chunk Used | Response based on wrong part of retrieved content. | |
Tool Output Misinterpretation | Misread or misunderstood the output returned by a tool or API. | |
Decision Errors | Wrong Intent | Misunderstood the core user goal or instruction. |
Tool Misuse | Used a tool incorrectly or in the wrong context. | |
Wrong Tool Chosen | Selected an inappropriate tool for the task. | |
Invalid Tool Params | Passed malformed, missing, or incorrect parameters to a tool. | |
Missed Detail | Skipped a key part of the user prompt or prior context. | |
Format & Instruction | Bad Format | Output is not valid JSON, CSV, or code. |
Instruction Adherence | Didn’t follow instruction or style. |
Safety & Security Risks
Any output or behavior that may cause harm, leak personal data, or violate security best practices.Subcategory | Error Type | Description |
---|---|---|
Ethical Violations | Unsafe Advice | Could lead to harm if followed. |
PII Leak | Sensitive personal info exposed in output. | |
Biased Output | Stereotyped, unfair, or discriminatory content. | |
Security Failures | Token Exposure | Secrets, API keys, or auth tokens were exposed in output or logs. |
Insecure API Usage | Used HTTP instead of HTTPS, skipped auth headers, or lacked rate limits. |
Tool & System Failures
Errors due to tool, API, environment, or runtime failures.Subcategory | Error Type | Description |
---|---|---|
Setup Errors | Tool Missing | Tool not registered or available. |
Tool Misconfigured | Tool or API setup is incorrect (e.g., bad schema, invalid registration). | |
Env Incomplete | Missing tokens, secrets, or setup environment variables. | |
Tool/API Failures | Rate Limit | Too many requests hit the limit. |
Auth Fail | Authentication to tool or service failed. | |
Server Crash | Tool/API returned internal error. | |
Resource Not Found | Requested endpoint or resource does not exist or is not reachable. | |
Runtime Limits | Out of Memory | RAM or resource limit breached. |
Timeout | Execution took too long and was halted. |
Workflow & Task Gaps
Breakdowns in multi-step task execution, orchestration, or memory.Subcategory | Error Type | Description |
---|---|---|
Context Loss | Dropped Context | Missed relevant past messages or data. |
Overuse | Unnecessary context/tools invoked. | |
Retrieval Errors | Poor Chunk Match | Retrieved irrelevant or unrelated context. |
Wrong Chunk Used | Response based on wrong part of retrieved content. | |
No Retrieval | Failed to run retrieval when needed. | |
Task Flow Issues | Goal Drift | Strayed from intended objective. |
Step Disorder | Steps executed out of logical order. | |
Redundant Steps | Repeated same tool or action unnecessarily. | |
Task Orchestration Failure | Agent failed to plan or interleave actions properly across tools or steps. | |
Trace Completion | Incomplete Task | No final result or closure. |
Reflection Gaps
Agent failed to engage in introspective reasoning or revise steps appropriately.Error Type | Description |
---|---|
Missing CoT | No intermediate thinking steps (Chain of Thought) were used to justify actions. |
Missing ReAct Planning | Agent failed to interleave reasoning with action; took action without planning. |
Lack of Self-Correction | Agent didn’t revise response or plan after detecting error or contradiction. |