Key concepts

Following are the key concepts to go through as you leverage Agent Compass. You will see these terms being used frequently
  • Recommendation: This is a suggestion from the perspective of implementing a long term and robust fix. The recommendation may not always be the same as an immediate fix. In most of the cases, proceeding with the recommendation would be the best course of action
  • Immediate fix: This suggests a minimal functional fix. This fix may or may not necessarily align with the recommendation
  • Insights: Insights are high level overview of the complete trace execution. They do not change with the currently active taxonomy metric and give a bird’s eye view of what your agent did during execution
  • Description: The description conveys what went wrong during the agentic exection. It also answers what happened in the error
  • Evidence: Evidences are the supporting snippets from the LLM response that was generated during the agentic executions. They can help you uncover edge cases/unforeseen scenarios that might’ve been missed during the development phase
  • Root Causes: Indicates the underlying issue of an error occurence. This helps developers gain a better understanding of their agentic workflows
  • Sampling Rate: This is a user controlled parameter. It refers to what percentage of traces should the compass run on. Based on the sampling rate, the compass picks up traces at random to generate insights. Sampling rate can be configured in two simple steps mentioned below
    Note: The adjusted/updated sampling rate will be applicable for upcoming traces only and not on the currently present or previously added traces
    • Step 1: Click on configure button on the top right corner of the observe screen Agent compass concepts
    • Step 2: Use the slider to adjust the sampling rate according to your needs. Click on update to save Agent compass concepts
  • Scores: The agent performance evaluated on the following 4 metrics and given a score out of 5 to indicate their adherence to the metric. They are as follows
Metric NameDescription
Factual GroundingMeasures how well agent responses are anchored in verifiable evidence from tools, context, or data sources, avoiding hallucinations and ensuring claims are properly supported.
Privacy and SafetyAssesses adherence to security practices and ethical guidelines, identifying risks like PII exposure, credential leaks, unsafe advice, bias, and insecure API usage patterns.
Instruction AdherenceEvaluates how well the agent follows user instructions, formatting requirements, tone specifications, and prompt guidelines while understanding core user intent correctly.
Optimal Plan ExecutionMeasures the agent’s ability to structure multi-step workflows logically, maintaining goal coherence, proper step sequencing, and effective coordination of tools and actions.
  • Spans: The list of affected spans. Each taxonomy metric can have different spans associated with it. You can click on the span to spot it in the trace tree
Agent compass concepts All the errors identified by the compass are grouped together and can be viewed under the Feed tab of the platform. Following is the list of frequently used terms there:
  • Cluster: Mulitple traces can have the same error. All those traces are grouped under a common cluster and shown in a tabular form
  • Trends: The number of times a particular error occured. The cycle of that is referred as trend (example: increasing, decreasing etc.)
Agent Compass List