Groundedness
Assesses whether a response is firmly based on the provided context. This evaluation ensures that the response does not introduce information that is not supported by the context, thereby maintaining factual accuracy and relevance.
Evaluation Using Interface
Input:
- Required Inputs:
  - output: The generated response column from the model.
  - input: The user-provided input column to the model (acting as the source context).
- Configuration Parameters:
  - None specified for this evaluation.
Output:
- Score: Percentage score between 0 and 100
Interpretation:
- Higher scores: Indicate that the output is well-grounded in the input.
- Lower scores: Suggest that the output includes information not present in or supported by the input.
Evaluation Using Python SDK
Click here to learn how to set up evaluation using the Python SDK.
Input Type | Parameter | Type | Description |
---|---|---|---|
Required Inputs | output | string | The generated response from the model. |
Required Inputs | input | string | The user-provided input to the model (acting as the source context). |
Output | Type | Description |
---|---|---|
Score | float | Returns a score between 0 and 1, where higher values indicate better grounding in the input. |
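The interface reports a percentage between 0 and 100, while the SDK returns a float between 0 and 1. The sketch below shows one way to put an SDK score on the interface's scale and apply a cutoff. It assumes the two scales differ only by a factor of 100, which this page does not state explicitly. The function names and the 0.7 threshold are hypothetical and are not part of the SDK.

```python
def to_percentage(sdk_score: float) -> float:
    """Convert a 0-1 SDK score to a 0-100 percentage.

    Assumption: the interface percentage is simply the SDK score times 100.
    """
    if not 0.0 <= sdk_score <= 1.0:
        raise ValueError(f"expected a score between 0 and 1, got {sdk_score}")
    return round(sdk_score * 100, 2)


def is_grounded(sdk_score: float, threshold: float = 0.7) -> bool:
    """Apply a hypothetical cutoff; 0.7 is an illustrative value, not a recommendation."""
    return sdk_score >= threshold


if __name__ == "__main__":
    score = 0.85  # example value standing in for a score returned by the evaluation
    print(to_percentage(score), is_grounded(score))
```

Choose the threshold from your own data, for example by checking which scores correspond to responses you judge to be acceptably grounded.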
What to do when Groundedness Evaluation Fails
If the evaluation fails, work through two checks. Context Review: reassess the provided context for completeness and clarity, ensuring it includes all the information needed to support the response. Response Analysis: examine the response for any elements not supported by the context, and adjust it to align with the given information.
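As a rough aid for the response analysis step, the sketch below flags response sentences whose words mostly do not appear in the context. It is a naive word-overlap heuristic for manual triage, not the method the Groundedness evaluation uses. The function name and the `min_overlap` parameter are hypothetical.

```python
import re


def unsupported_sentences(output: str, context: str, min_overlap: float = 0.5) -> list[str]:
    """Return sentences from `output` whose word overlap with `context` is below `min_overlap`.

    Naive heuristic: it compares lowercase word tokens only and ignores meaning,
    so paraphrases may be flagged and reworded errors may be missed.
    """
    context_words = set(re.findall(r"[a-z0-9]+", context.lower()))
    flagged = []
    # Split on whitespace that follows sentence-ending punctuation.
    for sentence in re.split(r"(?<=[.!?])\s+", output.strip()):
        words = re.findall(r"[a-z0-9]+", sentence.lower())
        if not words:
            continue
        overlap = sum(w in context_words for w in words) / len(words)
        if overlap < min_overlap:
            flagged.append(sentence)
    return flagged


if __name__ == "__main__":
    context = "The Eiffel Tower is in Paris. It was completed in 1889."
    output = "The Eiffel Tower is in Paris. Gustave designed many bridges abroad."
    for sentence in unsupported_sentences(output, context):
        print("Possibly unsupported:", sentence)
```

Flagged sentences are candidates for review only. A human reader should confirm whether each one is actually unsupported by the context.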
Differentiating Groundedness from Context Adherence
While both evaluations assess context alignment, Groundedness ensures that the response is strictly based on the provided context, whereas Context Adherence measures how well the response stays within the context without introducing external information. Both evaluations require a response and context as inputs and produce a Pass/Fail output based on adherence to the provided information.