Input | |||
---|---|---|---|
Required Input | Type | Description | |
input | string | User query to the model | |
output | string | Model’s response to the user query |
Output | ||
---|---|---|
Field | Description | |
Result | Returns Passed if the response is helpful, or Failed if it’s not | |
Reason | Provides a detailed explanation of the evaluation |
Troubleshooting
If you encounter issues with this evaluation:- Ensure that both the
input
(user query) andoutput
(AI response) parameters are provided - The helpfulness evaluation works best when the context of the request is clear
- If evaluating complex responses, make sure the entire response is included
- Consider combining with other evaluations like
completeness
orfactual-accuracy
for more comprehensive assessment
Related Evaluations
- **completeness: Determines if the response addresses all aspects of the query
- **task-completion: Checks if a specific requested task was accomplished
- **instruction-adherence: Evaluates if the response follows specific instructions
- **is-concise: Assesses whether the response avoids unnecessary verbosity