Is Helpful
Evaluates whether the response is helpful in solving the user problem or answering their question
result = evaluator.evaluate(
eval_templates="is_helpful",
inputs={
"input": "Why doesn't honey go bad?",
"output": "Honey doesn't spoil because its low moisture and high acidity prevent the growth of bacteria and other microbes."
},
model_name="turing_flash"
)
print(result.eval_results[0].output)
print(result.eval_results[0].reason)import { Evaluator, Templates } from "@future-agi/ai-evaluation";
const evaluator = new Evaluator();
const result = await evaluator.evaluate(
"is_helpful",
{
input: "Why doesn't honey go bad?",
output: "Honey doesn't spoil because its low moisture and high acidity prevent the growth of bacteria and other microbes."
},
{
modelName: "turing_flash",
}
);
console.log(result); | Input | |||
|---|---|---|---|
| Required Input | Type | Description | |
input | string | User query to the model | |
output | string | Model’s response to the user query |
| Output | ||
|---|---|---|
| Field | Description | |
| Result | Returns Passed if the response is helpful, or Failed if it’s not | |
| Reason | Provides a detailed explanation of the evaluation |
What to Do When Is Helpful Score is Low
- Ensure that both the
input(user query) andoutput(AI response) parameters are provided - The helpfulness evaluation works best when the context of the request is clear
- If evaluating complex responses, make sure the entire response is included
- Consider combining with other evaluations like
completenessfor more comprehensive assessment
Comparing Is Helpful with Similar Evals
- Completeness: Is Helpful evaluates whether the response is useful overall, while Completeness checks if all aspects of the query are addressed.
- Task Completion: Is Helpful assesses general usefulness, while Task Completion checks if a specific requested task was accomplished.
- Instruction Adherence: Is Helpful evaluates usefulness, while Instruction Adherence evaluates if the response follows specific instructions.
- Is Concise: Is Helpful focuses on effectiveness, while Is Concise assesses whether the response avoids unnecessary verbosity.
Was this page helpful?