Is Helpful: Response Helpfulness Evaluation Metric

Evaluates whether a response helps solve the user's problem or answer their question, assessing its overall utility and relevance.

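# Assumes `evaluator` is an already-initialized Evaluator client from the SDK
# (client setup is not shown in this section).
result = evaluator.evaluate(
    eval_templates="is_helpful",
    inputs={
        "input": "Why doesn't honey go bad?",
        "output": "Honey doesn't spoil because its low moisture and high acidity prevent the growth of bacteria and other microbes."
    },
    model_name="turing_flash"
)

# The verdict, followed by the explanation behind it
print(result.eval_results[0].output)
print(result.eval_results[0].reason)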

import { Evaluator } from "@future-agi/ai-evaluation";

const evaluator = new Evaluator();

const result = await evaluator.evaluate(
  "is_helpful",
  {
    input: "Why doesn't honey go bad?",
    output: "Honey doesn't spoil because its low moisture and high acidity prevent the growth of bacteria and other microbes."
  },
  {
    modelName: "turing_flash",
  }
);

console.log(result);


| Required Input | Type | Description |
| --- | --- | --- |
| `input` | `string` | User query to the model |
| `output` | `string` | Model’s response to the user query |

Output
| Field | Description |
| --- | --- |
| Result | Returns Passed if the response is helpful, or Failed if it’s not |
| Reason | Provides a detailed explanation of the evaluation |
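
A minimal sketch of how these two fields might be consumed in Python. It assumes, as in the snippet earlier on this page, that each entry of `result.eval_results` exposes `output` and `reason`, and that `output` carries the string `Passed` or `Failed`. The `summarize` helper and the `SimpleNamespace` stand-in are hypothetical and are not part of the SDK.

```python
from types import SimpleNamespace


def summarize(eval_result):
    """Return (passed, message) for one entry of result.eval_results.

    Assumption: `output` holds the string "Passed" or "Failed" and
    `reason` holds the explanation, as described in the Output table.
    """
    verdict = str(eval_result.output).strip().lower()
    passed = verdict == "passed"
    label = "PASS" if passed else "FAIL"
    message = f"is_helpful: {label} - {eval_result.reason}"
    return passed, message


# Stand-in object so the sketch runs without calling the evaluation service.
fake = SimpleNamespace(output="Passed", reason="Directly answers the question.")
passed, message = summarize(fake)
print(message)
```

With a real client, the same helper would be called as `summarize(result.eval_results[0])`.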

What to Do When the Is Helpful Score Is Low

Ensure that both the `input` (user query) and `output` (AI response) parameters are provided.
Make the context of the request clear in the input; the helpfulness evaluation works best when it can tell what the user was asking for.
When evaluating long or complex responses, include the entire response rather than an excerpt.
Consider combining Is Helpful with other evaluations, such as Completeness, for a more comprehensive assessment.
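
The first and third checks above can be automated before the evaluation is ever called. The following is a sketch only: `check_inputs` is a hypothetical helper, not an SDK function, and it assumes the `inputs` dictionary has the same shape as in the Python example on this page.

```python
def check_inputs(inputs):
    """Return a list of problems found in an is_helpful inputs dict.

    An empty list means both required fields are present and non-blank.
    """
    problems = []
    for key in ("input", "output"):
        value = inputs.get(key)
        if not isinstance(value, str):
            problems.append(f"'{key}' is missing or not a string")
        elif not value.strip():
            problems.append(f"'{key}' is empty")
    return problems


# Example: the response was accidentally dropped before evaluation.
inputs = {"input": "Why doesn't honey go bad?", "output": ""}
issues = check_inputs(inputs)
print(issues)
```

Running a check like this first helps separate a genuinely unhelpful response from a failure caused by missing or truncated data.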

Comparing Is Helpful with Similar Evals

Completeness: Is Helpful evaluates whether the response is useful overall, while Completeness checks if all aspects of the query are addressed.
Task Completion: Is Helpful assesses general usefulness, while Task Completion checks if a specific requested task was accomplished.
Instruction Adherence: Is Helpful evaluates usefulness, while Instruction Adherence checks whether the response follows the specific instructions it was given.
Is Concise: Is Helpful focuses on effectiveness, while Is Concise assesses whether the response avoids unnecessary verbosity.
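
Because these evals measure different things, it can help to view their verdicts side by side. The sketch below uses hand-written verdicts keyed by the display names above; in practice each verdict would come from its own evaluation call. It is an assumption that the other evals also report `Passed` or `Failed`, and `failing_evals` is a hypothetical helper, not an SDK function.

```python
def failing_evals(verdicts):
    """Return the sorted names of evals whose verdict is not "Passed"."""
    return sorted(
        name
        for name, verdict in verdicts.items()
        if verdict.strip().lower() != "passed"
    )


# Hand-written example verdicts (assumed format: "Passed" or "Failed").
verdicts = {
    "Is Helpful": "Passed",
    "Completeness": "Failed",
    "Is Concise": "Passed",
}
failed = failing_evals(verdicts)
print(failed)
```

A response that passes Is Helpful but fails Completeness, as in this example, is useful overall yet leaves part of the query unaddressed.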
