Is Harmful Advice

Evaluates whether content contains guidance, recommendations, or instructions that could lead to harm if followed.

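# Python
# Assumes `evaluator` is an already-initialized Evaluator client from the SDK.
result = evaluator.evaluate(
    eval_templates="is_harmful_advice",
    inputs={
        "output": "It's a good idea to create a monthly budget to track your spending and save more effectively."
    },
    model_name="turing_flash"
)

# The first entry holds the verdict and the explanation for it.
print(result.eval_results[0].output)
print(result.eval_results[0].reason)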

// TypeScript
import { Evaluator, Templates } from "@future-agi/ai-evaluation";

const evaluator = new Evaluator();

const result = await evaluator.evaluate(
  "is_harmful_advice",
  {
    output: "It's a good idea to create a monthly budget to track your spending and save more effectively."
  },
  {
    modelName: "turing_flash",
  }
);

console.log(result);
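
Input

| Required Input | Type   | Description                                            |
|----------------|--------|--------------------------------------------------------|
| output         | string | Content to be evaluated for potentially harmful advice. |

Output

| Field  | Description                                                                                                         |
|--------|---------------------------------------------------------------------------------------------------------------------|
| Result | Returns Passed if no harmful advice is detected, or Failed if harmful advice is detected.                           |
| Reason | Provides a detailed explanation of why the content was classified as containing or not containing harmful advice.   |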
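
The Result and Reason fields can be read programmatically to gate content. The following is a minimal sketch, not part of the SDK: `summarize` is a hypothetical helper, and it assumes the result shape shown in the Python example (`result.eval_results[0].output` and `.reason`) and that the output is the string "Passed" or "Failed". A stand-in object replaces the real evaluator response so the snippet runs offline.

```python
from types import SimpleNamespace


def summarize(result):
    """Return (passed, reason) for the first eval result.

    Hypothetical helper. Assumes `result.eval_results[0]` exposes
    `.output` ("Passed" or "Failed") and `.reason`.
    """
    first = result.eval_results[0]
    passed = str(first.output).strip().lower() == "passed"
    return passed, first.reason


# Offline stand-in for a real evaluator response (made-up values).
fake_result = SimpleNamespace(
    eval_results=[
        SimpleNamespace(
            output="Failed",
            reason="Recommends stopping prescribed medication without medical advice.",
        )
    ]
)

passed, reason = summarize(fake_result)
if not passed:
    print(f"Blocked: {reason}")
```

In real use, the object passed to `summarize` would be the value returned by `evaluator.evaluate(...)`.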

What to Do If You Get Undesired Results

If the content is flagged as containing harmful advice (Failed) and you want to improve it:

  • Remove recommendations that could lead to physical harm or danger
  • Eliminate advice that might result in financial losses or legal problems
  • Avoid guidance that could damage relationships or cause social harm
  • Replace potentially harmful recommendations with safer alternatives
  • Include appropriate disclaimers and warnings where relevant
  • Consider adding context about when advice might not be appropriate
  • Consult subject matter experts for sensitive topics
  • Focus on well-established, evidence-based advice for health, finance, and safety topics
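
After revising flagged content along the lines above, it can help to re-run the eval on each revision and keep the first one that passes. The sketch below is one possible way to do that. `first_passing` and `stub_evaluate` are hypothetical names, and the stub is a keyword check used only so the example runs offline; it does not reflect how the eval itself decides.

```python
from typing import Callable, Iterable, Optional


def first_passing(
    candidates: Iterable[str], evaluate: Callable[[str], str]
) -> Optional[str]:
    """Return the first candidate whose verdict is "Passed", else None."""
    for text in candidates:
        if evaluate(text) == "Passed":
            return text
    return None


def stub_evaluate(text: str) -> str:
    # Stand-in for a real eval call (assumption, offline only):
    # flags one hard-coded phrase and passes everything else.
    return "Failed" if "all your savings" in text.lower() else "Passed"


drafts = [
    "Put all your savings into a single stock to get rich quickly.",
    "Consider spreading investments across asset types, and consult a "
    "licensed financial advisor before making large decisions.",
]

chosen = first_passing(drafts, stub_evaluate)
print(chosen)
```

To use the real eval, `evaluate` could wrap the call from the Python example and return `result.eval_results[0].output` for the given text.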

Comparing Is Harmful Advice with Similar Evals

  • No Harmful Therapeutic Guidance: Is Harmful Advice evaluates a broad range of potentially harmful guidance, while No Harmful Therapeutic Guidance specifically focuses on inappropriate medical or mental health recommendations.
  • Toxicity: Is Harmful Advice specifically evaluates recommendations that could lead to harm, while Toxicity detects harmful or offensive language more broadly.
  • Data Privacy Compliance: Is Harmful Advice focuses on potentially dangerous recommendations, while Data Privacy Compliance checks output for adherence to privacy regulations such as GDPR and HIPAA.