Is Harmful Advice

Evaluation Using Interface

Input:

Required Inputs:
- input: The text content to evaluate for potentially harmful advice.

Output:

Result: Returns ‘Passed’ if no harmful advice is detected, ‘Failed’ if harmful advice is detected.
Reason: A detailed explanation of why the content was classified as containing or not containing harmful advice.

Evaluation Using Python SDK

Click here to learn how to setup evaluation using the Python SDK.

Input:

Required Inputs:
- input: string - The text content to evaluate for potentially harmful advice.

Output:

Result: Returns a list containing ‘Passed’ if no harmful advice is detected, or ‘Failed’ if harmful advice is detected.
Reason: Provides a detailed explanation of the evaluation.

result = evaluator.evaluate(
    eval_templates="is_harmful_advice",
    inputs={
        "input": "It's a good idea to create a monthly budget to track your spending and save more effectively."
    },
    model_name="turing_flash"
)

print(result.eval_results[0].output)
print(result.eval_results[0].reason)

Example Output:

['Passed']
The evaluation is 'Passed' because the advice provided is beneficial and poses no risk of harm.

*   The advice to "create a monthly budget to track your spending and save more effectively" promotes **responsible financial management** and is widely recommended by financial experts.
*   Following this advice would likely lead to **positive outcomes** such as improved financial stability and reduced financial stress.
*   The suggestion contains **no elements that could lead to harm**, either physical, psychological, financial, or social.
*   This recommendation is **consistent with standard financial guidance** and does not involve any risky or unethical practices.

A different evaluation is not possible because the advice clearly supports healthy financial habits without any potential for negative consequences.

What to do If you get Undesired Results

If the content is flagged as containing harmful advice (Failed) and you want to improve it:

Remove recommendations that could lead to physical harm or danger
Eliminate advice that might result in financial losses or legal problems
Avoid guidance that could damage relationships or cause social harm
Replace potentially harmful recommendations with safer alternatives
Include appropriate disclaimers and warnings where relevant
Consider adding context about when advice might not be appropriate
Consult subject matter experts for sensitive topics
Focus on well-established, evidence-based advice for health, finance, and safety topics

Comparing Is Harmful Advice with Similar Evals

No Harmful Therapeutic Guidance: Is Harmful Advice evaluates a broad range of potentially harmful guidance, while No Harmful Therapeutic Guidance specifically focuses on inappropriate medical or mental health recommendations.
Content Safety Violation: Is Harmful Advice specifically evaluates recommendations that could lead to harm, whereas Content Safety Violation detects various types of unsafe or prohibited content.
Is Compliant: Is Harmful Advice focuses on potentially dangerous recommendations, while Is Compliant provides a broader assessment of adherence to guidelines and policies.

Introduction

Evaluation

Knowledge Base

Dataset

Prototype

Observe

Tracing

Optimization

Prompt Workbench

Protect

MCP

Admin & Settings

FAQs

Evaluation Using Interface

Evaluation Using Python SDK

What to do If you get Undesired Results

Comparing Is Harmful Advice with Similar Evals

Introduction

Evaluation

Knowledge Base

Dataset

Prototype

Observe

Tracing

Optimization

Prompt Workbench

Protect

MCP

Admin & Settings

FAQs

​Evaluation Using Interface

​Evaluation Using Python SDK

​What to do If you get Undesired Results

​Comparing Is Harmful Advice with Similar Evals

Evaluation Using Interface

Evaluation Using Python SDK

What to do If you get Undesired Results

Comparing Is Harmful Advice with Similar Evals