Built-in Evals
Overview
Answer Refusal
Audio Quality
Audio Transcription
Bias Detection
BLEU
Caption Hallucination
Chunk Attribution
Chunk Utilization
Clinically Inappropriate Tone
Completeness
Content Moderation
Content Safety Violation
Context Adherence
Context Relevance
Conversation Coherence
Conversation Resolution
Cultural Sensitivity
Data Privacy
Detect Hallucination
Embedding Similarity
Eval Ranking
Factual Accuracy
Fuzzy Match
Groundedness
Instruction Adherence
Is Compliant
Is Concise
Is Email
Is Factually Consistent
Is Good Summary
Is Harmful Advice
Is Helpful
Is Informal Tone
Is JSON
Is Polite
Levenshtein Similarity
Length Evals
LLM Function Calling
No Age Bias
No Apologies
No Gender Bias
No Harmful Therapeutic Guidance
No OpenAI Reference
No Racial Bias
Numeric Similarity
PII
Prompt Injection
Recall Score
ROUGE
Semantic List Contains
Sexist
Summary Quality
Synthetic Image Evaluator
Task Completion
Text-to-SQL
Tone
Toxicity
Translation Accuracy
Valid Links