Evaluation Using SDK
Troubleshooting
If you encounter issues with this evaluation:- Ensure that both input texts are properly formatted and contain meaningful content
- This evaluation works best with texts that convey similar information but might have different wording
- For very short texts (1-2 words), results may be less reliable
- If you need more precise matching, consider using
levenshtein_similarity
instead
Related Evaluations
- levenshtein_similarity: Provides a more strict character-by-character comparison
- embedding_similarity: Compares semantic meaning rather than surface-level text
- semantic_list_contains: Checks if specific semantic concepts are present in both texts
- rouge_score: Evaluates based on n-gram overlap, especially useful for summarization tasks