Definition

Assesses the accuracy of a summary by comparing it to the original document. This evaluation ensures that the summary accurately reflects the main points and details of the original content without introducing errors or omissions.

A Passed evaluation indicates that the summary accurately represents the original document, while a Failed evaluation suggests discrepancies or inaccuracies in the summary.


Calculation

The evaluation process begins with Configuration Setup, where the original document and the summary to be evaluated are defined, and an appropriate language model is selected. The system then compares the summary against the original document to assess its accuracy, identifying any discrepancies, omissions, or errors.

Eval returns a Pass/Fail output is generated based on how accurately the summary reflects the original document.


What to do when Summarization Accuracy Evaluation Fails

If the evaluation fails, the Document Review should reassess the original document for clarity and completeness, ensuring it provides all necessary information for accurate summarization. In Summary Analysis, the summary should be examined for inaccuracies or missing elements, and refinements should be made to improve alignment with the original document.


Differentiating Summarization Accuracy with Summary Quality

Summarization Accuracy focuses specifically on the factual correctness of a summary compared to the original document, ensuring that no information is misrepresented. While Summary Quality takes a broader approach, evaluating the overall effectiveness of a summary, including completeness, relevance, and conciseness.

In terms of inputs, Summarization Accuracy requires only a document and a summary, whereas Summary Quality also incorporates input, output, and context for a more comprehensive assessment.

The output of Summarization Accuracy provides a Pass/Fail result based on factual correctness, while Summary Quality assigns a score reflecting various quality metrics.