Factuality metrics for QA and summarization in LLM evaluations assess how accurately a language model’s responses or summaries reflect the original source content or established facts. These metrics are crucial for ensuring that generated answers or summaries do not introduce misinformation or hallucinations. They typically involve automated tools or human judgment to compare outputs against references, measuring truthfulness, consistency, and alignment with factual data, thus supporting the reliability of AI-generated content.
What are factuality metrics in QA and summarization?
Metrics that measure whether generated answers or summaries faithfully reflect facts from the source, not just whether the text sounds correct.
What does a FEVER-style metric evaluate?
Whether a claim is supported, contradicted, or not supported by evidence from a reference corpus (e.g., Wikipedia).
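The three-way FEVER verdict can be sketched as follows. Real FEVER systems use evidence retrieval plus a trained natural language inference model; the lexical-overlap matching, the 0.5 threshold, and the negation heuristic below are simplifying assumptions for illustration only.

```python
def fever_style_verdict(claim: str, evidence_sentences: list[str]) -> str:
    """Label a claim SUPPORTED, REFUTED, or NOT ENOUGH INFO
    against a small evidence corpus (toy lexical version)."""
    claim_tokens = set(claim.lower().split())
    best_overlap, best_sentence = 0.0, ""
    for sent in evidence_sentences:
        sent_tokens = set(sent.lower().split())
        overlap = len(claim_tokens & sent_tokens) / max(len(claim_tokens), 1)
        if overlap > best_overlap:
            best_overlap, best_sentence = overlap, sent
    if best_overlap < 0.5:
        return "NOT ENOUGH INFO"
    # Crude contradiction check: negation present on one side only.
    negations = {"not", "no", "never"}
    claim_neg = bool(claim_tokens & negations)
    evid_neg = bool(set(best_sentence.lower().split()) & negations)
    return "REFUTED" if claim_neg != evid_neg else "SUPPORTED"
```

A claim that closely matches an evidence sentence comes back SUPPORTED; the same claim with a negation inserted flips to REFUTED; a claim with no matching evidence gets NOT ENOUGH INFO.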
What is QuestEval and how does it assess factuality?
An automatic metric that turns parts of the output into questions and checks if those questions can be answered correctly from the source; higher scores indicate better factuality.
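The question-generation-then-answering loop can be sketched under heavy simplification: here the questions and expected answers are supplied by hand (real QuestEval generates them with a trained QG model) and "answering from the source" is plain substring matching rather than a QA model.

```python
def qg_qa_factuality(qa_pairs: list[tuple[str, str]], source: str) -> float:
    """Fraction of expected answers (derived from the summary) that are
    recoverable from the source text; higher means more faithful."""
    if not qa_pairs:
        return 0.0
    source_lower = source.lower()
    answered = sum(1 for _question, answer in qa_pairs
                   if answer.lower() in source_lower)
    return answered / len(qa_pairs)
```

For example, if a summary of an article about the Eiffel Tower yields three question/answer pairs and only two answers can be found in the source, the score is 2/3, signaling a likely hallucinated detail.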
What is FactCC?
A classifier-based method that checks whether a generated sentence is factually consistent with the source paragraph, flagging mismatches.
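FactCC itself trains a BERT-based classifier on (source, claim) pairs; as a stand-in, this sketch scores sentence-level consistency with word overlap and a tunable threshold. The threshold value is an illustrative assumption, not a parameter from the FactCC paper.

```python
def consistent_with_source(sentence: str, source: str,
                           threshold: float = 0.6) -> bool:
    """Judge a sentence consistent if enough of its words appear
    in the source (toy stand-in for a trained classifier)."""
    sent_tokens = set(sentence.lower().split())
    if not sent_tokens:
        return False
    source_tokens = set(source.lower().split())
    coverage = len(sent_tokens & source_tokens) / len(sent_tokens)
    return coverage >= threshold

def flag_inconsistent(summary_sentences: list[str], source: str) -> list[str]:
    # Return the sentences judged inconsistent, mirroring how a
    # FactCC-style checker flags mismatched spans.
    return [s for s in summary_sentences if not consistent_with_source(s, source)]
```

Sentences copied or closely paraphrased from the source pass; sentences introducing unsupported content get flagged for review.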
Why is factuality important in summarization?
A faithful summary preserves true information from the source; factual errors can mislead readers and erode trust.