"Faithfulness vs Extractiveness Trade-offs in Summarization" refers to the balance between generating summaries that accurately reflect the source content (faithfulness) and those that closely reuse or copy phrases from the original text (extractiveness). In LLM evaluations, this trade-off is crucial: highly extractive summaries may avoid factual errors but lack abstraction, while more abstractive summaries may introduce inaccuracies. Effective evaluation seeks optimal summaries that are both faithful and appropriately abstractive.
"Faithfulness vs Extractiveness Trade-offs in Summarization" refers to the balance between generating summaries that accurately reflect the source content (faithfulness) and those that closely reuse or copy phrases from the original text (extractiveness). In LLM evaluations, this trade-off is crucial: highly extractive summaries may avoid factual errors but lack abstraction, while more abstractive summaries may introduce inaccuracies. Effective evaluation seeks optimal summaries that are both faithful and appropriately abstractive.
What does faithfulness mean in summarization?
Faithfulness is how accurately a summary preserves the original content's facts, meaning, and intent without introducing errors or misinterpretations.
What does extractiveness mean in summarization?
Extractiveness refers to how much of the summary uses exact wording from the source. Highly extractive summaries copy phrases; abstractive ones paraphrase or synthesize new wording.
What is the trade-off between faithfulness and extractiveness?
More extractiveness can preserve precise wording but may be less concise; less extractiveness (more paraphrase) risks losing nuance or introducing errors if not careful. The goal is to stay faithful while being concise.
How can faithfulness and extractiveness be evaluated?
Faithfulness is checked by factual accuracy against the source (human judgments or factuality metrics). Extractiveness is measured by how much content is copied from the source (n-gram overlap, copied passages).
What are practical strategies to balance these aspects?
Focus on preserving key facts, paraphrase for clarity, prefer concise wording, and verify factual accuracy after generation; combine extractive selection with abstractive rewriting and post-editing.