Question 1

What does multi-hop evaluation mean in QA tasks?

Accepted Answer

It means solving questions by chaining information from multiple sources or steps, not relying on a single fact.

Question 2

What is a HotpotQA-style protocol?

Accepted Answer

A QA setup where questions require reasoning across several documents; models must produce the final answer and identify supporting facts that justify it.

Question 3

What are 'supporting facts'?

Accepted Answer

Specific sentences or passages that provide evidence for the answer; in HotpotQA-style tasks, you typically identify them as part of the evaluation.

Question 4

How is performance measured in this style of QA?

Accepted Answer

By how accurately the answer is produced (e.g., exact match or F1) and how well the model selects supporting facts (precision/recall or F1 against gold evidence).

Question 5

What are common challenges in multi-hop QA?

Accepted Answer

Locating relevant information across multiple sources, avoiding distractors, and ensuring a coherent reasoning chain to justify the final answer.

Multi-Hop Evaluation with HotpotQA-Style Protocols

💡 Key Takeaways

❓ Frequently Asked Questions

You may also like

Encoder-Decoder Fusion (FiD) Fundamentals

Sparse vs Dense vs Hybrid Retrieval

Passage Scoring Features and Signals

You may also like

Encoder-Decoder Fusion (FiD) Fundamentals

Sparse vs Dense vs Hybrid Retrieval

Passage Scoring Features and Signals