Question 1

What does 'Multi-Vector Representations per Document' mean?

Accepted Answer

It means representing a single document with several vectors, each capturing a different aspect of it (e.g., meaning, topics, structure), rather than with a single embedding.

Question 2

Why use multiple vectors instead of one embedding?

Accepted Answer

Multiple vectors capture nuances that a single embedding can miss, which improves tasks such as search, ranking, and question answering over the document.

Question 3

What are common types of vectors used for a document?

Accepted Answer

Common types include semantic embeddings, topic vectors, lexical (surface-form) vectors, structural vectors (e.g., one per section), and contextual vectors that capture how the document relates to other documents.
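
As a rough illustration of the answer above, one way to hold several vector types for a single document is a small container keyed by vector kind. The class name `MultiVectorDocument`, the kind labels, and the vector values are all hypothetical and chosen only for this sketch; in practice an embedding model, topic model, or similar would produce the vectors.

```python
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass
class MultiVectorDocument:
    """Hypothetical container: one document, several named vectors."""
    doc_id: str
    vectors: Dict[str, List[float]] = field(default_factory=dict)

    def add(self, kind: str, vector: List[float]) -> None:
        # Store a copy so later changes to the caller's list do not leak in.
        self.vectors[kind] = list(vector)

    def kinds(self) -> List[str]:
        # Names of the vector types stored for this document, sorted.
        return sorted(self.vectors)


# Placeholder values; vector types may have different dimensions.
doc = MultiVectorDocument("doc-1")
doc.add("semantic", [0.12, 0.80, 0.05])
doc.add("topic", [0.7, 0.3])
doc.add("structural:intro", [0.4, 0.1, 0.9])
```

Keying by kind keeps each vector in its own space, which matters later when similarities are computed per type rather than across types.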

Question 4

How are multiple vectors used together for retrieval or comparison?

Accepted Answer

They are fused in one of three ways: early fusion (concatenating the vectors before comparison), late fusion (computing a similarity score per vector type and combining the scores), or attention-based methods that weight each vector type by its relevance.
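
The difference between early and late fusion can be sketched in a few lines. This is a minimal example under stated assumptions: cosine similarity as the comparison, a weighted average as the late-fusion combiner, and made-up vectors and weights. The helper names `cosine`, `early_fusion`, and `late_fusion` are hypothetical, and attention-based weighting is not shown.

```python
import math


def cosine(a, b):
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def early_fusion(vectors, order):
    """Concatenate the named vectors in a fixed order into one long vector."""
    fused = []
    for name in order:
        fused.extend(vectors[name])
    return fused


def late_fusion(query, doc, weights):
    """Weighted average of per-type cosine similarities."""
    total = sum(weights.values())
    return sum(w * cosine(query[name], doc[name])
               for name, w in weights.items()) / total


# Toy vectors: the semantic vectors agree, the topic vectors are orthogonal.
query = {"semantic": [1.0, 0.0], "topic": [0.0, 1.0]}
doc = {"semantic": [1.0, 0.0], "topic": [1.0, 0.0]}
order = ["semantic", "topic"]
weights = {"semantic": 0.75, "topic": 0.25}  # assumed weights, not tuned

early_score = cosine(early_fusion(query, order), early_fusion(doc, order))
late_score = late_fusion(query, doc, weights)
print(f"early fusion: {early_score:.2f}, late fusion: {late_score:.2f}")
```

With these toy inputs, early fusion treats every dimension equally, while late fusion lets the semantic match dominate because of its higher weight. Which behavior is preferable depends on the task.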

Question 5

What are practical challenges of using multi-vector representations?

Accepted Answer

The main challenges are increased storage and computation, choosing meaningful vector types, aligning the vector spaces, and fusing the vectors effectively without introducing noise or redundancy.
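
To make the storage cost concrete, a back-of-the-envelope estimate helps. The figures below (one million documents, 768-dimensional float32 vectors, five vectors per document) are illustrative assumptions rather than measurements, and the helper `storage_bytes` is hypothetical. Index overhead and compression are ignored.

```python
def storage_bytes(num_docs, vectors_per_doc, dim, bytes_per_value=4):
    """Raw storage for dense vectors, assuming float32 (4 bytes) by default."""
    return num_docs * vectors_per_doc * dim * bytes_per_value


# Assumed corpus size and dimensionality, for illustration only.
single = storage_bytes(1_000_000, 1, 768)
multi = storage_bytes(1_000_000, 5, 768)

print(f"single vector: {single / 1e9:.2f} GB")
print(f"five vectors:  {multi / 1e9:.2f} GB")
```

Raw storage grows linearly with the number of vectors per document, and so does the number of similarity computations per candidate under late fusion.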

Multi-Vector Representations per Document
