Question 1

What is cosine similarity?

Accepted Answer

Cosine similarity measures how close two vectors point in the same direction, ignoring their magnitudes. It equals (a · b) / (||a|| · ||b||) and ranges from -1 (opposite) to 1 (same direction).

Question 2

How do you compute cosine similarity between two vectors?

Accepted Answer

Compute cos_sim = (a · b) / (||a|| × ||b||). If either vector is zero, the value is undefined; in practice, handle this case (e.g., set to 0 or skip).

Question 3

What is the dot product and how does it relate to similarity?

Accepted Answer

The dot product a · b = sum of a_i × b_i measures alignment. Larger values indicate more alignment when magnitudes are similar, and it is a component used in computing cosine similarity.

Question 4

What is L2 distance and how is it used for similarity?

Accepted Answer

L2 distance (Euclidean distance) is sqrt(sum (a_i − b_i)²). Smaller distances mean greater similarity. Unlike cosine, it depends on vector magnitudes; you can convert it to a similarity score with transformations like 1/(1+distance).

Question 5

When should you choose cosine vs L2 vs dot product?

Accepted Answer

Choose cosine when only the direction of vectors matters (e.g., text embeddings). Use L2 distance when magnitude differences are meaningful. Dot product is fast and useful in linear models, but is sensitive to vector length unless vectors are normalized.

Similarity Metrics: Cosine, Dot, L2+50

Similarity Metrics: Cosine, Dot, L2
+50

💡 Key Takeaways

❓ Frequently Asked Questions