Latency optimization and caching strategies in agent architecture involve techniques to reduce response times and improve system efficiency. By minimizing data retrieval delays and storing frequently accessed information in cache, agents can deliver faster results and handle higher loads. Effective strategies include prefetching, cache invalidation, and intelligent data placement, ensuring that agents access up-to-date, relevant data quickly. Together, these approaches enhance overall performance and scalability in distributed agent-based systems.
What is latency in computing?
The time from when a request is issued to when the first response is received, encompassing network travel and processing time.
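The request-to-first-response interval above is straightforward to measure. A minimal sketch, assuming a simulated backend (`slow_backend` is a stand-in for any real network or database call):

```python
import time

def slow_backend(key):
    """Stand-in for an origin call: sleeps to simulate network + processing delay."""
    time.sleep(0.05)
    return f"value-for-{key}"

def measure_latency(fn, *args):
    """Time a call from issue to completed response using a monotonic clock."""
    start = time.perf_counter()
    result = fn(*args)
    elapsed = time.perf_counter() - start
    return result, elapsed
```

`time.perf_counter()` is used rather than `time.time()` because it is monotonic and high-resolution, which matters when timing short requests.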
How does caching reduce latency?
By keeping data closer to users (in memory, on fast storage, or at the edge/CDN) so future requests are served without a full retrieval from the origin.
What is a cache hit vs a cache miss?
A hit means the requested data is found in the cache and served quickly; a miss means it isn’t, so the data is fetched from the origin and then cached.
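The hit/miss flow above is the classic cache-aside pattern. A minimal sketch, where `slow_origin_fetch` is a hypothetical stand-in for the origin (database, API, etc.):

```python
import time

cache = {}  # in-memory cache: key -> value

def slow_origin_fetch(key):
    """Stand-in for a slow origin lookup."""
    time.sleep(0.05)
    return key.upper()

def get(key):
    if key in cache:
        return cache[key], "hit"    # cache hit: served from memory, no origin call
    value = slow_origin_fetch(key)  # cache miss: fall through to the origin
    cache[key] = value              # populate the cache for future requests
    return value, "miss"
```

The first call for a given key pays the full origin latency (a miss); subsequent calls for the same key are served from memory (hits).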
What is an eviction policy like LRU?
LRU (Least Recently Used) removes the least recently accessed item when the cache is full to make room for new data.
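LRU can be implemented compactly with an ordered map that tracks access recency. A sketch using Python's `collections.OrderedDict` (capacity and class name are illustrative choices, not a standard API):

```python
from collections import OrderedDict

class LRUCache:
    def __init__(self, capacity):
        self.capacity = capacity
        self._data = OrderedDict()  # insertion/access order tracks recency

    def get(self, key, default=None):
        if key not in self._data:
            return default
        self._data.move_to_end(key)  # mark as most recently used
        return self._data[key]

    def put(self, key, value):
        if key in self._data:
            self._data.move_to_end(key)
        self._data[key] = value
        if len(self._data) > self.capacity:
            self._data.popitem(last=False)  # evict the least recently used item
```

In practice, Python's built-in `functools.lru_cache` decorator provides the same policy for function results without writing the bookkeeping by hand.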
What is TTL and cache invalidation?
TTL (time-to-live) sets how long cached data stays valid; invalidation removes or refreshes cached entries when the underlying data changes, ensuring freshness.
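Both mechanisms can be combined in one small cache: entries expire automatically after the TTL, and can also be invalidated explicitly when the source data changes. A minimal sketch (class and method names are illustrative):

```python
import time

class TTLCache:
    def __init__(self, ttl_seconds):
        self.ttl = ttl_seconds
        self._data = {}  # key -> (value, expiry timestamp)

    def put(self, key, value):
        # Stamp each entry with its expiry time on write.
        self._data[key] = (value, time.monotonic() + self.ttl)

    def get(self, key, default=None):
        entry = self._data.get(key)
        if entry is None:
            return default
        value, expires = entry
        if time.monotonic() >= expires:
            del self._data[key]  # lazily drop stale entries on read
            return default
        return value

    def invalidate(self, key):
        # Explicit invalidation: call when the underlying data changes.
        self._data.pop(key, None)
```

Expiry here is checked lazily on read; production caches often also sweep expired entries in the background so memory is reclaimed even for keys that are never read again.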