Design a Distributed Search Service (Elasticsearch)
Design a service optimized for full-text search across billions of documents. Discuss inverted indices, sharding, replication, and query latency optimization.
Why Interviewers Ask This
Interviewers at Amazon ask this to evaluate your ability to architect systems that handle massive scale while maintaining low latency. They specifically test your understanding of how distributed architectures manage data consistency, fault tolerance, and query performance when dealing with billions of documents.
How to Answer This Question
1. Clarify requirements immediately: define scale (billions of documents), latency targets (sub-second), and consistency needs.
2. Outline the high-level architecture: propose a client-facing load balancer routing requests to search nodes.
3. Detail the core indexing strategy: explain how inverted indices map tokens to documents and why they enable fast lookups.
4. Discuss distribution mechanics: describe horizontal sharding for write scalability and replication for read throughput and fault tolerance.
5. Address optimization: cover caching strategies such as Lucene segment caches and query routing techniques that minimize cross-shard communication.
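The indexing step can be illustrated with a minimal in-memory sketch. This is a toy, not Lucene's on-disk segment format: tokenization is naive whitespace splitting, and the `InvertedIndex` class and its methods are hypothetical names for illustration.

```python
from collections import defaultdict

class InvertedIndex:
    """Toy inverted index: each term maps to the set of document IDs containing it."""

    def __init__(self):
        self.postings = defaultdict(set)

    def index(self, doc_id, text):
        # Naive tokenization: lowercase and split on whitespace.
        for term in text.lower().split():
            self.postings[term].add(doc_id)

    def search(self, query):
        # AND semantics: intersect the posting lists of all query terms.
        terms = query.lower().split()
        if not terms:
            return set()
        result = set(self.postings.get(terms[0], set()))
        for term in terms[1:]:
            result &= self.postings.get(term, set())
        return result

idx = InvertedIndex()
idx.index(1, "distributed search service")
idx.index(2, "distributed key value store")
print(idx.search("distributed search"))  # {1}
```

A real engine would also store term frequencies and positions in each posting to support ranking and phrase queries, but the term-to-documents mapping above is the structure that makes lookups fast.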
Key Points to Cover
- Demonstrating deep knowledge of inverted index structures and their role in full-text retrieval
- Explaining the trade-offs between strong consistency and availability in a distributed environment
- Articulating specific strategies for minimizing cross-shard network overhead during query execution
- Proposing concrete mechanisms for handling node failures without data loss or service interruption
- Connecting architectural choices directly to measurable metrics like latency and throughput
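The sharding and replication points above can be made concrete with a small routing sketch. The shard count, replica count, and placement scheme here are illustrative assumptions, not a specific Elasticsearch configuration; real systems also handle rebalancing when nodes join or leave.

```python
import hashlib

NUM_SHARDS = 6   # assumed fixed shard count for the example
REPLICAS = 3     # three copies of each shard, per the durability target

def shard_for(doc_id: str) -> int:
    # Hash the document ID so writes spread evenly and any node can
    # compute a document's home shard without a lookup table.
    digest = hashlib.md5(doc_id.encode()).hexdigest()
    return int(digest, 16) % NUM_SHARDS

def replica_nodes(shard: int, nodes: list) -> list:
    # Place each replica on a distinct node by walking the node list;
    # losing one node therefore loses at most one copy of any shard.
    return [nodes[(shard + i) % len(nodes)] for i in range(REPLICAS)]

nodes = [f"node-{i}" for i in range(6)]
shard = shard_for("doc-42")
print(shard, replica_nodes(shard, nodes))
```

Hash-based routing keeps the coordinator stateless, but it means a single-document read must still compute the shard, while a full-text query must fan out to every shard, which is why cross-shard overhead dominates query latency.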
Sample Answer
To design a distributed search service for billions of documents, I first define the constraints: sub-100ms query latency and 99.99% availability. The foundation is an inverted index, where each term maps to a posting list of document IDs.

For storage, we shard the index horizontally across nodes, routing each document by a hash of its ID to distribute the write load. Each shard is replicated, typically three times, so the system survives a node failure without data loss, which aligns with Amazon's customer obsession regarding reliability.

When a query arrives, a coordinator node fans it out to all relevant shards in parallel and merges the partial results. To optimize latency, we cache aggressively at the segment level for frequent queries and batch requests; we also compress posting lists with structures like Roaring Bitmaps to reduce the memory footprint. Finally, replicas receive updates asynchronously, accepting eventual consistency in exchange for availability and write throughput.
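The coordinator's scatter-gather step in the answer above can be sketched as follows. The `Shard` class is a hypothetical stand-in for a remote search node (a real shard would score documents against its local index segments over the network), and threads stand in for parallel RPCs.

```python
import heapq
from concurrent.futures import ThreadPoolExecutor

class Shard:
    """Stand-in for a remote shard; holds pre-scored (score, doc_id) hits."""

    def __init__(self, hits):
        self.hits = hits

    def top_k(self, query, k):
        # A real shard would score `query` against its local segments;
        # here we just return the k highest-scoring stored hits.
        return heapq.nlargest(k, self.hits)

def coordinate(shards, query, k=10):
    # Scatter: query every shard in parallel (threads model parallel RPCs).
    with ThreadPoolExecutor(max_workers=len(shards)) as pool:
        partials = pool.map(lambda s: s.top_k(query, k), shards)
    # Gather: merge the per-shard top-k lists into one global top-k.
    all_hits = [hit for part in partials for hit in part]
    return heapq.nlargest(k, all_hits, key=lambda hit: hit[0])

shards = [Shard([(0.9, "a"), (0.2, "b")]), Shard([(0.7, "c"), (0.1, "d")])]
print(coordinate(shards, "q", k=2))  # [(0.9, 'a'), (0.7, 'c')]
```

Note that each shard only returns its local top-k, so the coordinator merges at most `k × num_shards` hits rather than full result sets, which is the main lever for keeping cross-shard traffic small.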
Common Mistakes to Avoid
- Focusing solely on database schema design rather than the specific mechanics of search indexing and retrieval
- Ignoring the complexity of distributed consensus protocols required for managing replica synchronization
- Overlooking the impact of network latency when coordinating queries across geographically dispersed shards
- Failing to discuss how the system scales horizontally as the document count grows from millions to billions
Related Interview Questions
- Design a Payment Processing System (Hard, Uber)
- Design a System for Real-Time Fleet Management (Hard, Uber)
- Design a CDN Edge Caching Strategy (Medium, Amazon)
- Design a System for Monitoring Service Health (Medium, Salesforce)
- Design a 'Trusted Buyer' Reputation Score for E-commerce (Medium, Amazon)
- Design a Key-Value Store (Distributed Cache) (Hard, Amazon)