Design an Image Hosting Service (Instagram/Flickr)
Design a service to store and retrieve billions of images. Focus on file storage (S3/BLOB), image processing/resizing, and using Content Delivery Networks (CDNs).
Why Interviewers Ask This
Interviewers at Meta ask this to evaluate your ability to architect scalable systems handling massive unstructured data. They specifically test your understanding of storage trade-offs, the critical role of CDNs in latency reduction, and how to design efficient image processing pipelines that decouple heavy compute from user requests.
How to Answer This Question
1. Clarify requirements: define scale (billions of images), read/write ratios, and specific features like resizing or filters.
2. High-level architecture: propose a client uploading through an API gateway, which enqueues a job for a worker pool.
3. Storage strategy: recommend object storage like S3 for raw images, emphasizing durability, rather than database storage.
4. Processing pipeline: design an asynchronous workflow in which workers resize images into multiple thumbnail sizes upon upload.
5. Delivery optimization: explain using CDNs, with cache invalidation strategies, to serve global users quickly.
6. Scalability: discuss sharding strategies for the metadata database and auto-scaling worker groups to handle the traffic spikes typical of social media platforms.
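The upload and processing steps above can be sketched in a few lines of Python. This is a minimal, single-process sketch: the in-memory `queue.Queue` stands in for a durable broker like Kafka, the `object_store` dict stands in for S3, and names like `handle_upload` and `resize_worker` are illustrative, not a real API.

```python
import queue
import threading

# Stand-in for a durable message queue (Kafka/SQS in production).
job_queue: queue.Queue = queue.Queue()

# Stand-in for the object store (S3 in production).
object_store: dict[str, bytes] = {}

THUMBNAIL_SIZES = [(64, 64), (256, 256), (1024, 1024)]

def handle_upload(image_id: str, raw_bytes: bytes) -> str:
    """API-layer handler: persist the original, enqueue resize work, return fast."""
    object_store[image_id] = raw_bytes   # would be an S3 PUT in production
    job_queue.put(image_id)              # heavy work is deferred to workers
    return f"accepted:{image_id}"

def resize_worker() -> None:
    """Background worker: consume jobs and write derived thumbnails."""
    while True:
        image_id = job_queue.get()
        if image_id is None:  # shutdown sentinel
            break
        for w, h in THUMBNAIL_SIZES:
            # Real resizing would use an image library (e.g. Pillow);
            # a placeholder payload is stored here to show the keying scheme.
            object_store[f"{image_id}_{w}x{h}"] = b"<resized>"
        job_queue.task_done()

worker = threading.Thread(target=resize_worker, daemon=True)
worker.start()

handle_upload("img123", b"<jpeg bytes>")
job_queue.join()  # demo only: wait for the async pipeline to drain
```

The point of the sketch is the decoupling: `handle_upload` returns as soon as the job is enqueued, and the thumbnails appear in the store asynchronously.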
Key Points to Cover
- Explicitly separating raw image storage (Object Store) from metadata storage (Database)
- Designing an asynchronous processing pipeline using message queues to decouple uploads from resizing
- Justifying the use of CDNs for reducing latency in a globally distributed user base
- Addressing scalability through database sharding strategies based on user IDs
- Discussing cost and performance trade-offs between different image formats and compression levels
Sample Answer
To design an Instagram-like service, we first clarify that while we store billions of images, reads vastly outnumber writes, and we need low-latency retrieval globally. For storage, we should never put raw images in a SQL database. Instead, we use distributed object storage such as AWS S3 or Meta's own Haystack. Metadata (user ID, URL, timestamp) lives in a sharded NoSQL database like Cassandra or DynamoDB.

When a user uploads an image, the request hits our API gateway, which pushes a job to a message queue such as Kafka. A pool of worker services consumes these jobs and generates the various thumbnail sizes asynchronously, so the user never waits on heavy processing. The processed images are then written back to S3.

To ensure fast delivery, we place a CDN like CloudFront in front of S3; it caches the static image assets at edge locations worldwide. Crucially, we must implement cache invalidation logic for cases such as a user updating their profile picture. Finally, to handle Meta-scale traffic, we shard the metadata database by user ID to distribute load evenly across nodes, ensuring no single hotspot becomes a bottleneck during viral events.
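The user-ID sharding mentioned in the sample answer can be sketched as a stable hash. The shard count and function name here are illustrative; real deployments typically use consistent hashing (or a directory service) so that adding shards does not remap most keys.

```python
import hashlib

NUM_SHARDS = 16  # illustrative; production counts are chosen with headroom

def shard_for_user(user_id: str, num_shards: int = NUM_SHARDS) -> int:
    """Map a user ID to a metadata shard with a stable hash.

    A cryptographic hash is used instead of Python's built-in hash(),
    which is salted per process, so the mapping stays stable across
    restarts and across machines.
    """
    digest = hashlib.md5(user_id.encode("utf-8")).digest()
    return int.from_bytes(digest[:8], "big") % num_shards

# All of a user's photo metadata lands on one shard, so profile and
# feed queries for that user touch a single node.
shard = shard_for_user("user_42")
assert 0 <= shard < NUM_SHARDS
```

Because the function is deterministic, every service instance routes a given user to the same shard without any coordination.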
Common Mistakes to Avoid
- Storing binary image data directly in a relational database instead of using object storage
- Forgetting to mention Content Delivery Networks, leading to high latency for international users
- Attempting to process all image resizing synchronously, causing poor user experience during uploads
- Ignoring cache invalidation strategies when users update their profile pictures frequently
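For the profile-picture case in the last bullet, a common alternative to explicit CDN invalidation is cache busting with versioned URLs: the URL changes whenever the content changes, so stale edge copies are simply never requested again. A minimal sketch, assuming a hypothetical cdn.example.com domain:

```python
import hashlib

CDN_HOST = "https://cdn.example.com"  # hypothetical CDN domain

def versioned_url(image_path: str, content: bytes) -> str:
    """Embed a short content hash in the URL so updates get a new cache key."""
    version = hashlib.sha256(content).hexdigest()[:12]
    return f"{CDN_HOST}/{image_path}?v={version}"

old = versioned_url("users/42/avatar.jpg", b"old picture bytes")
new = versioned_url("users/42/avatar.jpg", b"new picture bytes")
assert old != new  # the CDN treats these as distinct, independently cached objects
```

The trade-off: versioned URLs let you set very long cache TTLs safely, but they require the metadata database to store the current URL (or version) for each image.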