Design a Pastebin Service

Question

Accepted Answer

To design a Pastebin service, I'd start by defining the scope. We need to support short-lived text snippets with unique IDs, likely serving millions of reads daily. The API needs three endpoints: POST /paste to create a paste, GET /:id to read it, and DELETE /:id to remove it.

Given the simple key-value nature of the data, I'd propose a hybrid storage approach. The content itself, which can be large, should go into an object store like S3 for durability and low cost, while metadata and active sessions sit in a fast in-memory KV store like Redis for very low-latency reads.

To handle potentially malicious inputs, we must enforce strict input validation at the gateway level and sanitize HTML before storage, so that a paste cannot trigger XSS when it is rendered. We'd also reject payloads exceeding a reasonable limit, say 5MB, to protect backend resources.

Data expiration is critical for cost control. Redis TTLs can expire the metadata of temporary pastes, and a background worker can delete the corresponding expired objects from S3. This architecture balances high availability for reads with secure, scalable writes, fitting the needs of a platform like Uber where reliability and speed are paramount.
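The flow described in this answer can be sketched in a few lines of Python. This is only an illustration under stated assumptions: plain dictionaries stand in for S3 and Redis, and the names PasteService, PasteMeta, render_html, and sweep_expired are hypothetical, not part of any real library. The sketch also escapes HTML when a paste is rendered instead of rewriting it before storage, which is a substituted choice that keeps the stored text byte-for-byte intact.

```python
import html
import secrets
import time
from dataclasses import dataclass
from typing import Callable, Dict, Optional

MAX_PASTE_BYTES = 5 * 1024 * 1024  # the 5MB limit suggested in the answer


@dataclass
class PasteMeta:
    blob_key: str                 # where the content lives in the object store
    expires_at: Optional[float]   # epoch seconds; None means no expiry


class PasteService:
    """In-memory stand-in: dicts play the roles of S3 (blobs) and Redis (metadata)."""

    def __init__(self, clock: Callable[[], float] = time.time) -> None:
        self._blobs: Dict[str, bytes] = {}
        self._meta: Dict[str, PasteMeta] = {}
        self._clock = clock  # injectable so expiry can be tested without sleeping

    def create(self, text: str, ttl_seconds: Optional[float] = None) -> str:
        """POST /paste: validate, store the content, record metadata, return the ID."""
        data = text.encode("utf-8")
        if not data:
            raise ValueError("paste is empty")
        if len(data) > MAX_PASTE_BYTES:
            raise ValueError("paste exceeds size limit")
        paste_id = secrets.token_urlsafe(6)  # 6 random bytes -> 8 URL-safe chars
        while paste_id in self._meta:        # retry on the rare collision
            paste_id = secrets.token_urlsafe(6)
        blob_key = f"pastes/{paste_id}"
        self._blobs[blob_key] = data
        expires_at = None if ttl_seconds is None else self._clock() + ttl_seconds
        self._meta[paste_id] = PasteMeta(blob_key, expires_at)
        return paste_id

    def get(self, paste_id: str) -> Optional[str]:
        """GET /:id: return the text, or None if it is missing or expired."""
        meta = self._meta.get(paste_id)
        if meta is None or self._is_expired(meta):
            return None
        return self._blobs[meta.blob_key].decode("utf-8")

    def delete(self, paste_id: str) -> bool:
        """DELETE /:id: remove metadata and content; False if the ID is unknown."""
        meta = self._meta.pop(paste_id, None)
        if meta is None:
            return False
        self._blobs.pop(meta.blob_key, None)
        return True

    def sweep_expired(self) -> int:
        """Background-worker step: delete every expired paste, return the count."""
        expired = [pid for pid, m in self._meta.items() if self._is_expired(m)]
        for pid in expired:
            self.delete(pid)
        return len(expired)

    def _is_expired(self, meta: PasteMeta) -> bool:
        return meta.expires_at is not None and self._clock() >= meta.expires_at


def render_html(text: str) -> str:
    """Escape the paste on output so embedded markup is shown, not executed."""
    return "<pre>" + html.escape(text) + "</pre>"
```

In a real deployment the two dictionaries would be replaced by calls to the object store and the KV store, and sweep_expired would run on a schedule. The injected clock exists only to make expiry easy to exercise in tests.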

Related Interview Questions

Discuss ACID vs. BASE properties

Design a CDN Edge Caching Strategy

Design a Payment Processing System