Design a Push Notification Delivery System

Question

Accepted Answer

To design a push notification system for a platform like Uber, I would start by establishing clear SLAs, aiming for sub-second latency for ride updates while ensuring 99.9% delivery reliability. The core architecture would involve a Notification Service acting as an orchestrator. When a driver accepts a ride, the service publishes an event to a high-throughput message queue. Consumers then pick up these events, fetch the associated user's device tokens from a low-latency cache like Redis, and route them to the correct provider—APNS for iOS or FCM for Android.

Token management is critical here. We must store tokens with metadata like expiration dates and last seen timestamps, refreshing them upon app re-launch to avoid sending to dead devices. For throttling, we implement a sliding window rate limiter. If a user receives too many notifications, we queue subsequent ones or suppress them to prevent battery drain and app uninstallations. We also need robust retry logic; if FCM returns a 'device not registered' error, we immediately invalidate that token. Finally, monitoring dashboards should track delivery rates, provider-specific error codes, and queue depths to quickly identify bottlenecks during peak hours like Friday nights.

Design a Push Notification Delivery System

Why Interviewers Ask This

How to Answer This Question

Key Points to Cover

Sample Answer

Common Mistakes to Avoid

Sound confident on this question in 5 minutes

Related Interview Questions

Design a CDN Edge Caching Strategy

Design a System for Monitoring Service Health

Design a Payment Processing System