Backend EngineeringSystem DesignDistributed SystemsInterview PrepArchitectureRoadmap

System Design Interview Roadmap: 45 Problems To Master (With Architect-Level Guidance)

Satyam Parmar
January 20, 2025
7 min read

System Design Interview Roadmap

You do not need 500 random questions. You need the right 45 practiced the right way.

This roadmap is organized by difficulty with what to design, what interviewers look for, and how to practice. Use it as a study plan and a checklist.


Easy Level 1 to 10

  1. Parking Garage
  • Design entry exit flows, ticketing, pricing, and availability counters per level.
  • Think rate limits at gates, simple relational schema, and backpressure at peaks.
  1. Vending Machine
  • Finite state machine for selection, payment, dispense, and refunds.
  • Handle out of stock, change return, and timeouts cleanly.
  1. Distributed Cache
  • Read write policies, TTL and eviction like LRU, and cache stampede control.
  • Decide consistency needs and fallbacks when cache misses happen.
  1. Authentication System
  • Login flow, sessions versus JWT, refresh tokens, and password storage.
  • Protect brute force and add device or IP based signals.
  1. Distributed Job Scheduler
  • Leader election, cron triggers, worker pool, retries, and idempotency.
  • Persist job state and move failures to a dead letter queue.
  1. Key Value Store for a Search Engine
  • Partition by hash with awareness of index shards; replicate for availability with quorum reads writes.
  • Optimize for write amplification and compaction; mitigate hot terms with caching and tiered storage.
  1. URL Shortener (Bit.ly or TinyURL)
  • Short id generation, cache heavy read path, and sub millisecond redirects at the edge.
  • Handle hot keys and abuse; add per link analytics and rate limiting.
  1. Content Delivery Network CDN
  • Edge caching, TTL and validation, and origin failover.
  • Optimize cache hit ratio and pre warm popular assets.
  1. Text Storage Pastebin
  • Store variable size text with metadata, retention, and abuse guards.
  • Optimize for read heavy access with cache and CDN.
  1. UPI like Payments simplified
  • Idempotency keys, double spend protection, and a ledger model.
  • Reconciliation jobs and clear failure handling for transfers.

Medium Level 11 to 32

  1. Tinder
  • Recommendation pipeline with geo filters, rate limits, and anti abuse.
  • Store swipe events and build daily aggregates for freshness.
  1. Twitter Timeline and Search (or Facebook Feed and Search)
  • Choose fanout on write vs fan in on read; denormalize for home timeline and enable ranked retrieval.
  • Power search and trends with inverted indexes and streaming aggregation; apply abuse and spam controls.
  1. Reddit
  • Voting mechanics, ranking with decay, and community isolation.
  • Moderation tools and rate limits for spam.
  1. Netflix
  • Catalog service, CDN distribution, and adaptive bitrate streaming.
  • Recommendations pipeline and regional licensing constraints.
  1. Spotify
  • Playlists, offline cache sync, and real time playback state.
  • Search and recommendations with user taste profiles.
  1. TikTok
  • For you feed ranking, upload pipeline, and video processing.
  • Cost controls for storage and network egress.
  1. Shopify
  • Multi tenant stores, product inventory, checkout, and webhooks.
  • Isolation per merchant and spike handling on drops.
  1. YouTube
  • Upload to transcode to store to CDN pipeline with comments.
  • Thumbnails, rate limits, and copyright workflows.
  1. Social Network Data Structures (Facebook)
  • Model the social graph, adjacency lists, secondary indexes, and privacy scopes for reads.
  • Power feed ranking signals and notifications while keeping storage and queries efficient.
  1. WhatsApp
  • End to end messaging basics, delivery receipts, and presence.
  • Group fanout and media handling constraints.
  1. Instagram
  • Media storage, stories life cycle, and thumbnail generation.
  • Explore ranking and hashtag search.
  1. Rate Limiter
  • Token bucket or leaky bucket per key and global limits.
  • Placement at gateway and service level with burst control.
  1. Google Search simplified
  • Crawler to index to ranking with caches.
  • Query autosuggest and abuse defenses.
  1. Payment System
  • Double entry ledger, idempotency, reconciliation, and refunds.
  • Chargeback handling and audit trails.
  1. Personal Finance Aggregator (Mint.com)
  • Aggregation via aggregators or bank APIs; OAuth flows, data normalization, and categorization pipelines.
  • Budgeting insights, anomaly alerts, and encryption/privacy by design with regional compliance.
  1. Notification Service
  • Fanout across email sms push with templates and retries.
  • User preferences and quiet hours.
  1. Flight Booking System
  • Search cache, seat inventory locking, and eventual consistency.
  • Split payment and ticket issuance flows.
  1. Stock Exchange simplified
  • Matching engine, low latency, and ordering guarantees.
  • Market data dissemination and backpressure.
  1. E commerce Store Amazon
  • Catalog, cart, checkout, search, and recommendations.
  • Fraud detection and order lifecycle events.
  1. Amazon Style Sales Ranking by Category
  • Maintain rolling rankings per category with decay; design counters with sharding and anti skew.
  • Serve fast via precomputed leaderboards and caches; update via streams or batch windows.
  1. Distributed Message Queue Kafka
  • Partitions, replication factor, consumer groups, and offsets.
  • Effectively once guarantees via outbox and idempotent consumers.
  1. Scale to Millions of Users on AWS
  • Stateless services behind ALB with ASG; Aurora/RDS read replicas, ElastiCache, S3+CloudFront, multi AZ.
  • Apply observability, autoscaling policies, DR, and cost controls; plan limits and regional expansion.

Hard Level 33 to 45

  1. Uber
  • Dispatch, ETA estimation, surge pricing, and geo indexing.
  • Driver state machine and mobile connectivity constraints.
  1. Zoom
  • Media servers, NAT traversal, and SFU versus MCU tradeoffs.
  • Quality of service and bandwidth adaptation.
  1. Google Docs
  • Collaborative editing via OT or CRDT with conflict resolution.
  • Presence, comments, and version history.
  1. Google Maps
  • Tile generation, routing engine, and traffic ingestion.
  • Multi level caches from client to edge to origin.
  1. Distributed Web Crawler
  • Crawl frontier, deduplication, politeness, and robots rules.
  • Storage tiers for raw pages and indexes.
  1. Code Deployment System
  • Blue green and canary releases with fast rollback.
  • Artifact storage, SBOM, and auditability.
  1. Distributed Locking Service
  • Fencing tokens, leases, and split brain avoidance.
  • Fairness and starvation considerations.
  1. File Sharing Dropbox
  • Delta sync, conflict handling, encryption at rest and in transit.
  • Storage classes and lifecycle policies.
  1. Food Delivery DoorDash
  • Order batching, routing, courier location, and SLA.
  • Cold chain requirements and busy hour spikes.
  1. Location Based Service Yelp
  • Geo queries, ranking, freshness, and caching.
  • Review moderation and fraud controls.
  1. Distributed Cloud Storage S3
  • Durability targets, replication, and erasure coding.
  • Eventual consistency and strong read after write for new objects.
  1. Ticket Booking BookMyShow
  • Seat locking, overbooking prevention, and payment races.
  • Queues for on sale events and fairness.
  1. Distributed Analytics Data Lake
  • Batch and stream processing with schema evolution.
  • Cost controls for storage and compute with partition pruning.

Study Checklist Make four things for every problem 1 API surface and idempotency notes 2 Data model with partition keys 3 High level diagram of components 4 Scaling and failure plan including DLQ and backpressure

Six Week Plan Week 1 Easy 1 to 10 Week 2 Medium 11 to 18 Week 3 Medium 19 to 26 Week 4 Medium or Hard 27 to 34 Week 5 Hard 35 to 40 Week 6 Hard 41 to 45

Practice Prompts What fails first at 10x traffic What must be strongly consistent versus eventually consistent Where do you cache and how do you invalidate it What is your retry and idempotency strategy What is your partition key and why What is the smallest MVP

If you want structured, project based guidance, contact me. I’ll help you move from theory to production.

Related Articles

Home