Breaking Limits: Introduction to Sharding

Welcome to Day 2! 🌐

Replication solves High Availability (Read Scaling). Sharding solves Massive Data (Write Scaling + Storage).

If you have 50TB of data, you can’t buy a hard drive big enough. You split the data across 5 servers (10TB each). This is Sharding.

1. The Architecture

A Sharded Cluster has 3 components:

Shard: A Replica Set containing a subset of the data.
Config Server: Stores the roadmap. “Which data lives on which shard?“.
Mongos (Router): The doorman. The app connects here. It asks the Config Server where to find data and routes the query.

The most important decision you will make. MongoDB splits data based on the Shard Key.

If you shard by zipcode:

New users always have the highest date. All new inserts will hit the Last Shard. Result: Hotspot. One shard works 100%, others 0%.

Only 2-3 values. You can have at most 2-3 chunks. You can’t split “Female” into smaller pieces if it grows to 10TB.

Users are evenly distributed across all shards. Result: Even load balancing.

Ranged: Good for range queries (count users between ID 100 and 200). But risks hotspots if data isn’t random.
Hashed: MongoDB hashes the key. Hash(1) might be on Shard A, Hash(2) on Shard B.
- Pros: perfect distribution.
- Cons: Range queries are slow (scatter-gather).

Think about your current app. If you had to shard your biggest collection, what key would you pick?

See you on Day 3 for Horizontal vs Vertical Scaling! ⚖️