Database Sharding

Update: 2023-06-29

Description

Database sharding is the practice of splitting a large database across multiple machines. A single machine can hold and process only so much data, so some systems eventually outgrow what one machine can handle. As systems scale, they may also need to split data between machines for security or location reasons. Sharding addresses these problems by dividing the data into smaller chunks, so that work can be done in parallel or only in the locations that hold the relevant data.
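To make the idea concrete, here is a minimal sketch of one common way to route rows to shards and to run a query on all shards in parallel. It is an illustration only: the shard count, the in-memory dicts standing in for separate machines, and the names `shard_for`, `put`, `get`, and `count_where` are all assumptions made for this example, not anything described in the episode.

```python
import hashlib
from concurrent.futures import ThreadPoolExecutor

NUM_SHARDS = 4  # hypothetical shard count chosen for the example

# Each dict stands in for a separate database machine (an assumption
# made so the sketch can run in a single process).
shards = [dict() for _ in range(NUM_SHARDS)]


def shard_for(key: str) -> int:
    """Map a key to a shard index using a hash that is stable across runs."""
    digest = hashlib.sha256(key.encode("utf-8")).digest()
    return int.from_bytes(digest[:8], "big") % NUM_SHARDS


def put(key: str, row: dict) -> None:
    """Store a row on the one shard that owns its key."""
    shards[shard_for(key)][key] = row


def get(key: str):
    """Look up a row by key; only the owning shard is touched."""
    return shards[shard_for(key)].get(key)


def count_where(predicate) -> int:
    """Scatter-gather: scan every shard in parallel, then sum the counts."""
    def scan(shard: dict) -> int:
        return sum(1 for row in shard.values() if predicate(row))

    with ThreadPoolExecutor(max_workers=NUM_SHARDS) as pool:
        return sum(pool.map(scan, shards))


# Usage: load some made-up customers, then do a point lookup and a
# query that fans out to every shard.
for i in range(100):
    put(f"customer-{i}", {"id": i, "active": i % 2 == 0})

print(get("customer-42"))
print(count_where(lambda row: row["active"]))
```

A lookup by key touches a single shard, while a query that cannot be answered from the key alone has to visit every shard. That trade-off is a large part of why the choice of key matters.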

How you split up your data matters a lot. For instance, splitting a customer table by customer last name is unlikely to help a large distributed system as much as splitting customers by location. You probably also want shards that are roughly the same size. The main goal of sharding is to improve performance, specifically through parallelization, but it helps if the split also provides some resilience to outages, so keep that in mind when you start planning your shards.
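One rough way to compare candidate shard keys is to count how many rows each key would place on each shard. The sketch below does that for a handful of made-up customers. The shard names, the region mapping, the alphabet ranges, and the sample data are all hypothetical, and a real evaluation would also look at query patterns, not just row counts.

```python
# Hypothetical shard names and region mapping, invented for this example.
SHARDS = ("shard-a", "shard-b", "shard-c")
REGION_TO_SHARD = {"us-east": "shard-a", "us-west": "shard-b", "eu": "shard-c"}


def shard_by_region(customer: dict) -> str:
    """Location-based key: each region lives on its own shard."""
    return REGION_TO_SHARD[customer["region"]]


def shard_by_last_initial(customer: dict) -> str:
    """Name-based key: split the alphabet into three fixed ranges."""
    initial = customer["last_name"][0].upper()
    if initial <= "H":
        return "shard-a"
    if initial <= "P":
        return "shard-b"
    return "shard-c"


def shard_sizes(customers, key_fn) -> dict:
    """Count how many rows each shard would receive under key_fn."""
    sizes = {name: 0 for name in SHARDS}
    for customer in customers:
        sizes[key_fn(customer)] += 1
    return sizes


def imbalance(sizes: dict) -> float:
    """Largest shard divided by the average shard; 1.0 means perfectly even."""
    average = sum(sizes.values()) / len(sizes)
    return max(sizes.values()) / average


# Made-up sample data.
customers = [
    {"last_name": "Adams", "region": "us-east"},
    {"last_name": "Baker", "region": "us-west"},
    {"last_name": "Clark", "region": "eu"},
    {"last_name": "Davis", "region": "us-east"},
    {"last_name": "Evans", "region": "us-west"},
    {"last_name": "Smith", "region": "eu"},
]

for key_fn in (shard_by_region, shard_by_last_initial):
    sizes = shard_sizes(customers, key_fn)
    print(key_fn.__name__, sizes, imbalance(sizes))
```

With this particular sample the region key spreads rows evenly while the last-name key piles most of them onto one shard. Real data could easily skew the other way (one region may dwarf the rest), so it is worth measuring against your own data before committing to a key.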

Database sharding can be a very useful tool for making your application more resilient to load. However, it is complex, and you need to think it through carefully before using it in your environment. There are several ways to do it, each with different advantages and disadvantages, and you should weigh them thoroughly before starting. Sharding is also a fairly drastic operation that requires support and extra work for the remaining lifetime of your application, so you shouldn't really consider it until most other options have been exhausted.

Join Us On Patreon

Level Up Financial Planning

The post Database Sharding appeared first on Complete Developer Podcast.

Hosted on Acast. See acast.com/privacy for more information.
