More than 10 years ago, GitHub.com started out like many other web applications of that time—built on Ruby on Rails, with a single MySQL database to

Partitioning GitHub’s relational databases to handle scale | The GitHub Blog

submited by

Style Pass

2021-09-28 03:00:10

More than 10 years ago, GitHub.com started out like many other web applications of that time—built on Ruby on Rails, with a single MySQL database to store most of its data.

Over the years, this architecture went through many iterations to support GitHub’s growth and ever-evolving resiliency requirements. For example, we started storing data for some features (like statuses in separate MySQL databases, we added read replicas to spread the load across multiple machines, and we started using ProxySQL to reduce the number of connections opened against our primary MySQL instances.

Yet at its core, GitHub.com remained built around one main database cluster (called mysql1) that housed a large portion of the data used by core GitHub features, like user profiles, repositories, issues, and pull requests.

With GitHub’s growth, this inevitably led to challenges. We struggled to keep our database systems adequately sized, always moving to newer and bigger machines to scale up. Any sort of incident negatively affecting mysql1 would affect all features that stored their data on this cluster.

What does First Normal Form actually mean?

Comment

Yes, You May Need a Blockchain

Comment

Timescale grabs $40M Series B as it goes all in on cloud version of time series database

Comment

Automatically generating Github-like og:images in Jekyll

Comment

Recfiles - Wikipedia

Comment

GitHub’s CTO Joins Redpoint As A Partner In The VC Firm’s New $725 Million Growth Fund

Comment

Build Applications and APIs Faster with MongoDB Atlas and AWS App Runner

Comment

Reimagining database querying on unstructured data

Comment

Now in private preview: optimize your data distribution with hierarchical partition keys

Comment

Best practices for consistent configuration management at scale with Tanka

Comment

Partitioning GitHub’s relational databases to handle scale | The GitHub Blog

Leave a Comment

Related Posts

What does First Normal Form actually mean?

Yes, You May Need a Blockchain

Timescale grabs $40M Series B as it goes all in on cloud version of time series database

Automatically generating Github-like og:images in Jekyll

Recfiles - Wikipedia

GitHub’s CTO Joins Redpoint As A Partner In The VC Firm’s New $725 Million Growth Fund

Build Applications and APIs Faster with MongoDB Atlas and AWS App Runner

Reimagining database querying on unstructured data

Now in private preview: optimize your data distribution with hierarchical partition keys

Best practices for consistent configuration management at scale with Tanka

Recent Posts

Ukraine Starts Building First US-design Nuclear Reactors

Drop a beat! Digital drum kit made in toddle

Florida will open schools to volunteer chaplains

Ketamine produces wide variety of responses in the brain, researchers find

Discovery in Maya pyramid reveals dramatic dynasty collapse, archaeologists say

Computer Science > Human-Computer Interaction

Import Reviews from Different Social Channels

OpenAI winds down AI image generator that blew minds and forged friendships in 2022

My 25-Year Engineering Career Retrospective

rDSA : an intelligent tool for data science assignments

A Fidi Office Building With a Wait List

Search code, repositories, users, issues, pull requests...

Google merges the Android, Chrome, and hardware divisions

Rust-Written LAVD Kernel Scheduler Shows Promising Results For Linux Gaming

Distinct features of the regenerating heart uncovered through comparative single-cell profiling

Inverters with constant full load capability enable an increase in the performance of electric drives - Fraunhofer IZM

Ofcom: Almost a quarter of kids aged 5-7 have smartphones

Scientists say they have found evidence of an unknown planet in our solar system

How to create a TODO with Tailwind CSS and Alpinejs

Oxford English Dictionary