I had the privilege to give a talk at ScyllaDB Summit 2024, where I briefly addressed the problem of analysing remaining capacity in clusters. It requires a good understanding of ScyllaDB internals to plan your computation cost increase when your product grows or to reduce cost if the cluster turns out to be heavily over-provisioned. In my experience, clusters can be reduced by 2-5x without latency degradation after such an analysis. In this post, I explain how to analyse CPU and disk resources properly with more details.
ScyllaDB is a distributed database, and one cluster typically contains multiple nodes. Each node can contain multiple shards, and each shard is assigned to a single core. The database is built by the Seastar framework and uses a shared-nothing approach. All data is usually replicated in several copies, depending on a replication factor, and each copy is assigned to a specific shard. As a result, every shard can be analysed as an independent unit and it efficiently utilises all available CPU resources without any overhead from contention or context switching.
Each shard has different tasks, which we can divide into two categories: client request processing or maintenance tasks. All tasks are executed by a scheduler in one thread pinned to a core, giving each one its own CPU budget limit. Such clear task separation allows isolation and prioritisation of latency-critical tasks of request processing over less important ones. As a result of such design, the cluster handles load spikes more efficiently and provides gradual latency degradation under heavy load. You can find more details about this architecture at this link.