In the previous blog, we introduced InstanceSet, along with a series of features derived from it to address high availability and other database needs. In this blog, we will introduce one of these features -- taking specified instances offline.
Before v0.9.0, KubeBlocks generated workloads as StatefulSets, which was a double-edged sword. While KubeBlocks could leverage the advantages of StatefulSets to manage stateful applications like databases, it inherited its limitations.
One of these limitations is evident in horizontal scaling scenarios, where StatefulSets offload Pods sequentially based on the Ordinal order, potentially impacting the availability of databases running within.
For example, managing a PostgreSQL database with one primary and two secondary replicas using a StatefulSet named foo-bar. And Pod foo-bar-2 is selected as the primary node. Now, if we decide to scale in this database cluster due to low read load, according to StatefulSet rules, we can only offload Pod foo-bar-2, which is currently the primary node. In this way, we can either directly offload foo-bar-2, triggering a failover mechanism to elect a new primary pod from foo-bar-0 and foo-bar-1, or use a switchover mechanism to convert foo-bar-2 into a secondary pod before offloading it. Either way, there will be a period where write is not applicable.
Another issue arises in the same scenario: if the node hosting foo-bar-1 experiences a hardware failure, causing disk damage and rendering data read-write inaccessible, according to best operation practices, we need to offload foo-bar-1 and rebuild replicas on healthy nodes. However, performing such operation tasks based on StatefulSets isn't easy.