Think of the meta store as the "brain" of a RisingWave cluster. It's responsible for storing critical information that keeps the entire

Why We Ditched etcd for SQL to Scale RisingWave's Metadata

submited by
Style Pass
2025-01-09 07:30:07

Think of the meta store as the "brain" of a RisingWave cluster. It's responsible for storing critical information that keeps the entire system running smoothly.

RisingWave follows a cloud-native design with stateless components, ensuring high availability and scalability. The persistent state is divided into two key parts:

Persistent Data Storage: This is where the actual data for your tables, materialized views, and the state of stream processing operations are stored. It uses an efficient LSM tree format and resides in a cloud object storage service like AWS S3.

Streaming Task Topology: A map of all the active stream processing tasks and how they are distributed across the compute nodes.

As we scaled, it became clear that the meta store's performance was crucial, and our initial choice of etcd started to show its limitations.

Initially, RisingWave used etcd, a popular distributed key-value store, as its meta store. etcd was a good starting point because it's reliable, provides high availability, and supports transactions. However, as RisingWave gained adoption and users started deploying larger, more complex clusters, we began to encounter limitations with etcd.

Leave a Comment