We are excited to announce that Apache Pinot 0.7.1 was released a few months back in April 2021. Apache Pinot is a real-time distributed datastore des

Announcing Apache Pinot 0.7.1

submited by
Style Pass
2021-06-22 05:00:37

We are excited to announce that Apache Pinot 0.7.1 was released a few months back in April 2021. Apache Pinot is a real-time distributed datastore designed to answer OLAP queries with low latency for those of you who are new readers of this blog. This release introduced several fantastic new features, including JSON index, Lookup-based Join support, GeoSpatial support, TLS support for Pinot connections, and various performance optimizations and improvements. It also adds several new APIs to better manage the segments and upload data to an offline table. It also contains many critical bug fixes.

A JSON string can represent an array, map, or nested field without forcing a fixed schema. It is very flexible, but it comes at a cost: filtering on a JSON string column is very expensive.

Without an index, we need to scan and reconstruct the JSON object from the JSON string for every record to look up a key and filter records based on the value. Then we need to look up the key and compare the value. Pinot’s new JSON index feature is designed to accelerate the filtering on JSON string columns without scanning and reconstructing all the JSON objects.

Leave a Comment