Apache Hudi is an open data lakehouse platform, built on a high-performance open table format to ingest, index, store, serve, transform and manage your data across multiple cloud data environments.
Hudi stores all data and metadata on cloud storage in open formats, providing the following features across different aspects.
The default Spark 3.x version, corresponding to the spark3 profile, is 3.5.3. The default Scala version is 2.12. Scala 2.13 is supported for Spark 3.5 and above.
Please note that only the Spark-related bundles (hudi-spark-bundle, hudi-utilities-bundle, and hudi-utilities-slim-bundle) can be built using the scala-2.13 profile. The Hudi Flink bundle cannot be built using the scala-2.13 profile. To build these bundles on Scala 2.13, use the following command:
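One way to limit a Scala 2.13 build to the three Spark bundles is sketched below. The `-Dspark3.5` flag and the `packaging/` module paths are assumptions here and should be checked against the `pom.xml` in your checkout. The script only prints the Maven command, so it can be reviewed before it is run from the repository root.

```shell
#!/bin/sh
# Assumption: the bundle modules live under packaging/ in the Hudi source tree.
BUNDLES="packaging/hudi-spark-bundle,packaging/hudi-utilities-bundle,packaging/hudi-utilities-slim-bundle"

# Assumption: -Dspark3.5 selects the Spark 3.5 profile; -Dscala-2.13 selects
# the scala-2.13 profile named above. -pl restricts the build to the listed
# modules and -am also builds the modules they depend on.
CMD="mvn clean package -DskipTests -Dspark3.5 -Dscala-2.13 -pl $BUNDLES -am"

# Print the command for review instead of executing it.
echo "$CMD"
```

Copy the printed command into a shell at the repository root to run the build once the flags have been verified.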
The default supported Flink version is 1.20. The default Flink 1.20.x version, corresponding to the flink1.20 profile, is 1.20.0. Flink has been Scala-free since 1.15.x, so there is no need to specify the Scala version for Flink 1.15.x and above. Refer to the table below for building with different Flink and Scala versions.