Apache Iceberg

Original author(s): Ryan Blue, Daniel Weeks
Initial release: 10 August 2017
Written in: Java, Python
Operating system: Cross-platform
Type: Data warehouse, Data lake
License: Apache License 2.0

Apache Iceberg is an open-source, high-performance format for huge analytic tables. Iceberg enables the use of SQL tables for big data and lets engines such as Spark, Trino, Flink, Presto, Hive, Impala, StarRocks, Doris, and Pig safely work with the same tables at the same time.[1] Iceberg is released under the Apache License.[2] It addresses the performance and usability challenges of using Apache Hive tables in large and demanding data lake environments.[3] Vendors that support Apache Iceberg tables in their products include Buster,[4] CelerData, Cloudera, Crunchy Data,[5] Dremio, IOMETE, Snowflake, Starburst, Tabular,[6] and AWS.[7]

  1. "Apache Iceberg". iceberg.apache.org. Retrieved 5 October 2022.
  2. "apache/iceberg GitHub License". The Apache Software Foundation. 5 October 2022. Retrieved 5 October 2022.
  3. Woodie, Alex (8 February 2021). "Apache Iceberg: The Hub of an Emerging Data Service Ecosystem?". Datanami. Archived from the original on 4 September 2024. Retrieved 5 October 2022.
  4. "Buster".
  5. Woodie, Alex (24 July 2024). "Crunchy Data Goes All-in With Postgres". The Big Data Wire.
  6. "Vendors". iceberg.apache.org. Retrieved 5 May 2023.
  7. "Using Apache Iceberg tables – Amazon Athena". Amazon Web Services, Inc. Archived from the original on 4 September 2024. Retrieved 16 June 2023.