Apache Hive

Apache Hive
Original author(s)Facebook, Inc.
Developer(s)Contributors
Initial releaseOctober 1, 2010; 13 years ago (2010-10-01)[1]
Stable release
3.1.3 / April 8, 2022; 2 years ago (2022-04-08)[2]
Preview release
4.0.0-beta-1 / August 14, 2023; 13 months ago (2023-08-14)[2]
Repositorygithub.com/apache/hive
Written inJava
Operating systemCross-platform
Available inSQL
TypeData warehouse
LicenseApache License 2.0
Websitehive.apache.org

Apache Hive is a data warehouse software project. It is built on top of Apache Hadoop for providing data query and analysis.[3][4] Hive gives an SQL-like interface to query data stored in various databases and file systems that integrate with Hadoop. Traditional SQL queries must be implemented in the MapReduce Java API to execute SQL applications and queries over distributed data.

Hive provides the necessary SQL abstraction to integrate SQL-like queries (HiveQL) into the underlying Java without the need to implement queries in the low-level Java API. Hive facilitates the integration of SQL-based querying languages with Hadoop, which is commonly used in data warehousing applications.[5] While initially developed by Facebook, Apache Hive is used and developed by other companies such as Netflix and the Financial Industry Regulatory Authority (FINRA).[6][7] Amazon maintains a software fork of Apache Hive included in Amazon Elastic MapReduce on Amazon Web Services.[8]

  1. ^ "Release release-1.0.0 · apache/Hive". GitHub.
  2. ^ a b "Apache Hive - Downloads". Retrieved 21 November 2022.
  3. ^ Venner, Jason (2009). Pro Hadoop. Apress. ISBN 978-1-4302-1942-2.
  4. ^ Yin Huai, Ashutosh Chauhan, Alan Gates, Gunther Hagleitner, Eric N.Hanson, Owen O'Malley, Jitendra Pandey, Yuan Yuan, Rubao Lee, and Xiaodong Zhang (2014). "Major Technical Advancements in Apache Hive". SIGMOD' 14. pp. 1235–1246. doi:10.1145/2588555.2595630.{{cite conference}}: CS1 maint: multiple names: authors list (link)
  5. ^ Programming Hive [Book].
  6. ^ Use Case Study of Hive/Hadoop
  7. ^ OSCON Data 2011, Adrian Cockcroft, "Data Flow at Netflix" on YouTube
  8. ^ Amazon Elastic MapReduce Developer Guide