Apache Drill

Developer(s): Apache Software Foundation
Initial release: May 19, 2015
Stable release: 1.20.3 / January 7, 2023
Repository: Drill Repository
Written in: Java
Operating system: Cross-platform
License: Apache License 2.0
Website: drill.apache.org

Apache Drill is an open-source software framework that supports data-intensive distributed applications for interactive analysis of large-scale datasets. Built chiefly through contributions from developers at MapR,[1][2] Drill is inspired by Google's Dremel system.[3] Tomer Shiran is the founder of the Apache Drill project.[5] Drill was designated an Apache Software Foundation top-level project in December 2014.[4][6]

Drill supports a variety of NoSQL databases and file systems, including Alluxio, HBase, MongoDB, MapR-DB, HDFS, MapR-FS, Amazon S3, Azure Blob Storage, Google Cloud Storage, Swift, NAS and local files. A single query can join data from multiple datastores.
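
As an illustration of a cross-datastore join, the sketch below builds a single SQL statement that joins a MongoDB collection with a JSON file and wraps it in the request body accepted by Drill's REST query endpoint. It assumes a Drillbit reachable on localhost at Drill's default web port (8047) and storage plugins registered under the names `mongo` and `dfs`; the collection, file path, and column names are hypothetical, and the helper functions are illustrative rather than part of Drill.

```python
import json
import urllib.request

# Assumption: a Drillbit running locally on Drill's default web port.
DRILL_URL = "http://localhost:8047/query.json"


def build_join_query(mongo_table, json_path):
    """Return one SQL statement joining a MongoDB collection with a JSON file.

    `mongo` and `dfs` are storage plugin names as registered in Drill;
    the table, path, and column names are hypothetical.
    """
    return (
        "SELECT u.name, o.total "
        f"FROM mongo.{mongo_table} AS u "
        f"JOIN dfs.`{json_path}` AS o "
        "ON u.user_id = o.user_id"
    )


def build_payload(sql):
    """Encode the SQL text as the JSON body for a Drill REST query."""
    return json.dumps({"queryType": "SQL", "query": sql}).encode("utf-8")


def run_query(sql, url=DRILL_URL):
    """POST the query to Drill and return the parsed JSON response."""
    request = urllib.request.Request(
        url,
        data=build_payload(sql),
        headers={"Content-Type": "application/json"},
        method="POST",
    )
    with urllib.request.urlopen(request) as response:
        return json.load(response)
```

Calling `run_query(build_join_query("shop.users", "/data/orders.json"))` would submit the join, provided both storage plugins are enabled and the named data exists.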

Drill's datastore-aware optimizer automatically restructures a query plan to leverage the datastore's internal processing capabilities. Drill also supports data locality when Drill and the datastore run on the same nodes.[7]

  1. Friedman, Ellen (21 Sep 2015). "Apache Drill: Tracking its history as an open source community". Archived from the original on 18 March 2016.
  2. "Brief About The Differences between Apache Drill Vs Presto". HitechNectar. Retrieved 2023-04-13.
  3. "Spark SQL vs. Apache Drill-War of the SQL-on-Hadoop Tools". ProjectPro. Retrieved 2022-11-15.
  4. "The Apache Software Foundation Announces Apache Drill as a Top-Level Project". 2 December 2014. Retrieved 2014-12-02.
  5. Vizard, Michael (2021-09-01). "Apache Software Foundation updates Drill for broader SQL queries". VentureBeat. Retrieved 2022-10-20.
  6. "Apache Drill Eliminates ETL, Data Transformation for MapR Database". The New Stack. 2016-04-11. Retrieved 2022-11-15.
  7. "Apache Drill - Schema-free SQL for Hadoop, NoSQL and Cloud Storage". drill.apache.org. Retrieved 2015-12-29.