Apache Arrow

Developer(s): Apache Software Foundation
Initial release: October 10, 2016
Stable release: 13.0.0[1] / 23 August 2023
Repository: https://github.com/apache/arrow
Written in: C, C++, C#, Go, Java, JavaScript, MATLAB, Python, R, Ruby, Rust
Type: Data format, algorithms
License: Apache License 2.0
Website: arrow.apache.org

Apache Arrow is a language-agnostic software framework for developing data analytics applications that process columnar data. It contains a standardized column-oriented memory format that can represent both flat and hierarchical data for efficient analytic operations on modern CPU and GPU hardware.[2][3][4][5][6] This reduces or eliminates factors that limit the feasibility of working with large data sets, such as the cost, volatility, or physical constraints of dynamic random-access memory.[7]
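
To illustrate what a column-oriented layout for flat and hierarchical data can look like, the following is a minimal sketch in plain Python. It does not use the Arrow libraries or their API. The sample records and the helper names (`fixed_width_column`, `string_column`, `list_of_strings_column`, `get_string`) are hypothetical, and the layout is deliberately simplified: it follows the general idea of a validity bitmap, an offsets buffer, and a contiguous data buffer, but omits details such as buffer padding and alignment, and it does not track null lists.

```python
from array import array

# Hypothetical sample records, written row by row.
rows = [
    {"id": 1, "name": "ann", "tags": ["a", "b"]},
    {"id": 2, "name": None, "tags": []},
    {"id": 3, "name": "cy", "tags": ["c"]},
]


def fixed_width_column(values):
    """Pack integers into one contiguous buffer plus a validity bitmap."""
    validity = bytearray((len(values) + 7) // 8)
    data = array("q")  # signed 64-bit integers
    for i, v in enumerate(values):
        if v is not None:
            validity[i // 8] |= 1 << (i % 8)  # least-significant bit first
        data.append(0 if v is None else v)  # null slots hold a placeholder
    return validity, data


def string_column(values):
    """Store strings as a validity bitmap, an offsets buffer, and one byte buffer."""
    validity = bytearray((len(values) + 7) // 8)
    offsets = array("i", [0])  # value i spans data[offsets[i]:offsets[i + 1]]
    data = bytearray()
    for i, v in enumerate(values):
        if v is not None:
            validity[i // 8] |= 1 << (i % 8)
            data += v.encode("utf-8")
        offsets.append(len(data))  # a null repeats the previous offset
    return validity, offsets, data


def list_of_strings_column(values):
    """Store nested lists as offsets into a single flattened child column."""
    offsets = array("i", [0])
    flat = []
    for v in values:
        flat.extend(v)
        offsets.append(len(flat))
    return offsets, string_column(flat)


def get_string(column, i):
    """Read value i back out of a column built by string_column."""
    validity, offsets, data = column
    if not ((validity[i // 8] >> (i % 8)) & 1):
        return None
    return bytes(data[offsets[i]:offsets[i + 1]]).decode("utf-8")


# Convert the row-oriented records into one column per field.
id_column = fixed_width_column([r["id"] for r in rows])
name_column = string_column([r["name"] for r in rows])
tag_offsets, tag_values = list_of_strings_column([r["tags"] for r in rows])
```

In this sketch the `name` column ends up as the byte buffer `b"anncy"` with offsets `[0, 3, 3, 5]`, so the missing second value costs no data bytes, and the nested `tags` field becomes a flat child column of strings plus the offsets `[0, 2, 2, 3]`. Keeping each field in its own contiguous buffer is what allows a scan over one column to avoid touching the others.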

  1. ^ "Apache Arrow 13.0.0 (23 August 2023)". 23 August 2023. Retrieved 21 September 2023.
  2. ^ "Apache Arrow and Distributed Compute with Kubernetes". 13 December 2018.
  3. ^ Baer, Tony (17 February 2016). "Apache Arrow: Lining Up The Ducks In A Row... Or Column". Seeking Alpha.
  4. ^ Baer, Tony (25 February 2019). "Apache Arrow: The little data accelerator that could". ZDNet.
  5. ^ Hall, Susan (23 February 2016). "Apache Arrow's Columnar Layouts of Data Could Accelerate Hadoop, Spark". The New Stack.
  6. ^ Yegulalp, Serdar (27 February 2016). "Apache Arrow aims to speed access to big data". InfoWorld.
  7. ^ [Source not given: the named reference "biorxiv" is not defined.]