Developer(s) | Freebase, then Google, now open source community |
---|---|
Initial release | November 10, 2010 |
Stable release | 3.8.2[1]
/ 8 July 2024 |
Repository | |
Written in | Java[2] |
Platform | Microsoft Windows, Linux, macOS |
Available in | English, Italian, Chinese, Japanese, French, German |
Type | |
License | BSD License |
Website | openrefine |
OpenRefine is an open-source desktop application for data cleanup and transformation to other formats, an activity commonly known as data wrangling.[3] It is similar to spreadsheet applications, and can handle spreadsheet file formats such as CSV, but it behaves more like a database.
It operates on rows of data which have cells under columns, similar to the manner in which relational database tables operate. OpenRefine projects consist of one table, whose rows can be filtered using facets that define criteria (for example, showing rows where a given column is not empty).
Unlike spreadsheets, most operations in OpenRefine are done on all visible rows, for example, the transformation of all cells in all rows under one column,[4] or the creation of a new column based on existing data. Actions performed on a dataset are stored the project and can be 'replayed' on other datasets. Formulas are not stored in cells, but are used to transform the data. Transformation is done only once.[5] Formula expressions can be written in General Refine Expression Language (GREL),[6] in Jython (i.e., Python), and in Clojure.[7]
The program operates as a local web app: it starts a web server and opens the default browser to 127.0.0.1:3333.