Apache Tika

Tika
Developer(s): Apache Software Foundation
Stable release: 2.9.1 / 20 October 2023
Repository: Tika Repository
Written in: Java
Operating system: Cross-platform
Type: Search and index API
License: Apache License 2.0
Website: tika.apache.org

Apache Tika is a content detection and analysis framework written in Java and stewarded by the Apache Software Foundation.[1] It detects and extracts metadata and text from over a thousand different file types. In addition to a Java library, it provides server and command-line editions suitable for use from other programming languages.
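As an illustration of how the server edition might be used from another programming language, the following Python sketch builds HTTP requests for a Tika server. It rests on several assumptions not stated in this article: that a server is running locally on port 9998, and that it accepts documents via PUT at the `tika` (text), `meta` (metadata), and `detect/stream` (type detection) endpoints. The helper names `build_tika_request` and `send_to_tika` are hypothetical and are not part of Tika itself.

```python
import urllib.request

# Assumption: a Tika server is reachable at this address (port 9998 is assumed).
TIKA_SERVER = "http://localhost:9998"


def build_tika_request(endpoint: str, payload: bytes, accept: str) -> urllib.request.Request:
    """Build, without sending, a PUT request that uploads a document to the server.

    `endpoint` is assumed to be one of "tika", "meta", or "detect/stream".
    """
    return urllib.request.Request(
        url=f"{TIKA_SERVER}/{endpoint}",
        data=payload,
        method="PUT",
        headers={"Accept": accept},
    )


def send_to_tika(endpoint: str, payload: bytes, accept: str) -> str:
    """Send the document and return the server's response body as text."""
    request = build_tika_request(endpoint, payload, accept)
    with urllib.request.urlopen(request) as response:
        return response.read().decode("utf-8")
```

With a server running, a call such as `send_to_tika("tika", open("report.pdf", "rb").read(), "text/plain")` would be expected to return the extracted text, where `report.pdf` is a placeholder file name. Keeping request construction separate from sending makes the sketch easy to inspect without a live server.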

  1. "Apache Tika". Retrieved 2016-04-15.