Developer(s) | Apache Software Foundation |
---|---|
Stable release | |
Repository | Tika Repository |
Written in | Java |
Operating system | Cross-platform |
Type | Search and index API |
License | Apache License 2.0 |
Website | tika |
Apache Tika is a content detection and analysis framework, written in Java, stewarded at the Apache Software Foundation.[1] It detects and extracts metadata and text from over a thousand different file types, and as well as providing a Java library, has server and command-line editions suitable for use from other programming languages.