Original author(s) | Ray Smith, Hewlett-Packard[1] |
---|---|
Developer(s) | Google and others |
Stable release | 5.5.0[2] / 10 November 2024 |
Written in | C and C++ |
Operating system | Linux, Windows, and macOS |
Available in | Interface: English. Recognition: Afrikaans, Albanian, Arabic, Azerbaijani, Basque, Belarusian, Bengali, Bulgarian, Catalan, Czech, Cherokee, Croatian, Danish, Dutch, English, Esperanto, Estonian, Finnish, French, Galician, German, Greek, Hindi, Hebrew, Hungarian, Indonesian, Italian, Japanese, Kannada, Korean, Latvian, Lithuanian, Malayalam, Macedonian, Maltese, Malay, Norwegian, Polish, Portuguese, Romanian, Russian, Serbian, Slovak, Slovenian, Spanish, Swahili, Swedish, Tagalog, Tamil, Telugu, Thai, Turkish, Ukrainian, Vietnamese[3] (more can be added using included training files)[4] |
Type | Optical character recognition |
License | Apache License 2.0 |
Website | github |
Tesseract is an optical character recognition (OCR) engine for various operating systems.[5] It is free software, released under the Apache License 2.0.[1][6][7] Originally developed by Hewlett-Packard as proprietary software in the 1980s, it was released as open source in 2005, and Google began sponsoring its development in 2006.[8]
In 2006, Tesseract was considered one of the most accurate open-source OCR engines available.[7][9]