Tesseract OCR

Open-source OCR engine supporting over 100 languages

About

Tesseract is an open-source optical character recognition (OCR) engine that converts images of text into machine-readable text. It supports over 100 languages and is distributed under the Apache 2.0 license, which permits integration into both open-source and commercial software projects.
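
As a rough illustration of what integration can look like, the sketch below calls the Tesseract command-line program from Python. It assumes the `tesseract` executable is installed and on the PATH, and that the language data for the requested language is available. The function names `build_tesseract_command` and `image_to_text` and the file name `scan.png` are hypothetical, chosen only for this example; this is one possible approach, not an official binding.

```python
import shutil
import subprocess


def build_tesseract_command(image_path, lang="eng"):
    """Return the argument list for a Tesseract CLI call (hypothetical helper)."""
    # Passing "stdout" as the output base makes the CLI print the recognized
    # text instead of writing a .txt file; -l selects the language.
    return ["tesseract", image_path, "stdout", "-l", lang]


def image_to_text(image_path, lang="eng"):
    """Run Tesseract on an image file and return the recognized text."""
    # Assumption: the tesseract executable is installed separately.
    if shutil.which("tesseract") is None:
        raise RuntimeError("tesseract executable not found on PATH")
    result = subprocess.run(
        build_tesseract_command(image_path, lang),
        capture_output=True,
        text=True,
        check=True,  # raise CalledProcessError if Tesseract exits with an error
    )
    return result.stdout


if __name__ == "__main__":
    # Shows the command that would be run for a German-language scan.
    print(build_tesseract_command("scan.png", lang="deu"))
```

Building the argument list in a separate function keeps the command easy to inspect without needing Tesseract installed; `image_to_text("scan.png")` would run the actual recognition.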