This is a simple OCR (Optical Character Recognition) application built using Streamlit and Tesseract. The application allows users to upload images or PDF files and extracts text from them.
Features āØ
Upload and process images (PNG, JPG, JPEG).
Upload and process PDF files.
Display the uploaded image or PDF.
Extract and display text from the uploaded files using Tesseract OCR.
User-friendly interface with Streamlit.
Fast and accurate text extraction.
Cross-platform support (Windows, macOS, Linux).
About Tesseract š®
Tesseract is one of the most accurate open-source OCR engines available today. Developed by HP and currently maintained by Google, Tesseract has support for multiple languages and can recognize over 100 different languages. It uses machine learning techniques, including LSTM (Long Short-Term Memory) neural networks, for text recognition.
Key Features of Tesseract:
Open Source: Tesseract is available under the Apache License 2.0, making it free to use and modify.
Multi-Language Support: Tesseract can recognize and process text in over 100 languages.
Highly Accurate: Tesseract uses machine learning algorithms to achieve high accuracy in text recognition.
Versatile: It can handle a variety of input formats, including images, PDFs, and even multi-page documents.
How OCR Works š ļø
Optical Character Recognition (OCR) is a technology that converts different types of documents, such as scanned paper documents, PDFs, or images captured by a digital camera, into editable and searchable data. OCR processes the text in an image through the following steps:
Image Preprocessing: Enhances the image quality and prepares it for recognition. This step may include noise reduction, binarization, and deskewing.
Text Detection: Identifies the text areas in the image.
Character Recognition: Converts the detected text areas into machine-encoded text using pattern recognition algorithms.
Post-Processing: Corrects errors and improves the accuracy of the recognized text.
OCR Process Flow š
Prerequisites
Before you begin, ensure you have met the following requirements:
Python 3.6 or higher
pip package installer
Installation š
Clone the repository:
git clone https://github.com/SnehaDeshmukh28/OCR-Vision.git
cd ocr-vision