OCR Project using Transfer Learning

📌 Overview

This repository contains an Optical Character Recognition (OCR) system built using Transfer Learning with modern deep learning techniques. The goal of this project is to extract text from images efficiently with high accuracy using pre-trained models.

Kindly download the ZIP file which contains sample input images. This will help you quickly test and run the OCR App without generating your own data.

🚀 Key Features

✅ Utilizes Transfer Learning for faster training and better accuracy
✅ Supports printed and handwritten text recognition
✅ Preprocessing pipeline for noise removal and image enhancement
✅ Model training, validation, and evaluation modules included
✅ Easy to deploy and extend

🧠 Technologies Used

Component	Technology/Library
Framework	TensorFlow / PyTorch
Transfer Model	CNN, CRNN, or Transformer
Image Processing	OpenCV, PIL
OCR Engine	Deep Learning-based Model
Language Support	English (extendable)

📁 Project Structure

A clear breakdown of the folder structure:

📂 OCR-Project/
├── 📁 artifacts/       # Stores trained OCR models (pickle files) generated from screenshot dataset
├── 📁 src/             # Core Python scripts for model building, training, and OCR prediction
│   ├── model.py        # Model architecture / transfer learning implementation
│   ├── preprocessing.py# Image preprocessing and utilities
│   └── inference.py    # Script to load model and perform prediction
├── 📁 static/          # Frontend assets
│   ├── css/            # Styling files
│   ├── js/             # JavaScript logic
│   └── images/         # UI images/assets
├── 📁 templates/       # Flask HTML templates
│   ├── base.html       # Reusable main layout
│   ├── home.html       # Home page (text extraction UI)
│   └── login.html      # User authentication page
├── app.py              # Main Flask application integrating frontend and backend
├── requirements.txt    # Dependency file to install required libraries
└── README.md           # Project documentation

🔧 Installation

# Clone the repository
$ git clone https://github.com/tusharkolekar24/OCR
$ cd OCR

# Create virtual environment (optional)
$ python -m venv venv
$ source venv/bin/activate   # Linux/Mac
$ venv\Scripts\activate     # Windows

# Install dependencies
$ pip install -r requirements.txt

📊 Dataset

You can use any OCR dataset like IAM, MNIST OCR, or a custom dataset.

🏗️ How It Works

Image Preprocessing – Resize, grayscale conversion, noise reduction
Model Training – Using transfer learning from a pre-trained network
Prediction – Model outputs recognized text from the image
Evaluation – Accuracy and CER (Character Error Rate) calculation

▶️ Training the Model

python src/train.py --epochs 20 --batch_size 32

🔍 Running OCR Inference

python src/infer.py --image_path sample.jpg

📈 Performance Metrics

✅ Accuracy: 98%
✅ Inference Speed: 2s/image
✅ Precision: 95%
✅ Recall: 92%
✅F1 Score : 93%

📦 Deployment

You can deploy this OCR model via:

Flask/FastAPI web app (app.py)
Docker Container

🔮 Future Enhancements

Add support for multiple languages
Improve accuracy using Attention-based Transformers
Deploy on cloud platforms (AWS, Azure)

🤝 Contributing

Contributions are welcome! Feel free to submit pull requests or open issues.

📄 License

This project is licensed under the MIT License. See the LICENSE file for more details.

👨‍💻 Author

Developed by Tushar Kolekar

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

OCR Project using Transfer Learning

📌 Overview

🚀 Key Features

🧠 Technologies Used

📁 Project Structure

🔧 Installation

📊 Dataset

🏗️ How It Works

▶️ Training the Model

🔍 Running OCR Inference

📈 Performance Metrics

📦 Deployment

🔮 Future Enhancements

🤝 Contributing

📄 License

👨‍💻 Author

⭐ If you find this project useful, consider giving it a star on GitHub!

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 18 Commits
artifacts		artifacts
src		src
static		static
templates		templates
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
app.py		app.py
requirement.txt		requirement.txt
requirements.txt		requirements.txt

Folders and files

Latest commit

History

Repository files navigation

OCR Project using Transfer Learning

📌 Overview

🚀 Key Features

🧠 Technologies Used

📁 Project Structure

🔧 Installation

📊 Dataset

🏗️ How It Works

▶️ Training the Model

🔍 Running OCR Inference

📈 Performance Metrics

📦 Deployment

🔮 Future Enhancements

🤝 Contributing

📄 License

👨‍💻 Author

⭐ If you find this project useful, consider giving it a star on GitHub!

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages