Optical Character Recognition (OCR) in Python

Current Status

Not Enrolled

Price

Free

Get Started

What you will learn

Use Tesseract, EAST and EasyOCR tools for text recognition in images and videos
Understand the differences between OCR in controlled and natural environments
Apply image pre-processing techniques to improve image quality, such as: thresholding, inversion, resizing, morphological operations and noise reduction
Use EAST architecture and EasyOCR library for better performance in natural scenes
Train an OCR from scratch using Deep Learning and Convolutional Neural Networks
Application of natural language processing techniques in the texts extracted by OCR (word cloud and named entity recognition)
License plate reading

Requirements

Programming logic
Basic Python programming

Description

Within the area of Computer Vision is the sub-area of Optical Character Recognition (OCR), which aims to transform images into texts. OCR can be described as converting images containing typed, handwritten or printed text into characters that a machine can understand. It is possible to convert scanned or photographed documents into texts that can be edited in any tool, such as the Microsoft Word. A common application is automatic form reading, in which you can send a photo of your credit card or your driver’s license, and the system can read all your data without the need to type them manually. A self-driving car can use OCR to read traffic signs and a parking lot can guarantee access by reading the license plate of the cars!

To take you to this area, in this course you will learn in practice how to use OCR libraries to recognize text in images and videos, all the code implemented step by step using the Python programming language! We are going to use Google Colab, so you do not have to worry about installing libraries on your machine, as everything will be developed online using Google’s GPUs! You will also learn how to build your own OCR from scratch using Deep Learning and Convolutional Neural Networks! Below you can check the main topics of the course:

Recognition of texts in images and videos using Tesseract, EasyOCR and EAST
Search for specific terms in images using regular expressions
Techniques for improving image quality, such as: thresholding, color inversion, grayscale, resizing, noise removal, morphological operations and perspective transformation
EAST architecture and EasyOCR library for better performance in natural scenes
Training an OCR from scratch using TensorFlow and modern Deep Learning techniques, such as Convolutional Neural Networks
Application of natural language processing techniques in the texts extracted by OCR (word cloud and named entity recognition)
License plate reading

These are just some of the main topics! By the end of the course, you will know everything you need to create your own text recognition projects using OCR!

Who this course is for

Anyone interested in OCR (Optical Character Recognition)
Undergraduate students who are studying subjects related to Artificial Intelligence, Digital Image Processing or Computer Vision
Data Scientists who want to increase their knowledge in Computer Vision
Professionals interested in developing professional optical character recognition solutions
People interested in creating their own custom OCR

Course Content

Expand All

Introduction 3 Topics

Expand

Lesson Content

0% Complete 0/3 Steps

Course content

Introduction to OCR

Course materials

OCR with Tesseract 10 Topics

Expand

Lesson Content

0% Complete 0/10 Steps

Introduction to Tesseract

Preparing the environment

First text recognition

Support for other languages

Page segmentation mode (PSM)

Selection of texts 1

Selection of texts 2

Selection of texts 3

Search using regular expressions

Detections in natural scenarios

Techniques for image preprocessing 16 Topics

Expand

Lesson Content

0% Complete 0/16 Steps

Grayscale

Thresholding – intuition

Simple thresholding

Thresholding with Otsu method

Adaptive thresholding

Gaussian adaptative thresholding

Color inversion

Resizing – intuition

Resizing – implementation

Morphological operations – intuition

Morphological operations – implementation

Noise removal – intuition

Noise removal – implementation

Text recognition with OCR

HOMEWORK

Homework solution

OCR with EAST for natural scenes 6 Topics

Expand

Lesson Content

0% Complete 0/6 Steps

EAST – introduction

Preprocessing the image

Loading the neural network

Decoding the image 1

Decoding the image 2

Text recognition

Training a custom OCR 18 Topics

Expand

Lesson Content

0% Complete 0/18 Steps

Importing the libraries

MNIST 0-9 dataset

Kaggle A-Z dataset

Joining the datasets

Preprocessing the data

Building the neural network

Training the neural network

Evaluating the neural network

Saving the neural network

Testing with images

Preparing the environment

Preprocessing the image

Contour detection

Processing the detections 1

Processing the detections 2

Character recognition

Problems with 0 and O, 1 and l, 5 and S

Problems with undetected texts

Natural scenarios with EasyOCR 5 Topics

Expand

Lesson Content

0% Complete 0/5 Steps

Preparing the environments

Text recognition

Writing the results on the image

Other languages – French and Chinese

Text recognition (background)

OCR in videos 5 Topics

Expand

Lesson Content

0% Complete 0/5 Steps

Preparing the environment

Video settings

Processing the video

OCR with EAST and Tesseract

OCR with EasyOCR

Project 1: Searching for specific terms 7 Topics

Expand

Lesson Content

0% Complete 0/7 Steps

Preparing the environment

Text recognition

Searching for texts

Word cloud

Named entity recognition

Search for texts in images

Saving the results

Project 2: Scanner + OCR 6 Topics

Expand

Lesson Content

0% Complete 0/6 Steps

Preparing the environment

Contour detection

Perspective transformation

OCR with Tesseract

Improving image quality

Putting all together

Project 3: License plate reading 3 Topics

Expand

Lesson Content

0% Complete 0/3 Steps

Preprocessing the image

Text recognition

Improving image quality

Extra content 1: Artificial neural networks 8 Topics

Expand

Lesson Content

0% Complete 0/8 Steps

Biological fundamentals

Single layer perceptron

Multilayer perceptron – sum and activation functions

Multilayer perceptron – error calculation

Gradient descent

Delta parameter

Updating weights with backpropagation

Bias, error, stochastic gradient descent, and more parameters

Extra content 2: Convutional neural networks 5 Topics

Expand

Lesson Content

0% Complete 0/5 Steps

Introduction to convolutional neural networks

Convolutional operator

Pooling

Flattening

Dense neural network

Final remarks 1 Topic

Expand

Lesson Content

0% Complete 0/1 Steps

Final remarks

Ratings and Reviews

4.7

Avg. Rating

49 Ratings

What's your experience? We'd love to know!

Review posted on Udemy

Posted 3 months ago

by Mohammad Zaghy Zalayetha Sofjan

Great course, hope this can help me during my thesis

Review posted on Udemy

Posted 4 months ago

by Litzy Huel

Amazing educational opportunity! I can now use Python with confidence to recognize optical characters using OCR techniques.

Review posted on Udemy

Posted 4 months ago

by Youssef Rachad

Inreible

Review posted on Udemy

Posted 4 months ago

by Kaleb Lowe

I learned practical OCR techniques in Python quickly, making it easy to process images and texts.

Review posted on Udemy

Posted 4 months ago

by Priscilla Kunde

This course transformed my understanding of OCR, teaching me practical Python skills for text extraction.

Review posted on Udemy

Posted 4 months ago

by Diego Aponte

Fantastic course! It made OCR implementation in Python simple and accessible for beginners.

Review posted on Udemy

Posted 4 months ago

by Rowland Tremblay

Amazing course! It helped me master OCR in Python and improved my text extraction skills.

Review posted on Udemy

Posted 5 months ago

by Puke Face Blam Blam

This content is informative but needs significant improvement in delivery: Strengths: Quality technical information Strong sections with good flow Shows teaching potential Areas for Improvement: Repetitive explanations of basic concepts (e.g., vectors) Overly basic math/Python coverage for a Tesseract/OCR course Inconsistent pacing; sometimes robotic delivery Recommendations: Condense content by 75% Focus on OCR/Tesseract/etc-specific material Provide external links for prerequisites (Khan Academy) instead of making everyone sit through you repeating the same thing 3 different ways. Let students review fundamentals independently. This topic is not meant for children. Your audience is probably people with developer experience, respect our time.

Review posted on Udemy

Posted 6 months ago

by Gaylord Enrique Carrillo Caballero

Just starting the course

Review posted on Udemy

Posted 6 months ago

by Laura Thomasen

Such a great course. The level was just right and I could easily apply Jones' teachings to my own projects!

Show more reviews

What's your experience? We'd love to know!