Current Status
Not Enrolled
Price
Free
Get Started

What you will learn

  • Use Tesseract, EAST and EasyOCR tools for text recognition in images and videos
  • Understand the differences between OCR in controlled and natural environments
  • Apply image pre-processing techniques to improve image quality, such as: thresholding, inversion, resizing, morphological operations and noise reduction
  • Use EAST architecture and EasyOCR library for better performance in natural scenes
  • Train an OCR from scratch using Deep Learning and Convolutional Neural Networks
  • Application of natural language processing techniques in the texts extracted by OCR (word cloud and named entity recognition)
  • License plate reading

Requirements

  • Programming logic
  • Basic Python programming

Description

Within the area of Computer Vision is the sub-area of Optical Character Recognition (OCR), which aims to transform images into texts. OCR can be described as converting images containing typed, handwritten or printed text into characters that a machine can understand. It is possible to convert scanned or photographed documents into texts that can be edited in any tool, such as the Microsoft Word. A common application is automatic form reading, in which you can send a photo of your credit card or your driver’s license, and the system can read all your data without the need to type them manually. A self-driving car can use OCR to read traffic signs and a parking lot can guarantee access by reading the license plate of the cars!

To take you to this area, in this course you will learn in practice how to use OCR libraries to recognize text in images and videos, all the code implemented step by step using the Python programming language! We are going to use Google Colab, so you do not have to worry about installing libraries on your machine, as everything will be developed online using Google’s GPUs! You will also learn how to build your own OCR from scratch using Deep Learning and Convolutional Neural Networks! Below you can check the main topics of the course:

  • Recognition of texts in images and videos using Tesseract, EasyOCR and EAST
  • Search for specific terms in images using regular expressions
  • Techniques for improving image quality, such as: thresholding, color inversion, grayscale, resizing, noise removal, morphological operations and perspective transformation
  • EAST architecture and EasyOCR library for better performance in natural scenes
  • Training an OCR from scratch using TensorFlow and modern Deep Learning techniques, such as Convolutional Neural Networks
  • Application of natural language processing techniques in the texts extracted by OCR (word cloud and named entity recognition)
  • License plate reading

These are just some of the main topics! By the end of the course, you will know everything you need to create your own text recognition projects using OCR!

Who this course is for

  • Anyone interested in OCR (Optical Character Recognition)
  • Undergraduate students who are studying subjects related to Artificial Intelligence, Digital Image Processing or Computer Vision
  • Data Scientists who want to increase their knowledge in Computer Vision
  • Professionals interested in developing professional optical character recognition solutions
  • People interested in creating their own custom OCR

Course Content

Expand All
Lesson Content
0% Complete 0/1 Steps

Ratings and Reviews

4.7
Avg. Rating
48 Ratings
5
37
4
9
3
2
2
0
1
0
What's your experience? We'd love to know!
Review posted on Udemy
Posted 2 months ago
by Litzy Huel

Amazing educational opportunity! I can now use Python with confidence to recognize optical characters using OCR techniques.

×
Preview Image
Review posted on Udemy
Posted 2 months ago
by Youssef Rachad

Inreible

×
Preview Image
Review posted on Udemy
Posted 2 months ago
by Kaleb Lowe

I learned practical OCR techniques in Python quickly, making it easy to process images and texts.

×
Preview Image
Review posted on Udemy
Posted 2 months ago
by Priscilla Kunde

This course transformed my understanding of OCR, teaching me practical Python skills for text extraction.

×
Preview Image
Review posted on Udemy
Posted 2 months ago
by Diego Aponte

Fantastic course! It made OCR implementation in Python simple and accessible for beginners.

×
Preview Image
Review posted on Udemy
Posted 2 months ago
by Rowland Tremblay

Amazing course! It helped me master OCR in Python and improved my text extraction skills.

×
Preview Image
Review posted on Udemy
Posted 3 months ago
by Puke Face Blam Blam

This content is informative but needs significant improvement in delivery: Strengths: Quality technical information Strong sections with good flow Shows teaching potential Areas for Improvement: Repetitive explanations of basic concepts (e.g., vectors) Overly basic math/Python coverage for a Tesseract/OCR course Inconsistent pacing; sometimes robotic delivery Recommendations: Condense content by 75% Focus on OCR/Tesseract/etc-specific material Provide external links for prerequisites (Khan Academy) instead of making everyone sit through you repeating the same thing 3 different ways. Let students review fundamentals independently. This topic is not meant for children. Your audience is probably people with developer experience, respect our time.

×
Preview Image
Review posted on Udemy
Posted 3 months ago
by Gaylord Enrique Carrillo Caballero

Just starting the course

×
Preview Image
Review posted on Udemy
Posted 4 months ago
by Laura Thomasen

Such a great course. The level was just right and I could easily apply Jones' teachings to my own projects!

×
Preview Image
Review posted on Udemy
Posted 4 months ago
by Önder Can Köse

so far soo good

×
Preview Image
Show more reviews
What's your experience? We'd love to know!
Scroll to Top