How to Preprocess Images for Text OCR in Python (OCR in Python Tutorials 02.02)

The most important Python script I ever wrote

Detect Text in Images with Python - pytesseract vs. easyocr vs keras_ocr

How Employees Are Coffee Badging To Avoid Full Days At The Office

The Most Illegal Baseball Bat Ever Created

Every Home Alone Is Worse Than The Last

How to use Tesseract OCR in a Python script (pytesseract)

JayMartMedia

Просмотров 45 тыс.

Добавить в
- Мой плейлист
- Посмотреть позже
Поделиться

Поделиться

HTML-код

Размер видео:

Показать панель управления

Автовоспроизведение

Автоповтор

Опубликовано: 7 янв 2025

Комментарии • 43

@YorukaValorant 10 месяцев назад ⁺¹⁶
Thank you. I was expecting a bad video because of the view count but this Got right to the point.
@JayMartMedia 10 месяцев назад ⁺⁴
Glad you found the video helpful! Thanks for commenting!
Most of my videos are pretty focused so they get views over time as people search for a topic, as opposed to trendy influencer style videos that appeal to lots of people 😁
@scottnelson5270 4 месяца назад ⁺¹
@@JayMartMedia as it should be, cheers for Jay! you'll win for this over the long run.
@markomarjanovic8348 Месяц назад ⁺²
Shortest most useful video, no BS spot on!
@JayMartMedia Месяц назад
That's what I love to hear! Glad you found it helpful!
@Matin_SenPai 2 месяца назад ⁺³
I usually don't comment anything, but Thanks for short and useful video.
@JayMartMedia 2 месяца назад ⁺¹
Glad you found it helpful! Thanks for the comment!
@Mark_Morad 10 месяцев назад ⁺²
How are you, do you know how can I include the tesseract OCR executable in my python executable file? That way when I distribute my executable other users can use the OCR without installing the machine on their device.
@JayMartMedia 10 месяцев назад ⁺¹
I'm not aware of a way to include the tesseract executable in the python script.
You may be able to create a zip file with the python and tesseract, but this would likely depend on the users each having python installed, and using the same OS (same OS that tesseract is built for).
Alternatively you could check out tesseract.js which runs in the browser, or you could create a Python web app so that users submit images to the website UI using their browser, and then the image file would be processed on the server via tesseract.
@AkhilNagori-v4u 3 месяца назад ⁺¹
Hi, do you know how it would be possible to do live detection with my webcam?
@JayMartMedia 3 месяца назад
Unfortunately I am not aware of a way to do this with tesseract
@AkhilNagori-v4u 3 месяца назад
@@JayMartMedia Oh okay, no problem
@stevetedom7398 6 месяцев назад
Hello, please I would like to know how to improve the precision of tesseract without labeling. I am currently working on an invoice ocerization project, and the problem I encounter is that I have a huge variety in the format of my invoices, I would say nearly 4000 to 5000 different formats, and the problem I encounter with my OCR (I use tesseract) is that it extracts the raw text without taking into account that it is an invoice (the zones etc...), it retrieves the information line by line, I cannot label it given the number of invoice formats, what do you offer me for this? Can bert or spacy be useful in this case?
@NeeharikaJha 4 месяца назад
Hello, I need guidance on this. Any leads on how to proceed?
@minhhu-j1r 24 дня назад
hello sir, i have 100 images, in every image, it's have a code include 6 number of code, i want to extract these 100 images into text, can i do it quickly
@JackDecker-i8k 2 месяца назад
Do you know how to set environment variables in visual studio code similarly to how you did it in windows command prompt?
@derekegenti 6 месяцев назад
How can I edit this script to extract text from scanned documents? Thanks.
@hansimuli 3 месяца назад ⁺¹
Thanks. Great video. ❤ Subscribed
@JayMartMedia 3 месяца назад
Glad you found it helpful!
@YuvrajWithAGuitar 7 месяцев назад
I have some 2000 pdf files which are invoices. I want invoice number, date and total amount from them... Many invoices are of different format . What the nest way to do it?
@banks927 Месяц назад ⁺¹
Hello! Software engineer here. You'll want to start by making sure all your invoices look/translate the same. A lot of people want to couple Tesseract with generative AI in the same way you're looking to do but the problem with that request is mainly that the context IN isn't always the same. If your invoices are pretty much identical in format, then you're one step closer. Assuming they are, you'll want to isolate that data with a little string manipulation so that each time you run the script, you'll essentially only getting the data you need. From there, you'll probably want to use JSON and either write to a database or to an Excel spreadsheet so you can analyze your data now.
@marceloortiz42 7 месяцев назад ⁺²
Nice video! Thanks
Is there a GUI that you recommend to use in windows?
@JayMartMedia 7 месяцев назад
Glad you found it helpful! I haven't used any GUIs with Tesseract, with the exception of this site which runs Tesseract in the browser: tesseract.projectnaptha.com/
Vid: ruclips.net/video/tFW0ExG4QZ4/видео.html
@Ueberkombo 8 месяцев назад ⁺¹
00:14 Only if you use it for English, Russian or Chinese Text everyone!
@rogue771 Месяц назад ⁺¹
Thank you 😎
@JayMartMedia Месяц назад
Glad you found it helpful!
@derekegenti 6 месяцев назад
Thanks for this. Lifesaver.
@sneakyblinder982 5 месяцев назад ⁺¹
Tysm for this video!!
@Muhammad_Aftab_ahmad_97 Месяц назад ⁺¹
Thank you so much
@JayMartMedia Месяц назад
I'm glad you found it helpful!
@SP-kq4qb 9 месяцев назад ⁺³
thanks man :)
@Rafael_Perez21 12 дней назад ⁺¹
interesting thank you
@ayushpathania583 21 день назад ⁺¹
bro trying to beat that random indian guy
@archhangell 7 месяцев назад ⁺¹
Cheers!
@omar.alnounou 10 месяцев назад ⁺²
ty
@JayMartMedia 10 месяцев назад ⁺¹
yw ♥️
@HairoHeria 7 месяцев назад ⁺¹
thank you
@Mollory16 7 месяцев назад
Can you help me on discord??
@Mollory16 7 месяцев назад ⁺²
I do not understand ! you made a video very quickly. I can't understand
@adejobiolajide8011 4 месяца назад
You need to slow down when explaining and show steps involved pls
@foodiee29 26 дней назад ⁺¹
thank you so much
@JayMartMedia 26 дней назад
I'm glad it was helpful for you!

Следующие

Автовоспроизведение

How to Preprocess Images for Text OCR in Python (OCR in Python Tutorials 02.02)

How to Preprocess Images for Text OCR in Python (OCR in Python Tutorials 02.02)

The most important Python script I ever wrote

The most important Python script I ever wrote

Detect Text in Images with Python - pytesseract vs. easyocr vs keras_ocr

Detect Text in Images with Python - pytesseract vs. easyocr vs keras_ocr

How Employees Are Coffee Badging To Avoid Full Days At The Office

How Employees Are Coffee Badging To Avoid Full Days At The Office

The Most Illegal Baseball Bat Ever Created

The Most Illegal Baseball Bat Ever Created

Every Home Alone Is Worse Than The Last

Every Home Alone Is Worse Than The Last

Islam Makhachev DENIES Arman Tsarukyan as toughest opponent👀 'I'll make everyone shut up' | ESPN MMA

Islam Makhachev DENIES Arman Tsarukyan as toughest opponent👀 'I'll make everyone shut up' | ESPN MMA

How to Install the Libraries (OCR in Python Tutorials 01.02)

How to Install the Libraries (OCR in Python Tutorials 01.02)

OCR TensorFlow and Python (95.55% accuracy) | Automatic scoring of handwritten test papers

OCR TensorFlow and Python (95.55% accuracy) | Automatic scoring of handwritten test papers

Tmux has forever changed the way I write code.

Tmux has forever changed the way I write code.

Create Stunning Python GUIs in 10 Minutes With Drag & Drop

Create Stunning Python GUIs in 10 Minutes With Drag & Drop

*Next-door 10x Software Engineer* [FULL]

*Next-door 10x Software Engineer* [FULL]

5 Python Libraries You Should Know in 2025!

5 Python Libraries You Should Know in 2025!

Regular Expressions (Regex) Tutorial: How to Match Any Pattern of Text

Regular Expressions (Regex) Tutorial: How to Match Any Pattern of Text

0 to LSP : Neovim RC From Scratch

0 to LSP : Neovim RC From Scratch

[ Image To Text ] Train new Font with Tesseract in Google Colab (5x Faster)

[ Image To Text ] Train new Font with Tesseract in Google Colab (5x Faster)

Comedy Remix😂Thanks 50M subscribers🙏

Comedy Remix😂Thanks 50M subscribers🙏

Обмен сквишами 😱🧸 мама удивила #виолави #шортс #обзор #сквиши #табасквиш #топ

Обмен сквишами 😱🧸 мама удивила #виолави #шортс #обзор #сквиши #табасквиш #топ

黑天使预知未来#short #angel #clown

黑天使预知未来#short #angel #clown

How many stick you counted? 😮🦑 #squidgame

How many stick you counted? 😮🦑 #squidgame

تجربة صيد الكنوز في الماء بأكبر مغناطيس ـ وهذا الذي وجته 🔫😳

تجربة صيد الكنوز في الماء بأكبر مغناطيس ـ وهذا الذي وجته 🔫😳

КТО ЛУЧШЕ ПЕРЕКРИЧАЛ?😂

КТО ЛУЧШЕ ПЕРЕКРИЧАЛ?😂

Противодействие шокеру

Противодействие шокеру

КАТАСТРОФЫ НА АТТРАКЦИОНАХ, О КОТОРЫХ ВЫ НЕ ЗНАЛИ (21 инцидент)

КАТАСТРОФЫ НА АТТРАКЦИОНАХ, О КОТОРЫХ ВЫ НЕ ЗНАЛИ (21 инцидент)