How to use Bounding Boxes with OpenCV (OCR in Python Tutorials 03.02)

  • Published: 22 May 2024
  • If you enjoy this video, please subscribe.
    ✅Be my Patron: / wjbmattingly
    ✅PayPal: www.paypal.com/cgi-bin/webscr...
    If there's a specific video you would like to see or a tutorial series, let me know in the comments and I will try and make it.
    If you liked this video, check out www.PythonHumanities.com, where I have Coding Exercises, Lessons, on-site Python shells where you can experiment with code, and a text version of the material discussed here.
    You can follow me at:
    / wjb_mattingly

Comments • 31

  • @haniihsanuddin9585 8 months ago +7

    1. Blur the image (to identify the overall structure rather than focusing on the text itself)
    2. Threshold the image (and create a kernel) to separate the text blocks
    3. Perform dilation (~thickening the white regions)
    4. Find contours (finding boundaries)
    5. Loop over the contours and only draw bounding boxes of a specific size (to exclude small bboxes)

  • @BrandonJF4 1 year ago +2

    Thank you so much, this really helped me make progress on a project!

  • @letslearn2674 1 year ago +2

    This is the one I have been looking for. Thank you so much!

  • @DilipDas-ys5ph 1 year ago

    Great Thanks !!

  • @steffenhalama5558 2 years ago +1

    Very nice video, helped me a lot.

  • @Atharva_S9 2 years ago +8

    Why can't you provide the code for this?

  • @vildanhuseynov6492 2 years ago +1

    good job man!!!

  • @thenotoriousrkf3012 2 years ago +15

    I think there is an error in your code. From minute 15:45 on, you define the ROI. However, it is w, not h, that has to be added to x: x+h should be x+w. Therefore, roi should be defined as: roi = image[y:y+h, x:x+w]
    Since this typo also appears on your GitHub, you should change it there as well.
    Kind regards!
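
NumPy images are indexed rows first, so the slice is `[y:y+h, x:x+w]`. A quick check on a dummy array (the helper name `crop_roi` and the numbers are illustrative only) shows how the `x+h` version silently crops the wrong width:

```python
import numpy as np


def crop_roi(image, x, y, w, h):
    """Crop a box given as (x, y, w, h), the order cv2.boundingRect returns."""
    # Rows (first axis) follow y and the height; columns follow x and the width.
    return image[y:y + h, x:x + w]


image = np.zeros((20, 30), dtype=np.uint8)  # 20 rows tall, 30 columns wide
x, y, w, h = 5, 2, 10, 4

correct = crop_roi(image, x, y, w, h)
buggy = image[y:y + h, x:x + h]  # the typo: h used where w belongs

print(correct.shape)  # (4, 10)
print(buggy.shape)    # (4, 4)
```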

  • @vildanhuseynov6492 2 years ago

    dude, do you have experience in aligned text?

  • @farahjabeen7707 2 years ago

    @Python tutorials for digital humanities can you explain how to make a bounding box using pixel locations?

  • @joshuasmitherman1712 1 year ago +2

    It's not finding the sections for me. It captures the whole document as a section. Any suggestions?

  • @conorforster8853 1 year ago +3

    Hi, this tutorial series has been the best thing since sliced bread, and I honestly don't know where I'd be without it.
    However, I am stumped. I'm trying to read PDFs into JPEG format. The problem arises when I have tables and images within these files that I would like to either skip or try to read into a file without wrecking the structure (obviously not images within the images). Ideally I would like this process to be automated, as the final program is not being used by myself but by others less acquainted with technology. As of now there is no documentation I can find that helps facilitate this.
    I know it's a long shot, but honestly I've hit a wall, and if by some chance anyone can help, any guidance or advice would mean the world.

  • @breezyfeels1802 2 years ago

    Hi, can you please tell me how I can get bounding boxes around each question in any question paper? I have tried a lot, but I have been unable to get it. I would be really glad if you could help me. Thanks!

  • @ridafatima1739 1 month ago

    Will this work on colored images as well? If not, what changes should I make for colored images?

  • @virendartripathi4645 10 months ago

    I can't download the images from the course. Can you help me so that I can practice this?

  • @tiennguyentran9358 2 years ago

    *i Love u so much tks u*

  • @kltr007 1 year ago +3

    Short question: in Box [15] it reads "else cents[1]". Is this a typo and should be "else cnts[1]" or did I miss something?
    But great content! Keep going!

    • @pcb5135 9 months ago

      I assume it's a typo.

  • @mateussaar4071 2 months ago

    MAGIC

  • @jumbertparrenas3218 1 year ago

    Can I ask whether this can be used in an application, or only on desktop? I'm asking because this is the same as my thesis title.

    • @nathantafelsky7089 1 year ago

      It could be used in the source code of an application, or used on different operating systems. Imports and syntax would vary by language and implementation.

  • @khushibaghel220 4 months ago

    I am trying to run this in Google Colab but I am getting an error: TesseractNotFoundError: C:\Program Files\Tesseract-OCR is not installed or it's not in your PATH. See README file for more information. How do I resolve this? I have already added pytesseract to my environment variables.
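
The path in this error is a Windows one, while Colab notebooks run on Linux, so a hard-coded `tesseract_cmd` copied from a Windows setup is a likely culprit (an assumption, since the notebook is not shown). A sketch that looks the binary up on PATH instead; `locate_tesseract` is a hypothetical helper, not part of pytesseract:

```python
import shutil


def locate_tesseract(names=("tesseract",)):
    """Return the full path of the first Tesseract binary found on PATH, or None."""
    for name in names:
        path = shutil.which(name)
        if path:
            return path
    return None


tesseract_path = locate_tesseract()
if tesseract_path is None:
    # On Colab the binary usually has to be installed first, for example with
    # `!apt-get install -y tesseract-ocr` in a notebook cell (this assumes a
    # Debian/Ubuntu-based image).
    print("tesseract binary not found on PATH; install it first")
else:
    print("tesseract found at", tesseract_path)
    # With pytesseract installed, the wrapper can then be pointed at it:
    # import pytesseract
    # pytesseract.pytesseract.tesseract_cmd = tesseract_path
```

Note that `pip install pytesseract` only installs the Python wrapper; the Tesseract program itself is a separate install.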

  • @anjuathouse5370 2 years ago +1

    Could you please make a video on line segmentation of scanned handwritten document images?

    • @python-programming 2 years ago +2

      Sure! I actually wrote that code a year or so ago. I will try and dig it up and make a video on it.

    • @anjuathouse5370 2 years ago

      @@python-programming Thank you so much..

  • @ppmanguin 1 year ago

    at 11:16 I have an error, can you tell me how to fix it? Thank you!
    error: OpenCV(4.7.0) :-1: error: (-5:Bad argument) in function 'boundingRect'
    > Overload resolution failed:
    > - array is not a numerical tuple
    > - Expected Ptr for argument 'array'

    • @ppmanguin 1 year ago

      Fixed by adding a second variable, because findContours returns 2 outputs:
      cnts, new_variable = cv2.findContours(dilate, cv2.RETR_EXTERNAL, cv2.CHAIN_APPROX_SIMPLE)
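
The two-variable unpacking above matches OpenCV 4.x. OpenCV 3.x returned three values (image, contours, hierarchy), which is why some tutorials use the `cnts[0] if len(cnts) == 2 else cnts[1]` idiom. Passing the whole returned tuple to `cv2.boundingRect` instead of a single contour produces the "Bad argument" error quoted above. A version-agnostic helper might look like this (`select_contours` is a made-up name, not part of OpenCV):

```python
def select_contours(result):
    """Pick the contour list out of whatever cv2.findContours returned.

    OpenCV 4.x (and 2.4) return (contours, hierarchy);
    OpenCV 3.x returns (image, contours, hierarchy).
    """
    if len(result) == 2:
        return result[0]
    if len(result) == 3:
        return result[1]
    raise ValueError("unexpected findContours result of length %d" % len(result))


# Intended use (needs OpenCV and a binary image named `dilate`):
# cnts = select_contours(
#     cv2.findContours(dilate, cv2.RETR_EXTERNAL, cv2.CHAIN_APPROX_SIMPLE)
# )
# for c in cnts:
#     x, y, w, h = cv2.boundingRect(c)  # one contour at a time, not the tuple
```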

  • @ROKKor-hs8tg 7 months ago

    How can printed shapes in a scanned image be output to a docx file?