Hello, I would like to know if we can prepare the data with a tool other than UBIAI. I couldn't sign in to the platform.
Hello Sir, it was an excellent tutorial, but I am unable to install unilm/layoutlm, as Colab has Python 3.11 by default. I downgraded Python to 3.7 and tried running again; now transformers is not installed properly. Please help me.
There is an issue with downloading dependencies when we install layoutlm from unilm.
Is there any way we can get the output of the LayoutLM model as key-value pairs, where the question is the key and the answer is its value?
Yes, you can take the predicted BIOES tags and write a simple script to merge them into a single value according to the tags.
Hi, did you manage to do it?
Can you give any directions to achieve this, please?
@italykiduniya Since you already know your keys, I suggest you just build your own hash map in code. For example, in Python:

    output_dict = {}
    for key in list_of_keys:
        val = LayoutLM(doc, query)[index_of_value]
        output_dict[key] = val

Please use this as an inspiration and not the exact code.
I want to use my own dataset, which is already annotated in IOB format as a JSON file. Is it possible to use it as-is, or do I need to convert it to a JSON format similar to the FUNSD dataset? If so, is there any (free) converter available to transform the annotation format?
Hi @Karndeep Singh, I am using my own dataset. Let's say I want to annotate 3 labels: "DATE", "INVOICE NUMBER" and "OTHER". Will it be possible to automatically annotate all the remaining tokens with the "OTHER" label using LayoutLMv3?
Another nice tutorial about LayoutLM. Btw, the annotation process is quite a tedious task... Could you also make a tutorial on how to cluster relevant keywords in a document? Thanks👍
Sure
@karndeepsingh Great, I'm also looking forward to that tutorial. Subscribed😁
I have documents in PDF format. All the documents are different. I want to extract some information from them. What should I do? Each PDF contains 5-10 pages.
You need to pass each page, or first find the relevant page, and then pass it to the trained LayoutLM model.
@karndeepsingh Thank you for the reply. Do you know of any other solution, like named entity recognition?
@muhammedfaisalpj4810 Yes, you can use NER to extract relevant information from the given context. Make sure the quality of the PDF or image is good before OCR, otherwise it may distort the NER results.
@karndeepsingh Hi, I want to extract some information from a PDF and then convert it to Excel. I want to train a model to recognise key-value pairs in my documents, but how?
Hey @muhammedfaisalpj4810, did you work on it? Can we connect?
@Karndeep Singh Will we be able to tag data in FUNSD format using Label Studio? (UBIAI is an online tool, so we cannot send our data outside the network.)
Thank you so much for the wonderful explanation!
My only doubt is how we can get the output in key-value pairs.
Great tutorial! How do you know which "answer" belongs to which "question"? Is that somehow part of the model output or does it merely "tag" each token? Thanks!
I think the way you would do it is to group them by line/y-position, and pair up question/answer tags at the same y-level (or close enough). The model doesn't really know which one belongs to which.
Hi Karan,
how do we extract the key-value pairs as JSON at the end?
LayoutLM outputs a BIOES tag for each word. To build a key-value pair, just merge the BIOES-tagged words into the final value. You only need to write a few lines of code to accumulate LayoutLM's output into the required format.
Hi, did you manage to do it?
Can you give any directions to achieve this, please?
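Those few lines might look something like this (a sketch; the BIOES tag strings and the (word, tag) input format are assumptions about your pipeline's output, not LayoutLM's exact API):

```python
def merge_bioes(tagged_words):
    """Merge per-word BIOES tags into (label, text) entities.

    tagged_words: list of (word, tag) pairs, e.g. ("Invoice", "B-QUESTION").
    B = begin, I = inside, O = outside, E = end, S = single-word entity.
    """
    entities = []
    current_label, current_words = None, []
    for word, tag in tagged_words:
        if tag == "O":
            current_label, current_words = None, []
            continue
        prefix, label = tag.split("-", 1)
        if prefix in ("B", "S"):
            current_label, current_words = label, [word]
        elif label == current_label:   # I or E continuing the open entity
            current_words.append(word)
        else:                          # malformed sequence: drop the open entity
            current_label, current_words = None, []
            continue
        if prefix in ("E", "S"):
            entities.append((current_label, " ".join(current_words)))
            current_label, current_words = None, []
    return entities
```

For example, `merge_bioes([("Invoice", "B-QUESTION"), ("No", "E-QUESTION"), ("INV-001", "S-ANSWER")])` gives `[("QUESTION", "Invoice No"), ("ANSWER", "INV-001")]`; pairing each QUESTION with the following ANSWER then yields the key-value dict.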
Amazing video. Thanks a lot, Kamaldeep!
Hi Karndeep, for custom invoice parameter extraction, should I annotate the labels in question-answer style, like an invoice-number question and an invoice-number answer? Or just annotate the answer part only?
Hello,
Thank you for the precise tutorial. I'm facing an issue while running the 1st cell in Google Colab; below is the error:
Collecting lxml==4.5.1 (from layoutlm==0.0)
Downloading lxml-4.5.1.tar.gz (4.5 MB)
━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 4.5/4.5 MB 16.0 MB/s eta 0:00:00
error: subprocess-exited-with-error
× python setup.py egg_info did not run successfully.
│ exit code: 1
╰─> See above for output.
note: This error originates from a subprocess, and is likely not a problem with pip.
Preparing metadata (setup.py) ... error
error: metadata-generation-failed
× Encountered error while generating package metadata.
╰─> See above for output.
note: This is an issue with the package mentioned above, not pip.
hint: See above for details.
Can you help me solve the above issue, Sir? Thank you.
Great tutorial! Could you also make a small video on a custom LayoutLM model, please?
How can I annotate data and train this model on my own form data?
I used the UBIAI tool on a 14-day trial to annotate the dataset. Alternatively, you can try Label Studio (I personally haven't tried it).
@karndeepsingh Thank you.
@karndeepsingh UBIAI is not allowing JPG annotation on the free trial. Any other option, please?
Hello, I have a doubt: how can I create a personal dataset? Are there any tools or other methods?
Hi Karn,
what is the output of this model? Position coordinates along with the keyword?
How will we extract the data as key-value pairs?
A BIOES tag for each word, along with its bounding-box information.
@karndeepsingh How do we extract that?
Hi, did you manage to do it?
Can you give any directions to achieve this, please?
Hi, I have an urgent task. Could you please suggest which approach I should use to extract data from invoice images and then classify them into the required fields? Thanks for your work.
It depends on the use case, its complexity, and also how many fields need to be extracted.
Hi, thanks for this video. I keep getting this error when I run the first block of code to clone unilm: error: subprocess-exited-with-error and error: metadata-generation-failed.
Excellent explanation
Hi, it seems like the latest Python version throws this error:
ModuleNotFoundError: No module named 'layoutlm'
I tried changing the Python version but it still doesn't work. Can you share updated code, or explain how I can fix this bug?
Thank you
I'm also facing the same error. Could you explain how I can fix this bug?
@banagarmahesh1747 Try restarting the runtime.
@NiketBahety Did that solve it?
@pratikpatel6967 Yes.
Can we use the pretrained model without any fine-tuning? How do we get results from that model?
Yes, you can use the pretrained model.
@karndeepsingh How do I extract only the question and answer in a key-value format?
E.g.: {"To" : "KA Sparrow",
Eg:{"To" : "KA Sparrow",
"From" : "DJ Landro",
"Subject" : "....."
}
You can annotate in question-answer format using the UBIAI tool and then train a DocQA pretrained model on it.
@karndeepsingh Can I use the FUNSD dataset to train a key-value extraction model?
Output of the model:{"To" : "Ka",
"H": "g",
}
Can you please suggest a way to do this? I want to do it on the FUNSD dataset.
Can you please suggest how to use LayoutLMv2 for passport data extraction?
For passports, maybe you can use object detection to locate the area of interest and then apply OCR on it.
@karndeepsingh I applied OCR; I tried it with EasyOCR. What I was doing: reader.readtext("path of image"), then taking the bounding-box coordinates and text of the widest bounding boxes in the output, then checking the text against the MRZ codes and converting it into a DataFrame for convenience. Now the problem is that sometimes an S is read as a 5, and sometimes a 0 is read as an O; I think this is an OCR issue. How can I resolve it? Please help me with this.
It's an issue with the OCR. Maybe you need to change some parameters of the OCR you are using, or consider using a different OCR.
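Besides tuning the OCR, a common post-processing trick for MRZ specifically (a sketch, not EasyOCR-specific) is to use the fact that each MRZ field has a known character class: in a purely numeric field an "O" must really be a "0" and an "S" is almost certainly a "5", and the reverse holds in name fields:

```python
# Characters that OCR engines commonly confuse, mapped per field type.
TO_DIGIT = str.maketrans({"O": "0", "S": "5", "I": "1", "B": "8"})
TO_ALPHA = str.maketrans({"0": "O", "5": "S", "1": "I", "8": "B"})

def fix_mrz_field(text, numeric):
    """Coerce an OCR'd MRZ field to its expected character class."""
    return text.translate(TO_DIGIT if numeric else TO_ALPHA)
```

The substitution table is an assumption; extend it with whatever confusions your OCR actually produces, and validate the corrected fields against the MRZ check digits.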
@karndeepsingh you are so talented. I am glad to have you here.
Can we use multiple PDF documents to extract information using LayoutLM?
Yes
Thanks @karndeepsingh.
Do we first have to take a screenshot of each page of the PDF and use the images one by one for processing and asking questions? Or is there an API available to upload the entire PDF directly and ask questions?
excellent. thanks
How do I train a custom model for key-value extraction?
Is the process the same for the pretrained LayoutLMv3 model?
Yes
Can you please let me know how we can extract that data as text from the document for further analysis? It's very urgent.
Thanks
The predicted output will be BIOES tags associated with labels. You have to merge these BIOES-tagged words together to form the text.
What algorithms are used in this model?
A combination of OCR, a Vision Transformer and RoBERTa.
How do I annotate a scanned PDF?
The inference time is high. How can we optimise the code?
You can try knowledge distillation or quantisation techniques.
@karndeepsingh Sure, I will make a note and try it.
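For anyone curious what the quantisation suggestion looks like in practice, here is a minimal sketch using PyTorch's dynamic quantisation (the tiny nn.Sequential is just a stand-in; assume you swap in your own fine-tuned model object):

```python
import torch
from torch import nn

# Stand-in model; replace with your fine-tuned LayoutLM model object.
model = nn.Sequential(nn.Linear(16, 16), nn.ReLU(), nn.Linear(16, 4))

# Dynamic quantisation converts the Linear weights to int8 once and
# quantises activations on the fly; it often shrinks the model and
# speeds up CPU inference, at a small accuracy cost.
quantized = torch.quantization.quantize_dynamic(
    model, {nn.Linear}, dtype=torch.qint8
)
out = quantized(torch.randn(1, 16))
```

Whether this helps depends on your hardware; benchmark before and after, and check that the extraction accuracy is still acceptable.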
Can you please share the GitHub link for this code? It would be very helpful.
Hi Karn, can we use it for PDFs the way we train NER models?
Yes
@sunitbehera597 Hey, did you work on it? Can we connect?
Wow, really a great video.
Just out of curiosity, what are the specs of the hardware or service you are using? How long do you think it would take to train on 5000 annotated documents?
I used a GTX 1080, and it should take at least 2-3 hours to train.
Excellent
How do I implement this solution on a multi-page document?
You have to pass each page of a document into the model
Yes, but won't the model get confused, since a new page will have a different class at the same coordinates as the first page?
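That per-page loop might look like this (a sketch; process_page and renderer are hypothetical placeholders, and in practice renderer could be pdf2image.convert_from_path, which needs poppler installed):

```python
def extract_from_pdf(pdf_path, process_page, renderer):
    """Run a per-page extraction pipeline over a multi-page PDF.

    renderer:     turns a PDF path into a list of page images
                  (e.g. pdf2image.convert_from_path, which needs poppler).
    process_page: your single-page inference function, e.g. a wrapper
                  around a fine-tuned LayoutLM model.
    """
    results = []
    for page_number, image in enumerate(renderer(pdf_path), start=1):
        results.append({"page": page_number, "fields": process_page(image)})
    return results
```

Keeping the page number next to each page's fields lets you merge or disambiguate results afterwards, which helps when different pages carry different field classes.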
Nice explanation. Could you explain how to create your own dataset using OCR for LayoutLM?
You can use the UBIAI tool, or you can prepare the data on your own by extracting word-wise BIOES tags and the respective text bounding boxes.
@karndeepsingh Can you show me how to prepare it without any tools, please? It would be a great help for me.
@karndeepsingh Sir, could you make a video without the use of the UBIAI tools? It would be a great help for me. Thank you in advance.
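A rough no-tools starting point (a sketch, assuming you run Tesseract via pytesseract for the OCR step; to_examples is a hypothetical helper, and every label starts as "O" for you to correct by hand):

```python
def to_examples(ocr_data):
    """Turn pytesseract's image_to_data dict into taggable word records.

    Each record holds the word, its [x0, y0, x1, y1] box, and a label
    that starts as "O" for you to fix manually (or with simple rules).
    """
    examples = []
    for text, x, y, w, h in zip(
        ocr_data["text"], ocr_data["left"], ocr_data["top"],
        ocr_data["width"], ocr_data["height"],
    ):
        if text.strip():  # skip empty OCR cells
            examples.append(
                {"text": text, "box": [x, y, x + w, y + h], "label": "O"}
            )
    return examples

# Typical usage (requires pytesseract and the Tesseract binary):
# data = pytesseract.image_to_data(Image.open("form.png"),
#                                  output_type=pytesseract.Output.DICT)
# examples = to_examples(data)
```

From there you edit the labels into BIOES tags and serialise the records into whatever JSON layout your training script expects.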
Sir, is there any specific tool for data annotation for this? #urgenthelp
UBIAI
@karndeepsingh If possible, can you make a video on how to annotate text using UBIAI?
Sure
Can you make a video for LayoutLMv2, please?
How do I make this dataset? Any suggestions?
Use a paid tool like UBIAI.
Brother, please make a video on how to get a job as a fresher in data science.
Lol, extracting information in JSON format. I don't know how this works.