Recently just finished a project to find information from random pdf names files. Using the same DLL module with PS to looking for specific string. Which works great. Wish find your channel earlier.
Thanks.... your tutorial was very helpful... I will look through more of your tutorials, hoping to be able to extract pages that have my matching word... psyching up to start at the beginning of your tutorials on powershell...
Works great. Thanks so much. Maybe I didn't noot look enough, but it seems that there is not much in the docs of itext to see how to simply extract text out of a pdf file
Faced with a task of processing financial reports in annual batches 400 files each year I discovered your tutorial using iTextSharp. Excellent presentation and clear explanation. Thank You. As a mid-septuagenarian I am 'Jacked already'. Great work. Using VSCode & Powershell 7.4x and (iTextSharp 5...13.1) I experienced an exception with the base code you published. You used in your sample '$pdf.close() ' . In my experience this would not work as the exception stated 'pdf does not contain the method close() ' WHICH IS TRUE. Line | 13 | $pdf.Close() | ~~~~~~~~~~~~ | Method invocation failed because [System.String] does not contain a method named 'Close'. Any suggestion as to what may of happened here? Does iText close the file automatically? Processing high volumes of pdf's would likely lead to system memory problems if they are not closed. Will continue the research locally. Thanks Al
This video was influential in coding and completing my project. The iTextsharp5 v13.1 & 3 both worked OK but seemed temperamental. So now I want to try iText7. I struggle to get off home plate on this as there is little beginners tutorials of v7. There are many differences from 5. I added 5 new dll's to support this but still a big difference in approach. Any chance you might add a video addressing what you did with this video using iText7? That would be awesome. I hope you reply. Thx's
@@jackedprogrammer Hello, Did you ever attack this task with iText7 or 8. Did you have any sucsess ? Im about to abandon iText completly as I cannot get success with v 5.5 13.1.or 3. Copying your video completely fails with iText errors depending on whatever manipulation I try. (I Currently submit the total pdf source to PDFElement that batch converst and then I copy from there. This is not art of the project design. My suspicion is its related to MS OS updates which hAS BEEN VERY VOLATILE inthe passed year or so. Hope to hear back from you.
Hi, thank you for the tutorial, used it in a work project, one thing i would like to consult you about that i could not find information about online, when checking multiple pdf files as in thousands of files, what is the best practice regard the pdfreader? lets say i have a loop that iterates over the files, do i use the $pdf = new-Object itextSharp.text.pdf.pdfreader -ArgumentList $file every iteration inside the loop? or is there a way to create the reader outside the loop and feed it one file after the other? I'm probably missing some understanding regarding the pdf reader that would help me alot, thank you
OK, finally tracked it down. My question is retracted. Was working on the object $pdf instead of $text. Renamed the $pdf to $text -OK. Was unintentionally recycling the Object $pdf for $text and corrupting the object. Pretty dumb eh! Thx A
Nice sample. Any hints how to read keywords in PDF file [$pdfReader=New-Object iTextSharp.text.pdf.PdfReader($filePath)], clear it, put 0 and modify it to new value [int]$newKeywords = $keywords + 1
Recently just finished a project to find information from random pdf names files. Using the same DLL module with PS to looking for specific string. Which works great. Wish find your channel earlier.
excelente, muchas gracias.
Thanks.... your tutorial was very helpful... I will look through more of your tutorials, hoping to be able to extract pages that have my matching word... psyching up to start at the beginning of your tutorials on powershell...
Great video mate. Was looking for this subject and here you are. Gold 👍🏼
Thanks! Glad it was able to help you out :)
Works great. Thanks so much. Maybe I didn't noot look enough, but it seems that there is not much in the docs of itext to see how to simply extract text out of a pdf file
Hi, I just add a pdf file in my scripts folders, but when I open it on chrome, it said:
Error
Failed to load PDF document.
How to fix that??
Faced with a task of processing financial reports in annual batches 400 files each year I discovered your tutorial using iTextSharp.
Excellent presentation and clear explanation. Thank You. As a mid-septuagenarian I am 'Jacked already'. Great work.
Using VSCode & Powershell 7.4x and (iTextSharp 5...13.1) I experienced an exception with the base code you published.
You used in your sample '$pdf.close() ' . In my experience this would not work as the exception stated 'pdf does not contain the method close() ' WHICH IS TRUE.
Line |
13 | $pdf.Close()
| ~~~~~~~~~~~~
| Method invocation failed because [System.String] does not contain a method named 'Close'.
Any suggestion as to what may of happened here?
Does iText close the file automatically? Processing high volumes of pdf's would likely lead to system memory problems if they are not closed.
Will continue the research locally.
Thanks
Al
Thank you very much!
No problem, glad you're enjoying the videos :)
Excelent. Thank you.
No problem, glad you're enjoying the videos :)
This video was influential in coding and completing my project. The iTextsharp5 v13.1 & 3 both worked OK but seemed temperamental. So now I want to try iText7. I struggle to get off home plate on this as there is little beginners tutorials of v7. There are many differences from 5. I added 5 new dll's to support this but still a big difference in approach.
Any chance you might add a video addressing what you did with this video using iText7?
That would be awesome. I hope you reply. Thx's
Hi Al, I will take a look at it and see what I can find out for you and make a video on it hopefully :)
@@jackedprogrammer
Hello, Did you ever attack this task with iText7 or 8. Did you have any sucsess ? Im about to abandon iText completly as I cannot get success with v 5.5 13.1.or 3. Copying your video completely fails with iText errors depending on whatever manipulation I try. (I Currently submit the total pdf source to PDFElement that batch converst and then I copy from there. This is not art of the project design.
My suspicion is its related to MS OS updates which hAS BEEN VERY VOLATILE inthe passed year or so. Hope to hear back from you.
Hi, thank you for the tutorial, used it in a work project, one thing i would like to consult you about that i could not find information about online, when checking multiple pdf files as in thousands of files, what is the best practice regard the pdfreader? lets say i have a loop that iterates over the files, do i use the $pdf = new-Object itextSharp.text.pdf.pdfreader -ArgumentList $file every iteration inside the loop? or is there a way to create the reader outside the loop and feed it one file after the other? I'm probably missing some understanding regarding the pdf reader that would help me alot, thank you
Pretty good
Thank you, glad you enjoyed the video :)
I'm a beginner and so I just wonder, do I need or should I use the sub-shell in the ForEach statement? 1..$pdf.NumberofPages seems to work too. :)
OK, finally tracked it down. My question is retracted.
Was working on the object $pdf instead of $text. Renamed the $pdf to $text -OK. Was unintentionally recycling the Object $pdf for $text and corrupting the object.
Pretty dumb eh!
Thx
A
Nice sample. Any hints how to read keywords in PDF file [$pdfReader=New-Object iTextSharp.text.pdf.PdfReader($filePath)], clear it, put 0 and modify it to new value [int]$newKeywords = $keywords + 1