#1 - Read PDF and Validate Content using PDFBOX in Selenium

Поделиться
HTML-код
  • Опубликовано: 1 янв 2025

Комментарии • 57

  • @naveenautomationlabs
    @naveenautomationlabs  2 года назад +4

    In this example, we are using driver to launch the browser and url. But not using the driver in PDFBOX code as I could not find the right example online.
    In real time use case, you can click on pdf link from the web page and get the href/url value of the same link and use it in PDFBOX code in URL class object.
    example:
    String url = driver.findlement(pdf_link_element).getAttribute("href);
    URL pdfUrl = new URL(url);

    • @neharai4959
      @neharai4959 Год назад

      getting java.io.IOException: Error: End-of-File, expected line at offset 5565 at pddocument.load(bf) in below program: URL url=new URL(pdfurl);
      URLConnection urc=url.openConnection();

      urc.addRequestProperty("User-Agent", "Mozilla");
      int responseCode = ((HttpURLConnection) urc).getResponseCode();
      if (responseCode == 200) {
      InputStream is=urc.getInputStream();
      BufferedInputStream bf=new BufferedInputStream(is);
      PDDocument pd=PDDocument.load(bf);
      int count=pd.getNumberOfPages();
      System.out.println(count);

      } } please help me out.

  • @peacelilly2200
    @peacelilly2200 2 года назад +1

    I learn a ton of things from your video. The content is straight forward and the explanation every time is crystal clear. Thank you so much for making such videos.

  • @SarangHoley
    @SarangHoley 2 года назад +1

    Long back you had made a video on this, good to see a updated vision of it, Thank you Naveen 😊

  • @ABAutomationHub
    @ABAutomationHub 2 года назад +1

    Thanks for covering topics like this.. It’s very useful..

  • @malleshmalli809
    @malleshmalli809 2 года назад

    Thank you Naveen ..it's very useful video ..thank you so much

  • @AK-rx5yp
    @AK-rx5yp Год назад +1

    Can you pls explain important scenario here as we see multiple tables here say the row with Name as key should contain value as Naveen....
    How to automate this pls???

  • @soumyajitnath1348
    @soumyajitnath1348 2 года назад +1

    Really too useful ! Your videos always gives a kick to me to learn more.
    Please make a video on threadlocal which can be used to run tests parallel at test method level in an automation framework

  • @softwaretestinglearninghub
    @softwaretestinglearninghub 2 года назад +1

    Great content Naveen, thank you!

    • @punampatil7355
      @punampatil7355 2 года назад

      Hi Naveen, I want to read recent downloaded pdf from its downloaded folder and verify it's title.

  • @shwetakatare24
    @shwetakatare24 2 года назад

    Thank you for this video💯😊

  • @yusufalperengürel
    @yusufalperengürel Год назад +1

    Thanks for sharing. I take "java.io.IOException: Error: End-of-File, expected line at offset 636". Do you have any idea to handle it?

  • @nigaraliyeva1240
    @nigaraliyeva1240 2 года назад

    Thank You Naveen!

  • @mrleoim
    @mrleoim 2 года назад +1

    Hi Naveen, your video on PDF validation is very good. Can you do video on using selenium to automate the mainframe screens like IBM personal communications

  • @suryadeepsrivastava7645
    @suryadeepsrivastava7645 Год назад

    Hi Naveen, i am working in a banking project, my application has an embedded pdf, I need to validate the pdf content. When I pass the pdf url, I get a connection timed out exception. Can you please help?

  • @raghadraghad8433
    @raghadraghad8433 Год назад

    Hi
    I got error of java.lang.NoClassDefFoundError: org/apache/pdfbox/pdmodel/PDDocument
    Although I exported fontbox pdfbox jars as external libraries
    What should I do?

  • @raghadraghad8433
    @raghadraghad8433 Год назад

    Hi
    How can I select Save as pdf option from chrome printing dialog and the pdf file?

  • @mayurubale9102
    @mayurubale9102 Год назад +1

    Thank u sir !

  • @surajsurya1414
    @surajsurya1414 Год назад

    Thanks for sharing this. It would be really helpful if you can make a video for same with Cypress. I have a scenerio, where I have to create a sales invoice. On saving it, browser print popup is displayed and I have to assert some values on it.
    Thank you in advance.

  • @YasmeenFatimaAbdi
    @YasmeenFatimaAbdi Год назад

    When I am trying to download pdf file, then pdf file is opening in new tab and unable to handle clicking on save file to my local . How can I save read only pdf file when we are restricted from company to download file? Can you please help me with the code. Thanks

  • @raj-we9yr
    @raj-we9yr Год назад

    Thank you for the nice video. Is it possible to specify a particular table in a page and extract just that specific table from the PDF document

  • @rameshkrishna6103
    @rameshkrishna6103 5 месяцев назад

    Nice Video. Thank you. Can we search a text in the PDF and "move" to the text one by one as we do on a PDF or other document search?

  • @knowledgeTransfer31
    @knowledgeTransfer31 Год назад

    Hi Naveen , I ma getting
    FileNotException what migh tbe the reason but the file is not in the destination path

  • @archanamuthukrishnan6465
    @archanamuthukrishnan6465 2 года назад

    Hello Sir
    In my project am using properties file to read credentials and url .but they asking not to use the same..can you please let me know the alternative?

  • @dhrusoni1
    @dhrusoni1 Год назад

    Does it possible to asserting charts ?

  • @neamafouad57
    @neamafouad57 Год назад

    Thank you for this helpful video, but there are some characters are changed when reading pdf and print it ,Do you have any idea why this ?

  • @arnaldoadiputra681
    @arnaldoadiputra681 Год назад

    is it possible to screenshoot the pdf from the webbrowserview ? like all the way until the last page ?

  • @anjankumar4012
    @anjankumar4012 2 года назад

    Thanks for the video,
    I was searching for a way for my project.
    Really helpful .❤️
    Can you please make a video on how to save screenshots in Word file.
    That will be really helpful

  • @radhakrishnanp2578
    @radhakrishnanp2578 2 года назад

    Hi Naveen will you kindly upload the video on how to assert it and extract the images from the pdf?

  • @naveenkumars9132
    @naveenkumars9132 2 года назад

    Hi Naveen,
    Do we have any option to validate Bold text/sentence in the pdf ? Like i got a scenario to validate a particular sentence in the pdf are bold.

  • @Sai-Ram-1234
    @Sai-Ram-1234 2 года назад

    How to read the content of the pdf content is encrypted using pdf text stripper?

  • @vaishalilahudkar2795
    @vaishalilahudkar2795 2 года назад

    Hi sir,
    Why here headless cromeoption used and passed in driver instance

  • @pawanchandra7158
    @pawanchandra7158 2 года назад

    Hi Naveen,
    Why can't we pass InputStream object directly to PDDocument class..Why are we creating BufferedInputStream class object

  • @Gaurav12081
    @Gaurav12081 Год назад

    Hi Naveen same video can you make for XML validation currently in my company we are validating invoice extract XML against DB thanks.

  • @vigneshelumalai1916
    @vigneshelumalai1916 2 года назад

    can we click a button on pdf to redirect to my application

  • @KARTHIKPANCH97
    @KARTHIKPANCH97 2 года назад

    Hi Naveen.
    I am part of your Selenium Java training batch of 11th Nov
    Would you be covering this topic in that as well.
    It would be great so all topics would be at once place for easy reference
    Thanks.

  • @botchulamunesh2854
    @botchulamunesh2854 Год назад

    Bro how table data like this type

  • @srikanthmaragoni4291
    @srikanthmaragoni4291 Год назад

    Hi sir can u explain how to download and validate the same pdf file using selenium webdriver (without giving url' s)

  • @chakshitvlogs8766
    @chakshitvlogs8766 Год назад

    Hi Brother,
    I have been following your videos so regularly. Can you able to make a video related to extracting tables from pdf file using any third party library

  • @syedwaseemahmed1749
    @syedwaseemahmed1749 Год назад

    How validate pdf contain hiper link??

  • @delankoh3494
    @delankoh3494 2 года назад

    How can we validate images or signatures in pdf?

  • @mangeshmunde9347
    @mangeshmunde9347 2 года назад

    Hey Naveen, can you share API document Pdf....you have explained in video..

  • @vinayakm9389
    @vinayakm9389 9 месяцев назад

    Hi Naveen, really very useful video, I tried to do it, Im facing this error,
    Any suggestions please how to come out, stackOverFlow didn't give answer on same.
    Java.io.IOException: Error: End-of-file, expected line
    Here Scenario is pdf is added inside the regular text page

  • @jobcurator2413
    @jobcurator2413 2 года назад

    Whats the use of driver.url("url of pdf") when we are anyway creating URL for PDF file directly ?

    • @naveenautomationlabs
      @naveenautomationlabs  2 года назад

      yes correct.
      In this example, we are using driver to launch the browser and url. But not using the driver in PDFBOX code as I could not find the right example online.
      In real time use case, you can click on pdf link from the web page and get the href/url value of the same link and use it in PDFBOX code in URL class object.
      example:
      String url = driver.findlement(pdf_link_element).getAttribute("href);
      URL pdfUrl = new URL(url);

  • @aruns5896
    @aruns5896 2 года назад

    Nice Video Naveen. Thanks . When the client or user wants to validate the pdf using selenium because they can directly open the pdf and validate ?Share the real time scenario

  • @swethanainampudi4261
    @swethanainampudi4261 2 года назад

    Hi Naveen, Do we have a way to read the PDF content from the blob url?

    • @naveenautomationlabs
      @naveenautomationlabs  2 года назад +1

      Blob url is not directly support with selenium. You can download the pdf and then launch it selenium or try the blob url directly with pdfbox.

  • @singh07neeraj
    @singh07neeraj 2 года назад

    Hi Naveen how to test if some PDF is open within the browser please cover this too

  • @homaassal2794
    @homaassal2794 2 года назад

    This method does not work if the pdf opens as a popup inside the same browser window

  • @ravirajug1137
    @ravirajug1137 Год назад

    pdfText.contains not searching string = De , rest all doing this.

  • @ravirajug1137
    @ravirajug1137 Год назад +2

    It is really helped me. Thanks for such nice video.