Python Web Scraping Project: Manga Downloader Tutorial | Download any manga to PDF

  • Published: Sep 11, 2024

Comments • 18

  • @aniketchatterjee2456 1 month ago

    Hey, I don't know if you will see this, but I followed your code exactly. The images are being downloaded, but the conversion to PDF stops at 1.pdf. Thank you in advance.

  • @january9765 9 months ago +2

    If you make a news scraper, how soon after a story is published on, say, CNN can you get it?

    • @rishabhxchoudhary 9 months ago

      I think instantly. If it's on the website, then yes, instantly. But I will have to re-run the script every time I want to get new stories.

    • @january9765 9 months ago

      @rishabhxchoudhary Is there any way to get the latest news without re-running the script?
      Suppose you want news about a particular topic right after it is released. How can you do that?

    • @rishabhxchoudhary 9 months ago

      @january9765 You will need a news API, which you will most probably have to pay for. If not, you will have to create a socket server that re-runs the script and sends the changes through an API. Through that API you can then access the news in real time.
      There is no way to do this with web scraping alone. You will need to create APIs.
      I am still learning, and there might be some way to achieve what you are asking, but I currently don't know of one.

    • @january9765 9 months ago

      @rishabhxchoudhary OK, thanks 👍

    • @AntiMen0 9 months ago

      @rishabhxchoudhary There is another way too: GitHub Actions. Look it up, learn it, and make a video on it. It may not be as precise as an API when it comes to timing, but it is still automated and runs without human intervention once it's set up.
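The scheduled re-run idea discussed in this thread can be approximated with a plain polling loop: fetch the page at a fixed interval and report only items not seen before. The sketch below is a minimal illustration, not the video author's code. `new_items`, `poll`, and the `fetch` callable are hypothetical names; `fetch` stands in for whatever scraping function returns the current list of headlines. Polling is not real-time: a new item is noticed at most one interval after it appears.

```python
import time
from typing import Callable, Iterable, Iterator, List, Optional, Set


def new_items(seen: Set[str], current: Iterable[str]) -> List[str]:
    """Return items from `current` that are not in `seen`, and remember them."""
    fresh = []
    for item in current:
        if item not in seen:
            seen.add(item)  # also guards against duplicates within one fetch
            fresh.append(item)
    return fresh


def poll(
    fetch: Callable[[], Iterable[str]],   # hypothetical scraper: returns current headlines
    interval_seconds: float = 60.0,
    max_rounds: Optional[int] = None,      # None means "run until interrupted"
    sleep: Callable[[float], None] = time.sleep,
) -> Iterator[List[str]]:
    """Call `fetch` repeatedly and yield only the newly appeared items."""
    seen: Set[str] = set()
    rounds = 0
    while max_rounds is None or rounds < max_rounds:
        fresh = new_items(seen, fetch())
        if fresh:
            yield fresh
        rounds += 1
        # Skip the final sleep when a round limit has been reached.
        if max_rounds is None or rounds < max_rounds:
            sleep(interval_seconds)
```

A scheduler such as cron or a scheduled CI workflow could replace the `while` loop; in that case the `seen` set would have to be saved to a file between runs.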

  • @Powerease 8 months ago +1

    I wrote the same code and also rechecked it against the GitHub version,
    but I am getting an error:
    line 115, in <module>
    main()
    line 112, in main
    download_manga(chapter, chapters[chapter])
    line 84, in download_manga
    os.mkdir(path)
    Please help me resolve this issue.

    • @rishabhxchoudhary 8 months ago

      I think it's because you are using a different operating system. Windows does not allow certain special characters in folder names, so you need to preprocess the chapter name string and remove any special characters from it.
      You can do it like this:
      name = ''.join(char for char in name if char.isalnum() or char.isspace())
      Put this on the first line of the download_manga function.
      Tell me if that resolves your issue.
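A minimal sketch of that suggestion, wrapped in helpers so the cleaned name is always what reaches the filesystem. The function names `safe_folder_name` and `make_chapter_dir` are hypothetical and not from the video's source code. Using `os.makedirs(..., exist_ok=True)` instead of `os.mkdir` is an added assumption: it also avoids a failure when the folder is left over from a previous run, which may or may not be the cause of the error reported above.

```python
import os


def safe_folder_name(name: str) -> str:
    """Keep only letters, digits, and whitespace, then collapse repeated spaces."""
    cleaned = "".join(ch for ch in name if ch.isalnum() or ch.isspace())
    return " ".join(cleaned.split())


def make_chapter_dir(base_dir: str, chapter_name: str) -> str:
    """Create (or reuse) a folder for one chapter and return its path."""
    path = os.path.join(base_dir, safe_folder_name(chapter_name))
    # exist_ok=True: do not raise if the folder already exists.
    os.makedirs(path, exist_ok=True)
    return path
```

Note that two chapter titles that differ only in punctuation would map to the same folder under this scheme.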

    • @Powerease 8 months ago

      @rishabhxchoudhary name = ''.join(char for char in name if char.isalnum() or char.isspace())
      ^^^^
      UnboundLocalError: cannot access local variable 'name' where it is not associated with a value
      I had copied this line into the function def main():
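For readers hitting the same message: in Python, assigning to a variable anywhere inside a function makes that variable local to the function, so reading it on the right-hand side before it has a value raises `UnboundLocalError`. The sketch below reproduces this with two hypothetical functions, `broken` and `fixed`; it assumes, as the reply above suggests, that the line belongs in a function that already receives `name` as a parameter.

```python
def broken():
    # `name` is assigned in this function, so Python treats it as local.
    # The right-hand side reads it before any value exists.
    name = "".join(ch for ch in name if ch.isalnum() or ch.isspace())
    return name


def fixed(name):
    # `name` arrives as a parameter, so it already has a value here.
    name = "".join(ch for ch in name if ch.isalnum() or ch.isspace())
    return name
```

Calling `broken()` raises `UnboundLocalError`, while `fixed("Chapter 1: Start?")` returns the cleaned string.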

    • @Powerease 8 months ago

      And also:
      line 112, in <module>
      main()

    • @rishabhxchoudhary 8 months ago

      @Powerease I think you should study the basics of Python 3 first before asking. Both errors, on lines 112 and 84, are caused by the single line os.mkdir(path). The fix above should solve both.

    • @Powerease 8 months ago +1

      @rishabhxchoudhary Okay, I will study the basics and the libraries thoroughly. Thanks for your support. You gained a subscriber.

  • @abhiramsiddanthi8800 9 months ago +1

    Where did you get the HEADERS from?

    • @rishabhxchoudhary 9 months ago

      They are quite standard. I just specified the user-agent, encoding, and host. You can copy them from the source code.
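As an illustration of what such a headers dictionary might look like, here is a minimal sketch using only the standard library. The header values are placeholder assumptions, not the ones from the video's source code, and `build_request` is a hypothetical helper. The request object is only constructed here, not sent.

```python
from urllib.parse import urlsplit
from urllib.request import Request


def build_request(url: str) -> Request:
    """Build a request carrying browser-like headers for the given URL."""
    headers = {
        # Placeholder browser string; copy a real one from your own browser.
        "User-Agent": "Mozilla/5.0 (X11; Linux x86_64) AppleWebKit/537.36 "
                      "(KHTML, like Gecko) Chrome/120.0 Safari/537.36",
        # Note: urllib does not decompress gzip responses automatically.
        "Accept-Encoding": "gzip, deflate",
        # Host is derived from the URL rather than hard-coded.
        "Host": urlsplit(url).netloc,
    }
    return Request(url, headers=headers)
```

Real values for these headers can be read from the browser's developer tools (Network tab) while loading the target page.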

  • @imrantahir69 1 month ago

    ...

  • @redaibba6414 5 months ago

    Hey sir, thank you for your effort. I also work with a Python script like this, and with the ww7.manganelo link it works perfectly, but I need help with something else. How can I contact you?