Chunk large complex PDFs to summarize using LLM

  • Published: 29 Sep 2023
  • In this video, I talk about a technique for context-aware chunking of large PDFs and then summarizing the content using a map-reduce framework (implemented through LangChain).
    References:
    arxiv.org/pdf/2307.03172.pdf
    developer.adobe.com/document-...
    python.langchain.com/docs/use...
    smith.langchain.com/hub/
    code: github.com/rajib76/langchain_...
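
    The technique described above can be sketched roughly as follows. This is not the code from the video or the linked repository. It assumes the extraction step yields a list of elements with `Path` and `Text` fields (an assumed shape for structured PDF extraction output), and `summarize` is a hypothetical stand-in for whatever LLM call is used. The function names are illustrative only.

```python
from typing import Callable, Dict, List


def chunk_by_heading(elements: List[Dict[str, str]]) -> List[str]:
    """Group extracted elements into chunks, starting a new chunk at each heading.

    Assumption: each element has a "Path" such as "//Document/H1" or
    "//Document/P[2]" and a "Text" field. Adjust to the real extractor output.
    """
    chunks: List[List[str]] = []
    for element in elements:
        text = element.get("Text", "").strip()
        if not text:
            continue  # skip elements without text (figures, empty nodes)
        # The last path segment is treated as the element type, e.g. "H2" or "P[3]".
        tag = element.get("Path", "").rsplit("/", 1)[-1]
        is_heading = len(tag) >= 2 and tag[0] == "H" and tag[1].isdigit()
        # A heading opens a new chunk; text before the first heading gets its own chunk.
        if is_heading or not chunks:
            chunks.append([])
        chunks[-1].append(text)
    return ["\n".join(parts) for parts in chunks]


def map_reduce_summarize(chunks: List[str], summarize: Callable[[str], str]) -> str:
    """Summarize each chunk on its own (map), then summarize the joined results (reduce)."""
    partial_summaries = [summarize(chunk) for chunk in chunks]
    return summarize("\n".join(partial_summaries))


if __name__ == "__main__":
    # Hypothetical sample input; a real run would load the extractor's JSON instead.
    sample = [
        {"Path": "//Document/H1", "Text": "Introduction"},
        {"Path": "//Document/P", "Text": "Large PDFs rarely fit in one prompt."},
        {"Path": "//Document/H1[2]", "Text": "Method"},
        {"Path": "//Document/P[2]", "Text": "Split on headings, then summarize."},
    ]
    # Toy summarizer that truncates text; replace with a real LLM call.
    print(map_reduce_summarize(chunk_by_heading(sample), lambda t: t[:60]))
```

    Splitting at headings keeps each section's text together, so each map call sees a coherent unit instead of an arbitrary character window.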

Comments • 17

  • @fabsync
    @fabsync 5 months ago

    oh man.. thanks for your videos! They are precious gold! I love the way you think and teach!

  • @user-on1qo8un2t
    @user-on1qo8un2t 3 months ago

    Man, very great explanation. I'm constantly visiting your channel for great new tutorials :)

  • @ianmatthews8342
    @ianmatthews8342 5 months ago

    Excellent, nice idea and very well explained! Thanks!

  • @virunew
    @virunew 5 months ago

    nicely explained!

  • @aarshmehtani5468
    @aarshmehtani5468 2 months ago

    Thanks, Sir. I am working on a project based on this and found it difficult to find materials that explain the concept practically. After watching this video, I understood it, implemented it successfully, and moved ahead.
    Once again, thanks, Sir, for this video.

  • @user-ll6mn8ky8d
    @user-ll6mn8ky8d 6 months ago +3

    you'll be famous soon

  • @timtensor6994
    @timtensor6994 7 months ago

    Maybe for images: how do GPT-4 multimodal models work?

  • @bilalzahoor5608
    @bilalzahoor5608 5 months ago

    Hi Rajib, thanks for making this video. It has been really helpful as I try to build a RAG system for a B2B use case. However, I tried setting up the Adobe API, and I must say it's not easy: I am getting stuck at various steps and am not able to get a 201 response code. Can you please share the steps you followed to set up the API? Regards, Bilal

  • @PrabakaranSPpraba984
    @PrabakaranSPpraba984 9 months ago

    Hi, could you please share your LinkedIn profile? I am doing the same PoC and need some clarification.

  • @elephant742
    @elephant742 9 months ago

    Hi Rajib,
    Really insightful video. Especially the Extract API for the context-aware extraction of text from PDF.
    Are you aware of any open-source alternatives for the Extract API?
    Regards,
    Dev

    • @rajibdeb4059
      @rajibdeb4059 9 months ago +1

      :) I was also looking for one. Tabula, Camelot did not work for me. Looks like the enterprise grade solutions come with a price.

    • @MadhanAnbalagan-ff5qt
      @MadhanAnbalagan-ff5qt 7 months ago +1

      The closest open-source option I found working is unstructuredIO.

    • @rajibdeb4059
      @rajibdeb4059 7 months ago +1

      @MadhanAnbalagan-ff5qt That also did not work for me. In fact, I have shared the results with unstructured.

    • @user-ll6mn8ky8d
      @user-ll6mn8ky8d 6 months ago +1

      @rajibdeb4059 If you had to pick an open-source alternative for this, what would you choose?

    • @loknathbasak1533
      @loknathbasak1533 6 months ago

      @MadhanAnbalagan-ff5qt Did you find an okay-ish open-source one?

  • @vikasrajpurohit8730
    @vikasrajpurohit8730 5 months ago

    Thank you. Can you share your LinkedIn handle?