This is super helpful, thanks for setting this up. For chunking a PDF document, do I have to convert it to text first, and will I be able to include any data tables with quantitative data in the chunking?
I believe you should be able to use pypdf. I’ll be messing with this in a few weeks with a project I’m working on so standby. Converting a book from German to English
Hey guys I hope you enjoyed the video! If you did please subscribe to the channel! Join our Data Science Discord Here: discord.com/invite/F7dxbvHUhg If you want to watch a full course on Langchain check out Datacamp: datacamp.pxf.io/XYD7Qg Want to solve Python data interview questions: stratascratch.com/?via=ryan I'm also open to freelance data projects. Hit me up at ryannolandata@gmail.com *Both Datacamp and Stratascratch are affiliate links.
Thank you! How do you iteratively feed this to your LLM? Is there a way to do so?
This is awesome. Thank you for this. Would like to know the name text to token tool you are using.
This is super helpful, thanks for setting this up. For chunking a PDF document, do I have to convert it to text first, and will I be able to include any data tables with quantitative data in the chunking?
I believe you should be able to use pypdf. I’ll be messing with this in a few weeks with a project I’m working on so standby. Converting a book from German to English
Hey guys I hope you enjoyed the video! If you did please subscribe to the channel!
Join our Data Science Discord Here: discord.com/invite/F7dxbvHUhg
If you want to watch a full course on Langchain check out Datacamp: datacamp.pxf.io/XYD7Qg
Want to solve Python data interview questions: stratascratch.com/?via=ryan
I'm also open to freelance data projects. Hit me up at ryannolandata@gmail.com
*Both Datacamp and Stratascratch are affiliate links.