Python RAG Tutorial (with Local LLMs): AI For Your PDFs

LlamaParse: Convert PDF (with tables) to Markdown

Unstract: How To Convert PDFs, Docx, & CSV Into Structured Data For RAG With AI - Opensource!

We Took 100 Shots vs a Women's Pro Keeper and Scored ___ Goals

The FULL Guide To Get Fully AWAKENED Draco Race V4 (V1, V2 & V3) | Blox Fruits

Imagine Dragons - Take Me To The Beach (feat. Ado) (Official Lyric Video)

How to convert PDF DOCX to Structured TXT Formats for RAG! (UNSTRUCTURED Tutorial)

1littlecoder

Просмотров 8 тыс.

Добавить в
- Мой плейлист
- Посмотреть позже
Поделиться

Поделиться

HTML-код

Размер видео:

Показать панель управления

Автовоспроизведение

Автоповтор

Опубликовано: 14 янв 2025

Комментарии • 54

@IdPreferNot1 9 месяцев назад ⁺⁹
Going over libraries useful for AI dev is a great video series idea!
@1littlecoder 9 месяцев назад
Thank you. If you have any interesting choices in mind feel free to let me know :)
@eugenmalatov5470 9 месяцев назад ⁺¹
100%
@yusufersayyem7242 9 месяцев назад ⁺²
Honestly, we are lucky to know you..... Many thanks and appreciation to you, Mr. Abdul ❤
@1littlecoder 9 месяцев назад
I'm glad you found it useful :)
@Panacea_archive 9 месяцев назад ⁺¹
This channel is completely underrated! Thanks for this video
@1littlecoder 9 месяцев назад ⁺¹
Glad you think so! Thank you :)
@Nick_With_A_Stick 9 месяцев назад
I was looking for something like this to make a raw text of the hugging face documentation, since no LLM’s are trained in it since it’s available in a very weird website format. This is awesome :)
@sharanbabu2001 9 месяцев назад ⁺¹
Thanks for sharing!!
@BiMoba 9 месяцев назад ⁺⁴
An idea for better video structure would be to have a demo at the beginning, while I have some idea but had to watch until the end to understand what the library can do.
@1littlecoder 9 месяцев назад ⁺²
Thanks for the tip. Do you mean like showing the final output?
@BiMoba 9 месяцев назад ⁺¹
@@1littlecoder yes, something like input and output. It acts as a hook.
@1littlecoder 9 месяцев назад ⁺¹
@@BiMoba Thank you. I'll try to make sure!
@MrKellvalami 9 месяцев назад ⁺¹
I always find out if I'm interested in a particular video by reading the transcript summary.
@1littlecoder 9 месяцев назад
That's a clever way!
@drramasubramaniam6724 9 месяцев назад
Wow thanks Majeed that’s something which I desperately need. Was facing lot of issues for text conversion in my Rag system. Will also be helpful if you can run a tutorial on sentence window retrieval + rerank for RAG.
@faisalIqbal_AI 9 месяцев назад ⁺¹
Informative Thanks
@captainoddessy 9 месяцев назад ⁺¹
wow you are back after a week. You should take some breaks like this. AI is going crazy. You won't miss anything
@1littlecoder 9 месяцев назад ⁺³
I saw a lot of models being launched. In fact been thinking to do a weekly summary line Ai news this time.
@captainoddessy 9 месяцев назад
@@1littlecoder yea I miss you weekly AI news. You should start it again. Not the all AI stuff happened that week but like crazy ground braking invention or paper. Or whatever impresses you. In this way it won't be 20-30 min long. you can make it 10-12 min. There's a youtube channel "the friday checkout" you can follow his format.
@jmirodg7094 9 месяцев назад
Great tool Thanks!🤩
@MrLyonliang 5 месяцев назад
thanks. looking forward to advanced tutorial covering using unstructured to do chunking, rag....
@eugenmalatov5470 9 месяцев назад ⁺¹
Great video!
@1littlecoder 9 месяцев назад
Glad you enjoyed it
@Saranlisto 9 месяцев назад ⁺¹
👏👏👏👏👏
@1littlecoder 9 месяцев назад
Look who's here 😁
@OP-yr6jb 8 месяцев назад
Yes I am looking at unstructured - have you used it? How good is it for tables?
@MrPierreSab 9 месяцев назад ⁺¹
Do you know what is the difference with pandoc?
@1littlecoder 9 месяцев назад
Afaik pandoc helps you generate PDFs.
@eugenmalatov5470 9 месяцев назад
@@1littlecoder and the difference between unstructured html parser and the library html2text? And why are there pages in HTML documents in the first place?
@MrPierreSab 9 месяцев назад
@@1littlecoder I see, thanks. pdfminer is an alternative as you mentionned.
@rounaksen1683 9 месяцев назад
are you also doing vectara advanced rag hackathon ?
@adarmawan1977 9 месяцев назад
I like this !
@nithishkrish3442 7 месяцев назад
After extraction the text how to extract some information and write to a excel
@maizizhamdo 8 месяцев назад
great video boss, it support multilangues
@drmetroyt 7 месяцев назад
Sir , how to install and use this on docker , no video on internet
@1littlecoder 7 месяцев назад
I think llama index as its own docker version
@drmetroyt 7 месяцев назад
@@1littlecoder there is a docker image of unstructured io and they also give option to install as docker container but there are no instructions as how to proceed , a video on it would be very helpful
@DeepakRavi93 9 месяцев назад
PDFs will take longer to process than a text file. This creates a need to use Unstructured Commercial SaaS API. For other formats, it is okay to use.
@nirmaldesai4504 9 месяцев назад
If it is implemented, it is on-premise or calling Unstructured API which is using our ingestion data
@1littlecoder 9 месяцев назад
Whatever we did on this video is on-prem because we aren't calling any api
@mandalorian1992 5 месяцев назад
The problem with this library is that this is very generic... the main problem here is that there are much better libraries that are currently available for each of the formats which do a better job than the underlying libraries used here... Tesseract is easily beaten by easyocr, pdfminer is beaten by pymupdf and so on. It's very good with that. Also, what if docs or pdfs have images in some of the pages.
@Macorelppa 9 месяцев назад ⁺¹
Stop shtposting please 🙏
@1littlecoder 9 месяцев назад
Means
@kalilinux8682 9 месяцев назад ⁺¹
@@1littlecoder he is implying this video is shit. Which I disagree with. Although the video could have been shorter.
@1littlecoder 9 месяцев назад ⁺¹
@@kalilinux8682 i actually asked the question to make sure it's not a bot
@Macorelppa 9 месяцев назад ⁺¹
@@1littlecoder I am not a bot. LMAO.
@1littlecoder 9 месяцев назад ⁺²
@@Macorelppa Glad to know. Dealing with a lot of bots, I'm happy to see humans
@ilianos 8 месяцев назад
Oh, you briefly mentioned it uses pdf.miner under the hood? I hope not! From personal experience with testing different Python libs, I found the results of pyPDF and PyMuPDF much better.
@etahydri 9 месяцев назад
Exactly what I needed. 🥌

Следующие

Автовоспроизведение

Python RAG Tutorial (with Local LLMs): AI For Your PDFs

Python RAG Tutorial (with Local LLMs): AI For Your PDFs

LlamaParse: Convert PDF (with tables) to Markdown

LlamaParse: Convert PDF (with tables) to Markdown

Unstract: How To Convert PDFs, Docx, & CSV Into Structured Data For RAG With AI - Opensource!

Unstract: How To Convert PDFs, Docx, & CSV Into Structured Data For RAG With AI - Opensource!

We Took 100 Shots vs a Women's Pro Keeper and Scored ___ Goals

We Took 100 Shots vs a Women's Pro Keeper and Scored ___ Goals

The FULL Guide To Get Fully AWAKENED Draco Race V4 (V1, V2 & V3) | Blox Fruits

The FULL Guide To Get Fully AWAKENED Draco Race V4 (V1, V2 & V3) | Blox Fruits

Imagine Dragons - Take Me To The Beach (feat. Ado) (Official Lyric Video)

Imagine Dragons - Take Me To The Beach (feat. Ado) (Official Lyric Video)

The Greatest Comeback Of All Time?

The Greatest Comeback Of All Time?

Turn ANY FOLDER into LLM Knowledge in SECONDS

Turn ANY FOLDER into LLM Knowledge in SECONDS

How to Convert Unstructured to Stuctured Data | OpenAI Function Calls Example

How to Convert Unstructured to Stuctured Data | OpenAI Function Calls Example

Visual PDF Reader: ColPALI for RAG #ai

Visual PDF Reader: ColPALI for RAG #ai

RAG for long context LLMs

RAG for long context LLMs

Unstructured.IO: Get Your Data LLM-Ready

Unstructured.IO: Get Your Data LLM-Ready

Semantic Chunking for RAG

Semantic Chunking for RAG

This FULLY FREE Research Agent can BUILD Reports in Minutes!!!

This FULLY FREE Research Agent can BUILD Reports in Minutes!!!

Talk to Your Documents, Powered by Llama-Index

Talk to Your Documents, Powered by Llama-Index

Multi-Vector Retriever for RAG on Tables + Texts Using LANGCHAIN & UNSTRUCTURED

Multi-Vector Retriever for RAG on Tables + Texts Using LANGCHAIN & UNSTRUCTURED

Самая черная краска в мире! #musou #kiwami

Самая черная краска в мире! #musou #kiwami

ДОБЫТЬ МЯСО НА ГОД. ЖИЗНЬ В ТАЙГЕ. ОХОТА НА ЛОСЯ С ПОДХОДА.

ДОБЫТЬ МЯСО НА ГОД. ЖИЗНЬ В ТАЙГЕ. ОХОТА НА ЛОСЯ С ПОДХОДА.

Вчера мы увидели, как отключился интернет

Вчера мы увидели, как отключился интернет

как видит учитель vs что происходит на самом деле ( устное дз )

как видит учитель vs что происходит на самом деле ( устное дз )

No one appreciates Santa Claus.🎅🙏#social #knowledge

No one appreciates Santa Claus.🎅🙏#social #knowledge

Попали в СЕКРЕТНУЮ Игру в Кальмара в Реальной Жизни !

Попали в СЕКРЕТНУЮ Игру в Кальмара в Реальной Жизни !

ЗЛЫЕ РОДИТЕЛИ ЧТО-ТО СКРЫВАЮТ😱Помоги Квинке убежать🥺#роблокс #игры #смешное #интересное #квинка

ЗЛЫЕ РОДИТЕЛИ ЧТО-ТО СКРЫВАЮТ😱Помоги Квинке убежать🥺#роблокс #игры #смешное #интересное #квинка

Инстасамка: Муж Альфонс / Лживый образ / Стыд и искупление

Инстасамка: Муж Альфонс / Лживый образ / Стыд и искупление