I've been trying it, and the one difficulty I found was the installation, but I think this is a great approach to scraping. Thanks for sharing!
Love how excited you are about your project! Keep it up, man! Great project.
Thanks! Will do!
You deserve way more audience. Keep pushing man!
You wrote this project! U R The Man! :-) Thank you very much.
Great video.
I have a question regarding the exclusion of unwanted content during web page extraction. Specifically, how can headers, footers, navigational elements (including side navigation), and tables of contents be effectively removed? Considering that each website follows a different structure and pattern, it seems impractical to configure exclusion rules for every individual site.
This issue becomes even more critical as it can lead to increased storage requirements and, in some cases, false retrieval results for Retrieval-Augmented Generation (RAG) systems due to the presence of unnecessary content.
Could you share any insights or strategies to address this challenge effectively?
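For context, one generic, site-agnostic strategy is to strip semantic boilerplate tags and navigation-hinting elements from the HTML before indexing it for RAG. Below is a minimal sketch with BeautifulSoup; the tag list and the class/id hints are illustrative assumptions, not a built-in crawl4ai feature.

from bs4 import BeautifulSoup

BOILERPLATE_TAGS = ["header", "footer", "nav", "aside"]
BOILERPLATE_HINTS = ("sidebar", "toc", "table-of-contents", "breadcrumb")

def strip_boilerplate(html: str) -> str:
    soup = BeautifulSoup(html, "html.parser")
    # Detach semantic boilerplate elements outright.
    for tag in soup.find_all(BOILERPLATE_TAGS):
        tag.extract()
    # Detach elements whose id/class hints at navigation or a table of contents.
    for tag in soup.find_all(True):
        hints = " ".join([tag.get("id", "")] + tag.get("class", [])).lower()
        if any(h in hints for h in BOILERPLATE_HINTS):
            tag.extract()
    return soup.get_text(separator="\n", strip=True)

This keeps the stored text smaller and less likely to pollute retrieval, at the cost of occasionally dropping legitimate content that happens to live in those containers.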
I can't get the local Llama to work :(
Is it possible to put up a prebuilt Docker image, including the models? I had problems downloading the models during the Docker build. Thanks!
I will work on that. I'm trying to have a version without the model dependency as well.
Hey man. I'm going to be honest: I'm new to data scraping and wanted to ask if crawl4ai can be used to scrape data from TikTok. They have implemented some harsh measures with request rate limits and login requirements. From what I saw, crawl4ai has a login feature, but I just wanted to ask if I'm going in the right direction. Otherwise, it looks great.
Colab link?
Very useful project, I must admit! Is it a recursive crawler? When I say recursive, I mean it (not restricted to a depth threshold). Also, how different is this from FireCrawl, in terms of functionality and other things? I can't wait to get started using this project and give it a shot! Thanks!
Looks exciting. Have you considered a Nix script?
WHAT HAPPENED TO THE FLUTE UNCLE CODE
Hahahaha!! Ok, ok, message received
Really cool man! Can I crawl all accessible subpages from a main page? So I crawl 2 levels in total?
You can send multiple links, so first crawl the main page, then get its links and send them again. However, soon I will release the ability to set the depth and get a cool result for that.
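Until that depth option lands, here is a minimal sketch of the two-step pattern described above. It assumes the AsyncWebCrawler interface (async context manager plus an arun(url=...) call) and extracts the second-level links manually from the returned HTML with BeautifulSoup:

import asyncio
from urllib.parse import urljoin, urlparse

from bs4 import BeautifulSoup
from crawl4ai import AsyncWebCrawler

async def crawl_two_levels(start_url: str) -> list:
    results = []
    async with AsyncWebCrawler() as crawler:
        # Level 1: crawl the main page.
        main = await crawler.arun(url=start_url)
        results.append(main)
        # Collect same-domain links from the raw HTML.
        soup = BeautifulSoup(main.html or "", "html.parser")
        domain = urlparse(start_url).netloc
        links = {
            urljoin(start_url, a["href"])
            for a in soup.find_all("a", href=True)
            if urlparse(urljoin(start_url, a["href"])).netloc == domain
        }
        # Level 2: crawl each discovered subpage.
        for link in links:
            results.append(await crawler.arun(url=link))
    return results

if __name__ == "__main__":
    pages = asyncio.run(crawl_two_levels("https://example.com"))
    print(f"Crawled {len(pages)} pages")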
I got a result object. How do I parse it?
Result is an object like this:
class CrawlResult(BaseModel):
    url: str
    html: str
    success: bool
    cleaned_html: str = None
    markdown: str = None
    extracted_content: str = None
    metadata: dict = None
    error_message: str = None
So you can access these properties (cleaned_html, markdown, extracted_content), or dump the model into a Python dictionary using `result.model_dump()`.
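A tiny self-contained example of that, re-declaring the model above just for illustration (assuming pydantic v2, since model_dump() is used):

from pydantic import BaseModel

class CrawlResult(BaseModel):
    url: str
    html: str
    success: bool
    cleaned_html: str = None
    markdown: str = None
    extracted_content: str = None
    metadata: dict = None
    error_message: str = None

# Hypothetical result, just to show how the fields are read.
result = CrawlResult(
    url="https://example.com",
    html="<html>...</html>",
    success=True,
    markdown="# Example page",
)

if result.success:
    print(result.markdown)          # markdown version of the page
    data = result.model_dump()      # plain Python dict of every field
    print(sorted(data.keys()))
else:
    print("Crawl failed:", result.error_message)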
When I am using AsyncWebCrawler, I get a runtime error: there is no current event loop in thread 'MainThread'.
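That RuntimeError typically shows up when asyncio code is driven while no event loop is running (for example, something calling asyncio.get_event_loop() at the top level of a plain script). The usual fix is to put the calls in a coroutine and start it with asyncio.run(); a minimal sketch, assuming the async-context-manager / arun(url=...) usage of AsyncWebCrawler:

import asyncio

from crawl4ai import AsyncWebCrawler

async def main():
    # All awaits happen inside this coroutine.
    async with AsyncWebCrawler() as crawler:
        result = await crawler.arun(url="https://example.com")
        print(result.markdown)

if __name__ == "__main__":
    # asyncio.run() creates and manages the event loop, so there is
    # always a current loop in the main thread while main() runs.
    asyncio.run(main())

In a notebook, where a loop is already running, you would instead await main() directly in a cell.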