Locally-hosted, offline LLM w/LlamaIndex + OPT (open source, instruction-tuning LLM)

Use LLMs To Extract Data From Text (Expert Mode)

SEO in 2025 | My Predictions for SEO in 2025

Minecraft's Best Players Simulate a Cold War on Hardcore Minecraft

Sunday Trailer - "Soloist" | Honkai: Star Rail

Marvel Rivals | Developer Update | December 2024

GPT scrapes + answers from any sites (ft. Chromadb, Trafilatura)

Samuel Chan

Просмотров 7 тыс.

Добавить в
- Мой плейлист
- Посмотреть позже
Поделиться

Поделиться

HTML-код

Размер видео:

Показать панель управления

Автовоспроизведение

Автоповтор

Опубликовано: 3 дек 2024

Комментарии • 24

@moreshk Год назад
Another great video in this series!
@SamuelChan Год назад
Thank you! 🙏🏼
@MrNootka Год назад ⁺¹
love your tutorials thanks!
Tip: if you could please make your cam smaller & circular would be a great upgrade to your videos :)
@SamuelChan Год назад
Good tip! And relatively easy to implement! Thank you! :)
@arnaudlacour1188 Год назад ⁺²
when I try this exact thing I get an error that GPTChromaIndex is not in llama_index, can you think of a reason why?
@SamuelChan Год назад ⁺¹
Yes! When this lesson was published the latest version of LlamaIndex was 0.5.7.
2 months later it’s now 0.6.x.
So you can downgrade to the 0.5.7 version to follow along or just use a new environment and then pip install -r requirements.txt from the GitHub repo.
I’m in the middle of upgrading the codebase to the latest version but admittedly have limited time between my day job, so we’ll see! :)
@arnaudlacour1188 Год назад
@@SamuelChan very awesome of you to reply so quickly! Much appreciated, thank you!
@SivaKumar-of7mu Месяц назад
I also cant find repo on your git
@TheShreyas10 2 месяца назад
Hey can you please share the repo, I cant find it on your git
@noualiibrahimyassine1336 Год назад
Great tutorial, thank you.
Question: in my terminal window i'm getting only question/answer, i'm not getting the other additional informations like llm token usage, sentenceTransformer, pytorch device, etc... How can i get those informations ?
@SamuelChan Год назад
Thank you!
You can do logging many different ways and I showed them in many videos later on in this series. For example, in the "building a GPT-powered journal system"
ruclips.net/video/OzDhJOR5IfQ/видео.htmlsi=SZXzbH1hLeJ0QFzH
I use the following technique to wrap the returned results.
import logging
logging.basicConfig(stream=sys.stdout, level=logging.INFO)
logging.getLogger().addHandler(logging.StreamHandler(stream=sys.stdout))
LangChain also has its own tracking utilities:
from langchain.callbacks import get_openai_callback
with get_openai_callback() as cb:
result = llm("Your query")
print(cb)
The context manager (cb) that is printed returns:
Tokens Used: 42
Prompt Tokens: 4
Completion Tokens: 38
Successful Requests: 1
Total Cost (USD): $0.00084
@noualiibrahimyassine1336 Год назад
@@SamuelChan Thank you !
@llmia-n2x Год назад ⁺¹
Please can you make similar video with open source (free) LLM ?
@SamuelChan Год назад ⁺¹
LangChain & LLM tutorials (ft. gpt3, chatgpt, llamaindex, chroma)
ruclips.net/p/PLXsFtK46HZxUQERRbOmuGoqbMD-KWLkOS
I have a lot of videos where I use open source LLMs from huggingface. I also have a video that shows how to use a locally-hosted LLM on your machine! Check out the playlist above! :)
@llmia-n2x Год назад ⁺¹
@@SamuelChan Thanks à lot. I'll check
@utkarshpandey8967 Год назад
I am not able to use GPT ChromaIndex in python 3.10 can you suggest an alternative for this
@SamuelChan Год назад
What does "not able to use" means? did you fork from the github repo? if you install the dependencies it will work with python 3.10 (and I try to keep it up to date with every major version update of LangChain and LlamaIndex) -- cant see any reason why it wont work.
@ramp2011 Год назад
Thank you for the video. I just checked your github and I do not see the code copied over. Could you please copy over this code there? Thank you
@SamuelChan Год назад
Hey its here in the GitHub repo!
github.com/onlyphantom/llm-python/blob/main/6_team.py
@8eck Год назад
This Trafilatura is able to read javascript websites? I mean, can it read react-based websites?
@SamuelChan Год назад
Depends on whether the react side uses SSG (static site generation), SSR (server side rendering) or CSR (client), it works like any other web crawler / scraper :)
@8eck Год назад
@@SamuelChan naah, i was talking exactly about non SSR or static generated.
@8eck Год назад
@@SamuelChan Guess it can read only non-js content.
@SamuelChan Год назад ⁺¹
Yeah not with Trafilatura I don’t think
I think for those cases you can use an automation tool like Selenium to do a wait, wait for 1 second till content has loaded, and then retrieve. If div id not found, wait another 1 second etc in a while loop with break statement?

Следующие

Автовоспроизведение

Locally-hosted, offline LLM w/LlamaIndex + OPT (open source, instruction-tuning LLM)

Locally-hosted, offline LLM w/LlamaIndex + OPT (open source, instruction-tuning LLM)

Use LLMs To Extract Data From Text (Expert Mode)

Use LLMs To Extract Data From Text (Expert Mode)

SEO in 2025 | My Predictions for SEO in 2025

SEO in 2025 | My Predictions for SEO in 2025

Minecraft's Best Players Simulate a Cold War on Hardcore Minecraft

Minecraft's Best Players Simulate a Cold War on Hardcore Minecraft

Sunday Trailer - "Soloist" | Honkai: Star Rail

Sunday Trailer — "Soloist" | Honkai: Star Rail

Marvel Rivals | Developer Update | December 2024

Marvel Rivals | Developer Update | December 2024

Was Pandoras Vault Really Made in Survival?

Was Pandoras Vault Really Made in Survival?

Learn Prompt Engineering: Full Beginner Crash Course (5 HOURS!)

Learn Prompt Engineering: Full Beginner Crash Course (5 HOURS!)

Understanding Embeddings in LLMs (ft LlamaIndex + Chroma db)

Understanding Embeddings in LLMs (ft LlamaIndex + Chroma db)

Let's build GPT: from scratch, in code, spelled out.

Let's build GPT: from scratch, in code, spelled out.

Web scraping and text content extraction - Beginner tutorial for Python and the command-line

Web scraping and text content extraction - Beginner tutorial for Python and the command-line

What is OpenTelemetry?

What is OpenTelemetry?

Langchain vs Llama Index: Which one should you use?

Langchain vs Llama Index: Which one should you use?

Comedy Club: Кастинг на шоу «Конфетка» | Гарик Харламов, Гарик Мартиросян @TNT_television

Comedy Club: Кастинг на шоу «Конфетка» | Гарик Харламов, Гарик Мартиросян @TNT_television

A language for LLM prompt design | Guidance

A language for LLM prompt design | Guidance

Building an AI language tutor: Pinecone + LlamaIndex + GPT-3 + BeautifulSoup

Building an AI language tutor: Pinecone + LlamaIndex + GPT-3 + BeautifulSoup

I helped Santa Claus

I helped Santa Claus

Более 8.000.000 просмотров в соцсетях у блокбастера «Чёрная пантера» Филиппа Киркорова!

Более 8.000.000 просмотров в соцсетях у блокбастера «Чёрная пантера» Филиппа Киркорова!

ВИННИ ПУХ К Р О В Ь И МЁД ! ПОЯВЛЕНИЕ ПЯТАКА...СТРАШНАЯ ИСТОРИЯ В РЕАЛЬНОЙ ЖИЗНИ!

ВИННИ ПУХ К Р О В Ь И МЁД ! ПОЯВЛЕНИЕ ПЯТАКА...СТРАШНАЯ ИСТОРИЯ В РЕАЛЬНОЙ ЖИЗНИ!

СТЕПА ЗАБЕРЕМЕННЕЛ И РОДИЛ ДЕТЕЙ 🤷‍♂️

СТЕПА ЗАБЕРЕМЕННЕЛ И РОДИЛ ДЕТЕЙ 🤷‍♂️

ПОХУДЕЛА НА ОЗЕМПИКЕ?!😵 «у мамы на кухне» на канале Аня Ищук #аняищук #герберы

ПОХУДЕЛА НА ОЗЕМПИКЕ?!😵 «у мамы на кухне» на канале Аня Ищук #аняищук #герберы

Вру при дочке и смотрю на её реакцию 🌝

Вру при дочке и смотрю на её реакцию 🌝

Где реальные 240 герц?

Где реальные 240 герц?

ФИЛЬМ! ЮЛИЙ ЦЕЗАРЬ ИЗ ПРОСТОГО ВОЕНАЧАЛЬНИКА СТАЛ ПОВЕЛИТЕЛЯ РИМА! ЮЛИЙ ЦЕЗАРЬ! Русский фильм

ФИЛЬМ! ЮЛИЙ ЦЕЗАРЬ ИЗ ПРОСТОГО ВОЕНАЧАЛЬНИКА СТАЛ ПОВЕЛИТЕЛЯ РИМА! ЮЛИЙ ЦЕЗАРЬ! Русский фильм