37% Better Output with 15 Lines of Code - Llama 3 8B (Ollama) & 70B (Groq)

  • Published: 29 Jun 2024
  • To try everything Brilliant has to offer, free for a full 30 days, visit brilliant.org/AllAboutAI. You'll also get 20% off an annual premium subscription.
    37% Better Output with 15 Lines of Code - Llama 3 8B (Ollama) & 70B (Groq)
    GitHub Project:
    github.com/AllAboutAI-YT/easy...
    👊 Become a member and get access to GitHub and Code:
    / allaboutai
    🤖 Great AI Engineer Course:
    scrimba.com/learn/aiengineer?...
    📧 Join the newsletter:
    www.allabtai.com/newsletter/
    🌐 My website:
    www.allabtai.com
    In this video I try to improve a known problem when using RAG with a local model like Llama 3 8B on Ollama. This local RAG system was improved by adding just around 15 lines of code (a rough sketch of the idea follows below the description). Feel free to share and rate it on GitHub :)
    00:00 Llama 3 Improved RAG Intro
    02:01 Problem / Solution
    03:05 Brilliant.org
    04:26 How this works
    12:05 Llama 3 70B Groq
    15:12 Conclusion
  • Science
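
    As a hedged illustration of the idea above: a minimal query-rewriting step of roughly this size, assuming the ollama Python client (pip install ollama). The model name, prompt wording, and function name are illustrative, not the actual repo code.

    import ollama

    def rewrite_query(user_query, conversation_history):
        # Fold the recent turns into the question so the retrieval step
        # sees a self-contained query instead of "what about the 70B one?".
        context = "\n".join(conversation_history[-4:])
        prompt = (
            "Rewrite the question below so it can be understood without the "
            "conversation context. Return only the rewritten question.\n\n"
            f"Conversation:\n{context}\n\nQuestion: {user_query}"
        )
        response = ollama.chat(
            model="llama3",
            messages=[{"role": "user", "content": prompt}],
        )
        return response["message"]["content"].strip()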

Comments • 31

  • @AllAboutAI
    @AllAboutAI  2 months ago +3

    Brilliant: To try everything Brilliant has to offer, free for a full 30 days, visit brilliant.org/AllAboutAI. You'll also get 20% off an annual premium subscription.

  • @pec8377
    @pec8377 2 months ago +1

    @AllAboutAI The issue is that it assumes the question is related to the content passed, which is not always the case in a conversation. If you suddenly talk about something else, say "How are you", it will be rewritten to align with the preceding context, which is not what you want. Then you need to implement some extra mechanism, or tweak your prompt to only rephrase when the question seems linked to the past. Many discussions about this.
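
    A sketch of such a gate, as a hedged illustration (ollama client assumed; prompt text and names are made up; rewrite_query refers to the sketch in the description above):

    import ollama

    def maybe_rewrite(user_query, conversation_history):
        # First ask whether the new question depends on the prior turns at all.
        context = "\n".join(conversation_history[-4:])
        gate = ollama.chat(
            model="llama3",
            messages=[{"role": "user", "content":
                f"Conversation so far:\n{context}\n\n"
                f"New question: {user_query}\n\n"
                "Does the new question depend on the conversation above? "
                "Answer only YES or NO."}],
        )["message"]["content"]
        if "YES" not in gate.upper():
            # Unrelated small talk ("How are you") passes through untouched.
            return user_query
        return rewrite_query(user_query, conversation_history)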

  • @MattJonesYT
    @MattJonesYT 2 months ago +8

    Another approach to this is to just ask a simple LLM to hallucinate an answer to the current chat. That answer will not be correct, but it will probably have the phrases the RAG system needs to find the right excerpts. There's a technical term for this idea which I can't remember, but I came across it on the TwoSetAI channel, which has a lot of similar tricks.

    • @robboerman9378
      @robboerman9378 2 months ago +3

      HyDE, Hypothetical Document Embeddings. It works very well and is easy to implement. Similarity search on a vector database using a hallucinated answer instead of the question usually gives better matches.
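
      A minimal HyDE sketch under the same assumptions (ollama client; using llama3 for both generation and embeddings is an arbitrary choice):

      import ollama

      def hyde_embedding(question):
          # Step 1: hallucinate a plausible answer; correctness doesn't matter.
          draft = ollama.chat(
              model="llama3",
              messages=[{"role": "user", "content":
                  "Write a short, plausible answer to this question, guessing "
                  f"if you must:\n\n{question}"}],
          )["message"]["content"]
          # Step 2: embed the draft instead of the bare question; its phrasing
          # tends to sit closer to the relevant passages in vector space.
          return ollama.embeddings(model="llama3", prompt=draft)["embedding"]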

    • @AllAboutAI
      @AllAboutAI  2 months ago +1

      yes this is nice, thnx :)

    • @kenhtinhthuc
      @kenhtinhthuc 2 months ago +2

      RAG is a bit too much of an exact match because it is based on concepts and similar concepts; therefore, no match, no return. HyDE makes the search a bit fuzzier by expanding the query and introducing more concepts. It would be good to have an evaluator to check the faithfulness of the retrieval and the relevance of the outputs to the original query.
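
      One way such an evaluator could look, sketched as an LLM-as-judge prompt (the rubric and output format are invented for illustration):

      import ollama

      def evaluate_retrieval(question, retrieved_chunks, answer):
          # Ask a model to grade groundedness and relevance of the final answer.
          prompt = (
              f"Question: {question}\n\nRetrieved context:\n"
              + "\n---\n".join(retrieved_chunks)
              + f"\n\nAnswer: {answer}\n\n"
              "Rate from 1 to 5: (a) is the answer faithful to the context, "
              "(b) is it relevant to the question? "
              "Reply exactly as: faithfulness=<n> relevance=<n>"
          )
          reply = ollama.chat(model="llama3",
                              messages=[{"role": "user", "content": prompt}])
          return reply["message"]["content"]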

  • @ASchnacky
    @ASchnacky 2 months ago +5

    Dolphin-llama3 & Groq-llama3
    are awesome! Well done!

    • @ByZaMo64
      @ByZaMo64 2 months ago

      how are they different?

  • @MarcShade
    @MarcShade 2 months ago +5

    dolphin-llama3:8b-v2.9-fp16 is so good as an assistant!

    • @ASchnacky
      @ASchnacky 2 months ago +1

      Dolphin-llama3 & Groq-llama3

  • @nic-ori
    @nic-ori 2 months ago +3

    👍👍👍Thanks! Useful information.

  • @futureworldhealing
    @futureworldhealing 2 months ago +2

    best AI python coding channel hands down

  • @Edoras5916
    @Edoras5916 2 months ago

    Direct, didactic, almost textbook explanation. Excellent.

  • @technolus5742
    @technolus5742 2 months ago +1

    Great job

  • @realorfake4765
    @realorfake4765 2 months ago +1

    Based on your experience, why is Ollama better than LM Studio?

  • @samyio4256
    @samyio4256 1 month ago

    How is the retrieval so fast? Did you cut the loading time for context out of the video?

  • @akimezra7178
    @akimezra7178 1 month ago

    Bruuuuuuh, just found this channel, you sure you're human?!?! Wish I had 5% of your brain... thank you so much for your work! I'm learning so much!!

    • @SeattleShelby
      @SeattleShelby 1 month ago

      You just need a bigger neck beard. It’s all in the neck beard.

  • @elsondasilva8636
    @elsondasilva8636 2 months ago +1

    💎💎🌟💎💎💎💎

  • @iamisobe
    @iamisobe 2 months ago +1

    first

  • @monstercameron
    @monstercameron 2 months ago +1

    What about doing the same for the output? One pass is the internal voice; compare it to the prompt to see if it matches up, and do a second pass for any corrections. Like giving LLMs an inner voice like we have.
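
    A rough sketch of that two-pass idea (draft, then self-check against the prompt), with all wording illustrative:

    import ollama

    def answer_with_inner_voice(task):
        # Pass 1: the "internal voice" produces a draft.
        draft = ollama.chat(
            model="llama3",
            messages=[{"role": "user", "content": task}],
        )["message"]["content"]
        # Pass 2: compare the draft to the prompt and correct any mismatches.
        critique = (
            f"Task: {task}\n\nDraft answer: {draft}\n\n"
            "Check the draft against the task, fix any errors or mismatches, "
            "and return only the corrected answer."
        )
        return ollama.chat(
            model="llama3",
            messages=[{"role": "user", "content": critique}],
        )["message"]["content"]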

  • @buttpub
    @buttpub 2 months ago +1

    The problem (and the solution) is that your setup is stateless.

    • @AllAboutAI
      @AllAboutAI  2 months ago

      interesting, will look into it

    • @buttpub
      @buttpub 2 months ago

      @AllAboutAI LLMs such as those built on transformer architectures are fundamentally stateless, meaning they do not inherently maintain information about previous inputs across separate input sequences the way recurrent neural networks do. However, they can emulate state-like behavior through positional and specialized embeddings that incorporate contextual information within a given sequence. Although each step processes data in a stateless manner, the autoregressive nature of many LLMs lets them generate text by sequentially predicting the next token based on the accumulated outputs, mimicking a form of statefulness. This allows them to handle extensive and complex sequences effectively, though each processing step inherently lacks a continuous internal state beyond its immediate inputs.
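
      In practice, the usual way to emulate that state is to resend the accumulated history on every call, e.g. (a minimal sketch, ollama client assumed):

      import ollama

      history = []  # the only "memory" is whatever we choose to resend

      def chat_turn(user_message):
          # Append the new turn, send the full transcript, store the reply.
          history.append({"role": "user", "content": user_message})
          reply = ollama.chat(model="llama3", messages=history)
          content = reply["message"]["content"]
          history.append({"role": "assistant", "content": content})
          return content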