RAG with HuggingFace + FAISS + Langchain | Retrieval Augmented Generation
- Published: 17 Feb 2024
- RAG with Hugging Face, Faiss, and LangChain: A Powerful Combo for Information Retrieval and Generation
Retrieval-augmented generation (RAG) is a technique that combines the strengths of dense retrieval and sequence-to-sequence (seq2seq) models to generate informative and context-aware responses. Here's how it works in this specific setup:
Hugging Face: This open-source platform provides pre-trained models for text embedding and generation, which are key components of RAG.
Faiss: This library implements efficient similarity search algorithms, allowing RAG to quickly find relevant documents from a large corpus.
LangChain: This workflow automation tool helps orchestrate the different steps involved in the RAG pipeline, making it easier to build and deploy complex systems.
RAG cookbook: huggingface.co/learn/cookbook...
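The retrieve-then-generate loop described above can be sketched in plain Python. This is a toy stand-in, not the actual stack: a real pipeline uses a Hugging Face embedding model instead of word counts, FAISS instead of a linear cosine scan, and LangChain to wire the steps together.

```python
# Toy sketch of the retrieve-then-generate flow that Hugging Face + FAISS +
# LangChain implement for real. Embeddings are faked with word-count vectors.
import math
from collections import Counter

def embed(text):
    """Stand-in embedding: a bag-of-words Counter (real RAG uses a dense model)."""
    return Counter(text.lower().split())

def cosine(a, b):
    dot = sum(a[w] * b[w] for w in a)
    na = math.sqrt(sum(v * v for v in a.values()))
    nb = math.sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0

corpus = [
    "FAISS performs fast similarity search over dense vectors",
    "LangChain orchestrates the steps of a RAG pipeline",
    "Hugging Face hosts pretrained embedding and generation models",
]
index = [(doc, embed(doc)) for doc in corpus]  # the "vector store"

def retrieve(query, k=1):
    q = embed(query)
    return [doc for doc, vec in sorted(index, key=lambda p: -cosine(q, p[1]))[:k]]

def generate(query, context):
    # A real system prompts an LLM with the retrieved context;
    # here we just splice the context into a template.
    return f"Q: {query}\nContext: {context[0]}"

ctx = retrieve("which library does similarity search")
print(generate("which library does similarity search", ctx))
```

Swapping in real components means replacing `embed` with a sentence-transformers model, `index`/`retrieve` with a FAISS index, and `generate` with an LLM call.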
Very helpful video! I'm doing this at my company for IT help desk questions, but with an LLM via Azure OpenAI. Should I deploy ada-002 for embedding creation, and how would you do the search if it's not a simple content column but various PDF documents? Azure AI Search might be helpful but is way too expensive.
Yes, you can use the ada embedding model for generating embedding vectors and store them in a vector store. If you have large documents, you can split them using one of the chunking techniques available in many RAG frameworks, ensuring the embedding model doesn't silently truncate your input.
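The idea behind those chunking techniques can be shown with a minimal fixed-size splitter. This is a sketch of the simplest strategy only; LangChain's splitters (e.g. RecursiveCharacterTextSplitter) refine it by preferring paragraph and sentence boundaries, but the size/overlap trade-off is the same.

```python
def chunk(text, size=200, overlap=40):
    """Fixed-size character chunks with overlap, so content near a boundary
    still appears intact in at least one chunk before embedding."""
    chunks = []
    start = 0
    while start < len(text):
        chunks.append(text[start:start + size])
        start += size - overlap  # step forward, keeping `overlap` chars shared
    return chunks

pieces = chunk("0123456789" * 50)  # 500 chars -> 4 overlapping chunks
```

Each chunk is then embedded and stored separately, so retrieval returns the passage, not the whole document.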
How would you do the search? What tool do you recommend? I'm very new to the topic. Thanks in advance.
@mlworks Can you tell me about the chunking techniques and the RAG application? How could I start? :)
@Z4rnii For chunking, follow this notebook.
github.com/FullStackRetrieval-com/RetrievalTutorials/blob/main/5_Levels_Of_Text_Splitting.ipynb
@Z4rnii Simple RAG - huggingface.co/learn/cookbook/en/rag_zephyr_langchain
Can I use the Gemma 7B-it model instead of the Intel one?
Yes.
What are that page_content_column and context? I don't see context in your printed data.
Also, I have a CSV file which I loaded using CSVLoader. It has a column "instruction" containing questions and a column "output" containing the answers. Do I have to create embeddings of the entire data with both columns, or just the "output" column?
page_content_column is the parameter that selects which column of the dataset becomes the document text; here it is the context column. Check out the context column here: huggingface.co/datasets/databricks/databricks-dolly-15k
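What page_content_column selects can be shown with a stdlib sketch (the row below mimics the shape of databricks-dolly-15k; the loader class itself is not used here): the chosen column becomes the document body that gets embedded, and the remaining columns become metadata.

```python
# Rows shaped like databricks-dolly-15k (illustrative values, not real data).
rows = [
    {"instruction": "What is RAG?",
     "context": "RAG combines retrieval with generation.",
     "response": "..."},
]

def to_documents(rows, page_content_column):
    """Mimic a dataset loader: one column becomes the embeddable text,
    the rest are kept as metadata."""
    docs = []
    for row in rows:
        docs.append({
            "page_content": row[page_content_column],  # text that gets embedded
            "metadata": {k: v for k, v in row.items() if k != page_content_column},
        })
    return docs

docs = to_documents(rows, page_content_column="context")
```

So with page_content_column="context", the context text is what lands in the vector store, while instruction and response ride along as metadata.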
What is the task you're going to implement with the dataset? QA I assume
@mlworks Yes, a QA task. I'm supposed to use an LLM to make it more interactive, and that's where I got confused: when I passed the data, my bot responded with grammatically incorrect sentences.
@rps6949 RAG helps LLMs provide accurate and precise information grounded in a knowledge base. Are you using a pretrained LLM, or a custom LLM fine-tuned on your dataset that has learned the question-answer mapping? If the latter, you can generate embeddings of your answers, store them in a vector store, and then query the system.
@rps6949 If you mean interactive as in a chatbot, you can set a system prompt for the LLM so it acts as a conversation agent.
@mlworks I'm using a pretrained model. In your code you use datasets and the Hugging Face loader to load the data, but what I have to build is a UI where the user uploads a CSV file, so I used CSVLoader. Just like your context column, I have a text column, but I can't point the loader at that column for embedding; I just load everything with loader.load(). What can I do here?
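For the CSV case in this thread, one option is to bypass the loader's defaults and build the documents yourself, picking the column to embed explicitly. This is a stdlib sketch; the "instruction"/"output" column names follow the commenter's description and are assumptions.

```python
import csv
import io

# Inline sample standing in for the uploaded file; with a real upload,
# replace io.StringIO(...) with open("qa.csv", newline="").
csv_text = "instruction,output\nWhat is RAG?,Retrieval-augmented generation\n"

docs = []
with io.StringIO(csv_text) as f:
    for row in csv.DictReader(f):
        docs.append({
            "page_content": row["output"],                    # column to embed
            "metadata": {"instruction": row["instruction"]},  # kept for display
        })
```

The resulting docs list plays the same role as the loader output: only the chosen column's text goes to the embedding model, and the question stays available as metadata.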