Fine-Tuning T5 on Question Answer dataset using Huggingface Transformer & PyTorch

  • Published: Jan 9, 2025

Comments • 27

  • @raihanpahlevi6870
    @raihanpahlevi6870 3 months ago

    Very helpful, thanks!

  • @superfreiheit1
    @superfreiheit1 15 days ago

    Do I have to provide context (input) along with my question at the inference stage?

    • @DevelopersHutt
      @DevelopersHutt  14 days ago

      It works well on the dataset it was trained on, but if you feed it your own question with the context concatenated, it will run yet likely produce garbage. It's too small a transformer to have much reasoning ability.
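
For reference, T5 QA fine-tunes usually concatenate the question and context into a single input string. A minimal sketch of that common convention (the video's exact template may differ, so treat the prefix wording as an assumption):

```python
def build_t5_input(question: str, context: str) -> str:
    """Concatenate question and context into one T5 input string.

    This follows the common "question: ... context: ..." convention;
    whatever prefix was used during training must be reproduced here.
    """
    return f"question: {question} context: {context}"

# Example usage at inference time:
prompt = build_t5_input(
    "Who wrote Hamlet?",
    "Hamlet is a tragedy by William Shakespeare.",
)
```

The key point is that the inference-time input must match the training-time format exactly, prefix and all.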

  • @yvonnewu7668
    @yvonnewu7668 11 months ago

    Thanks, this video is so helpful. Just one question: what part are we fine-tuning? Are we fine-tuning all layers?

  • @Ananya-j7g
    @Ananya-j7g a year ago

    Thanks for the video. Can we download the fine-tuned tokenizer and model from Google Colab for later use? If yes, how?

    • @DevelopersHutt
      @DevelopersHutt  a year ago

      You can use os.getcwd() with os.path.join, or just "./model", to save it in the directory you want. Then simply double-click the file in the Colab file browser to download it.
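
A minimal sketch of that save-and-download flow. The `save_pretrained` calls assume the fine-tuned `model` and `tokenizer` objects from the video, so they are shown commented out to keep the snippet self-contained; zipping the folder makes it a single downloadable file in Colab:

```python
import os
import shutil

# Save directory inside the current working directory, as suggested above.
save_dir = os.path.join(os.getcwd(), "model")
os.makedirs(save_dir, exist_ok=True)

# Hypothetical: with the fine-tuned objects from the video you would call
# model.save_pretrained(save_dir)
# tokenizer.save_pretrained(save_dir)

# Zip the folder so it can be downloaded as one file from the Colab sidebar.
archive = shutil.make_archive("model", "zip", save_dir)
```
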

  • @dhruvtiwari661
    @dhruvtiwari661 a year ago

    Can you make a video on document question answering?

  • @sauravfarkade1928
    @sauravfarkade1928 2 months ago

    Hey, it's a very nice video.
    Can you guide me if I want to create a Natural Language to SQL query conversion project? How can I build it using Hugging Face and NLTK? I'm not sure how to approach this.

    • @DevelopersHutt
      @DevelopersHutt  2 months ago +1

      Use an LLM to convert English to a SQL query.

    • @sauravfarkade1928
      @sauravfarkade1928 2 months ago

      @@DevelopersHutt Can you please guide me through the flow, like how I can do it?
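
One rough sketch of the flow the reply suggests: give an LLM the table schema plus the user's question, and parse the generated SQL out of the response. The model call itself is out of scope here, so only a hypothetical prompt-building step is shown (the wording is illustrative, not a fixed API):

```python
def build_sql_prompt(schema: str, question: str) -> str:
    """Build a text-to-SQL prompt from a schema description and a question."""
    return (
        "Translate the question into a SQL query.\n"
        f"Schema: {schema}\n"
        f"Question: {question}\n"
        "SQL:"
    )

prompt = build_sql_prompt(
    "employees(id, name, salary)",
    "Who earns more than 50000?",
)
# The prompt would then be sent to an instruction-tuned LLM and the text
# generated after "SQL:" taken as the query (validate it before executing).
```
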

  • @tranhuy216
    @tranhuy216 a year ago

    Great video, I'm kind of new to PyTorch. I've already trained and saved the model; how do I load it again for inference?

    • @DevelopersHutt
      @DevelopersHutt  a year ago +1

      You can use
      tokenizer = T5TokenizerFast.from_pretrained("path/to/saved/tokenizer")
      model = T5ForConditionalGeneration.from_pretrained("path/to/saved/model")
      and the rest of the inference code is the same as in the video.

    • @tranhuy216
      @tranhuy216 a year ago

      @@DevelopersHutt Thanks a lot! Great video, love the way you write and explain every line of code.

  • @umaisimran4383
    @umaisimran4383 a year ago

    Can we fine-tune this model to generate interview questions from the job description (as context), or is there any other model that can do such a thing?

    • @DevelopersHutt
      @DevelopersHutt  a year ago +1

      With the right dataset, absolutely yes.
      There are many other great models out there, but you may find them a bit large and computationally expensive, so T5 might be the optimal choice.

    • @umaisimran4383
      @umaisimran4383 a year ago

      @@DevelopersHutt Can you please guide me on how to do so?

    • @DevelopersHutt
      @DevelopersHutt  a year ago +1

      @@umaisimran4383 First you'll need to prepare a basic dataset.
      The dataset should contain the following:
      1. Popular questions for every skill. For the keyword python, for example, there should be some 4 to 5 questions in the dataset.
      2. You can also pair a difficulty level with each skill and get questions accordingly. For python you could have multiple rows containing different questions based on experience level.
      Similarly, you can add multiple things to make your dataset diverse.
      3. Then convert your dataset into the format T5 requires.
      This is a very high-level overview, and of course you can break these down into sub-problems according to your use case.
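
A sketch of step 3 above: turning (skill, difficulty, question) rows into the input/target text pairs T5 trains on. The field names and the prompt template here are hypothetical, not taken from the video:

```python
# Hypothetical rows, as described in steps 1 and 2 above.
rows = [
    {"skill": "python", "difficulty": "junior",
     "question": "What is a list comprehension?"},
    {"skill": "python", "difficulty": "senior",
     "question": "How does the GIL affect threading?"},
]

def to_t5_pair(row: dict) -> dict:
    """Map one dataset row to a T5 (input_text, target_text) pair."""
    return {
        "input_text": f"generate question: skill: {row['skill']} "
                      f"difficulty: {row['difficulty']}",
        "target_text": row["question"],
    }

pairs = [to_t5_pair(r) for r in rows]
```

Each pair would then be tokenized the same way as in the video, with `input_text` fed to the encoder and `target_text` used as the labels.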

    • @umaisimran4383
      @umaisimran4383 a year ago

      @@DevelopersHutt Thank you so much.

  • @MuhammadZubair-cu6cx
    @MuhammadZubair-cu6cx 7 months ago +1

    Hi there,
    Thank you! I need your help with translating from Urdu to English using T5 small. Could you please guide me? I'm willing to tip $50 for your assistance. I'm quite new to this. I also have a dataset ready.
    Thanks!

  • @arf_atelier1819
    @arf_atelier1819 10 months ago

    Can I fine-tune the model to generate a sentence from a set of keywords?

    • @DevelopersHutt
      @DevelopersHutt  10 months ago

      T5 will not be an optimal approach if you want these kinds of results.
      You can pick models like Phi-2 or LLaMA 7B and fine-tune them using adapters.
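
A hedged sketch of that adapter route using the PEFT library's LoRA support. This is a config fragment only: model loading and the training loop are omitted, and the hyperparameters are illustrative, not recommendations:

```python
from peft import LoraConfig, get_peft_model

# Illustrative LoRA hyperparameters; tune them for your task.
lora_config = LoraConfig(
    r=8,               # adapter rank
    lora_alpha=16,     # scaling factor
    lora_dropout=0.05,
    task_type="CAUSAL_LM",
)

# Hypothetical usage: wrap a base model such as phi-2 so that only the
# small adapter weights are trained, keeping memory needs modest.
# from transformers import AutoModelForCausalLM
# base = AutoModelForCausalLM.from_pretrained("microsoft/phi-2")
# model = get_peft_model(base, lora_config)
```
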

    • @arf_atelier1819
      @arf_atelier1819 9 months ago

      @@DevelopersHutt Can it run on a local CPU?

    • @DevelopersHutt
      @DevelopersHutt  9 months ago

      @@arf_atelier1819 It can, but it will be terribly slow.

  • @haulu159
    @haulu159 8 months ago

    With another dataset, can I do it like this, or do I need to customize the dataset?

    • @DevelopersHutt
      @DevelopersHutt  8 months ago

      If you're using the same process as in the video, then yes, you'll need the data in the same format.
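
For a concrete idea of what "the same format" means, here is a hypothetical mapping from a custom dataset's fields onto a question/context/answer layout like the one a SQuAD-style T5 fine-tune expects. Your field names will differ, and the target column names are an assumption about the video's pipeline:

```python
# A custom dataset with its own field names (hypothetical example).
raw = [
    {"query": "What is T5?",
     "passage": "T5 is a text-to-text transformer.",
     "reply": "a text-to-text transformer"},
]

def to_qa_format(example: dict) -> dict:
    """Rename custom fields to an assumed question/context/answer layout."""
    return {
        "question": example["query"],
        "context": example["passage"],
        "answer": example["reply"],
    }

converted = [to_qa_format(e) for e in raw]
```

Once every row exposes the same columns the training code expects, the rest of the pipeline from the video can be reused unchanged.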