Efficient Fine-Tuning for Llama-v2-7b on a Single GPU
- Published: 6 Aug 2024
- The first problem you're likely to encounter when fine-tuning an LLM is the "out of memory" error, and it is even harder to avoid with the 7B-parameter Llama-2 model, which requires that much more memory. In this talk, Piero Molino and Travis Addair from the open-source Ludwig project show you how to tackle this problem.
The good news is that, with an optimized LLM training framework like Ludwig.ai, you can bring the memory overhead back down to a reasonable level, even when training on a single GPU.
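To see why memory runs out, here is a back-of-the-envelope sketch, assuming full fine-tuning of a 7B-parameter model with Adam in mixed precision (exact numbers vary by setup; these figures are only illustrative):
```python
# Rough memory estimate for FULL fine-tuning of a 7B-parameter model
# with Adam in mixed precision. All figures are illustrative approximations.
params = 7e9

weights_fp16 = params * 2            # 2 bytes per fp16 weight
grads_fp16 = params * 2              # 2 bytes per fp16 gradient
adam_states_fp32 = params * 4 * 2    # fp32 momentum + variance, 4 bytes each
master_weights_fp32 = params * 4     # fp32 master copy kept by the optimizer

total_gb = (weights_fp16 + grads_fp16
            + adam_states_fp32 + master_weights_fp32) / 1e9
print(f"~{total_gb:.0f} GB before activations")  # ~112 GB, vs 16 GB on a T4
```
QLoRA attacks exactly these terms: the base weights are loaded in 4-bit and frozen (so no gradients or optimizer states for them), and only small LoRA adapters are trained, which is what makes a single T4 feasible.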
In this hands-on workshop, we'll discuss the unique challenges of fine-tuning LLMs and show, through a demo, how you can tackle these challenges with open-source tools.
By the end of this session, attendees will understand:
- How to fine-tune LLMs like Llama-2-7b on a single GPU
- Techniques like parameter-efficient fine-tuning and quantization, and how they can help
- How to train a 7B-parameter model on a single T4 GPU with QLoRA (see the sketch after this list)
- How to deploy tuned models like Llama-2 to production
- Continued training with RLHF
- How to use RAG to do question answering with trained LLMs
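For reference, here is a minimal sketch of such a QLoRA fine-tuning run with Ludwig (modeled on Ludwig's documented LLM config schema; the dataset file and the "instruction"/"output" column names are placeholders for your own data):
```python
# Minimal QLoRA fine-tuning sketch with Ludwig. The dataset path and
# column names are placeholders; the config follows Ludwig's LLM schema.
import yaml
from ludwig.api import LudwigModel

config = yaml.safe_load("""
model_type: llm
base_model: meta-llama/Llama-2-7b-hf
input_features:
  - name: instruction
    type: text
output_features:
  - name: output
    type: text
adapter:
  type: lora        # parameter-efficient: train only small adapter matrices
quantization:
  bits: 4           # load the frozen base model in 4-bit (QLoRA)
trainer:
  type: finetune
  epochs: 3
  batch_size: 1
  gradient_accumulation_steps: 16
""")

model = LudwigModel(config=config)
model.train(dataset="my_dataset.csv")  # hypothetical dataset file
```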
This session will equip ML engineers to unlock the capabilities of LLMs like Llama-2 for their own projects.
This event is inspired by DeepLearning.AI’s GenAI short courses, created in collaboration with AI companies across the globe. Our courses help you learn new skills, tools, and concepts efficiently within 1 hour.
www.deeplearning.ai/short-cou...
Here is the link to the notebook used in the workshop:
pbase.ai/FineTuneLlama
Speakers:
Piero Molino, Co-founder and CEO of Predibase
/ pieromolino
Travis Addair, Co-founder and CTO of Predibase
/ travisaddair
Very helpful! Already trained llama-2 with custom classifications using the cookbook. Thanks!
Very informative. Direct and to-the-point content in an easily understandable presentation.
Amazing, can’t wait to play and train my first model 🎉
Excellent coverage, thank you.
Great content, well presented!
Really helpful. Thank you 👍
Clear and informative, thanx.
Great video, thank you!
One of the most complete videos. Must watch
Well this was simply excellent, thank you 🙏🏻
Great job, thumbs up!
Very helpful. Thanks.
Thank you!
Excellent, crystal-clear surgery on GPU VRAM utilization...
Amazing content on fine-tuning LLMs
🖖 alignment by sectoring hyperparameters in behaviour, nice one
amazing video
Eh, that was great. Thank you very much!
Amazing ❤
Thank you
I'd like to kindly request @DeepLearningAI to prepare a similar hands-on workshop on fine-tuning source-code models.
Don't miss our short course on the subject! www.deeplearning.ai/short-courses/finetuning-large-language-models/
@Deeplearningai Wow, thanks.
Please can you provide a link to the slides?
Cool video. If I want to fine-tune it on a single specific task (keyword extraction), should I first train an instruction-tuned model and then train that on my specific task? Or mix the datasets together?
Also working on keyword extraction! I was wondering if you'd had any success fine-tuning?
An Nvidia H100 GPU on Lambda Labs is just $2/hr; I've been using one for the past few months, unlike the $12.29/hr on AWS shown in the slide.
I get it, it's still not cheap, but it's worth mentioning here.
You are right. We reported the AWS price there because it's the most popular option, and it wasn't practical to show pricing from every vendor. But yes, you can get them cheaper elsewhere, like from Lambda. Thanks for pointing it out.
Last time I tried, H100s were out of stock on Lambda.
@rankun203 They are available only in specific regions; mine is in Utah. I don't think they have expanded it. Plus, there is no storage available in this region, meaning if you shut down your instance, all data is lost.
Together AI is $1.40/hr on your own fine-tuned model :)
@Abraham_writes_random_code Predibase is cheaper than that
And I was under the delusion that I would be able to fine-tune the 70B param model on my 4090. Oh well...
I got a 40b model working on a 4090
@iukeay Did you fine-tune it, or just run inference?
70B param? hahaha.
Hello everyone, I would be so happy if the recorded video had captions/subtitles.
Right
it does, you just have to enable it! 😂
@dmf500 now it is enabled 😂
What's the music at the beginning? I can't shake it off.
❤❤❤
I ran it on a Colab T4 and still got "RuntimeError: CUDA out of memory". Anything else I can do, please?
@pieromolino_pb Does Ludwig allow you to download and deploy the fine-tuned model locally?
At 51:30 he says don't repeat the same prompt in the training data. What if I am fine-tuning the model on a single task but with thousands of different inputs for the same prompt?
It will cause overfitting. It would be similar to training an image classifier with 1000 pictures of roses and only one lily, then asking it to predict both classes with good accuracy. You want the data to have a balanced distribution across your problem space.
@PickaxeAI Did you come across a solution for this?
Can you give an example for the task? I'm trying to understand in what situation you'd need different completions for the same prompt
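One way to reconcile the two views above is to keep the task fixed but paraphrase the instruction across examples, so the model does not latch onto one literal prompt string. A toy sketch (all templates, texts, and keywords here are made up for illustration):
```python
# Toy sketch: vary the instruction wording while keeping the task fixed.
# Templates, texts, and keywords are invented for illustration only.
import random

templates = [
    "Extract the keywords from the following text:\n{text}",
    "List the most important keywords in this passage:\n{text}",
    "What are the key terms mentioned below?\n{text}",
]

def make_example(text: str, keywords: list) -> dict:
    prompt = random.choice(templates).format(text=text)
    return {"instruction": prompt, "output": ", ".join(keywords)}

print(make_example("Ludwig fine-tunes LLMs with QLoRA on a single GPU.",
                   ["Ludwig", "QLoRA", "fine-tuning", "single GPU"]))
```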
epochs=3 — since we are fine-tuning, would epochs=1 suffice?
It really depends on the dataset. Ludwig also has an early stopping mechanism where you can specify the number of epochs (or steps) without improvement before stopping, so you could set epochs to a relatively large number and let early stopping take care of not wasting compute time.
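In Ludwig's trainer config this corresponds to the `early_stop` parameter; a minimal sketch of just the trainer section (the values shown are arbitrary examples):
```python
# Sketch of a Ludwig trainer section that relies on early stopping:
# epochs is a generous upper bound, and training halts after `early_stop`
# consecutive evaluations without improvement on the validation metric.
trainer_config = {
    "type": "finetune",
    "epochs": 20,      # upper bound; early stopping usually ends sooner
    "early_stop": 3,   # example threshold, tune to your dataset
}
```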
How long did the entire training process take?
It depends on your hardware, dataset, and the hyperparameters you're manipulating. The training process is the longest phase in developing a model.
This seems to make a case for Apple Silicon for training. The M3 Max performs close to an RTX 3080 but with access to up to 192 GB of memory.
Did you try it on an Apple Silicon M1 Max?
Cool! ❤
When I run the code in Perform Inference, I frequently receive "ValueError: If `eos_token_id` is defined, make sure that `pad_token_id` is defined."
What should I do?
This is now fixed on Ludwig master!
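For anyone hitting this outside Ludwig: a common workaround with Hugging Face Transformers is to reuse the EOS token as the pad token (a sketch; the model name is only an example, and the Llama-2 weights are gated, so access must be granted first):
```python
# Workaround sketch: Llama-2's tokenizer ships without a pad token,
# so reuse EOS as PAD before calling generate().
from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "meta-llama/Llama-2-7b-hf"  # example; requires gated access
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name)

tokenizer.pad_token = tokenizer.eos_token
inputs = tokenizer("Fine-tuning tip:", return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=20,
                         pad_token_id=tokenizer.eos_token_id)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```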
Can you share the slides, please?