Scaling PyTorch Model Training With Minimal Code Changes

  • Published: 11 Sep 2024
  • Sebastian's books: sebastianrasch...
    Code examples: github.com/ras...
    In this short tutorial, I will show you how to accelerate the training of LLMs and Vision Transformers with minimal code changes using open-source libraries.
    ---
    To support this channel, please consider purchasing a copy of my books: sebastianrasch...
    ---
    x.com/rasbt
    / sebastianraschka
    magazine.sebas...

Comments • 13

  • @stanislawcronberg3271 · 1 year ago · +1

    Love the straightforward video, didn't know about Fabric for quickly upgrading existing PyTorch code

  • @user-wr4yl7tx3w · 1 year ago · +1

    Wow, great presentation.

  • @dinabandhub · 1 year ago · +1

    Great tutorial. 🎉

  • @user-wr4yl7tx3w · 1 year ago

This is really awesome content.

  • @hamzawi2752 · 1 year ago · +1

    Thank you so much. Impressive presentation! Do you think it is worth learning Lightning? I am a PhD student and I am comfortable with PyTorch. Does Lightning have all the capabilities that PyTorch has? I know that Lightning is to PyTorch what Keras is to TensorFlow.

    • @SebastianRaschka · 1 year ago · +1

      Good question @hamzawi2752. Fabric (covered in this video) is basically an add-on to PyTorch. It's mainly useful for tapping into more advanced features like multi-GPU training, mixed-precision training, etc. with minimal code changes. It's essentially a wrapper around PyTorch features, but doing the same in pure PyTorch is definitely more work. So, I'd say it's worth it.
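      As a sketch of what "minimal code changes" means here: the advanced features are mostly toggled through Fabric's constructor arguments. The CPU values below are chosen so the snippet runs anywhere; the GPU values in the comment are illustrative.

      ```python
      from lightning.fabric import Fabric

      # Advanced features are selected via constructor arguments.
      # On a multi-GPU machine you would use e.g.
      # accelerator="cuda", devices=4, strategy="fsdp".
      fabric = Fabric(
          accelerator="cpu",
          devices=1,
          precision="bf16-mixed",  # mixed-precision training
      )
      ```

      The training loop itself does not change when these arguments do.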

  • @nguyenhuuuc2311 · 1 year ago · +1

    Thanks so much for the tutorial, I learned a lot from you!
    I have a question: what modifications should be made to fabric.setup(model, optimizer) if I use a learning rate scheduler?

    • @nguyenhuuuc2311 · 1 year ago · +1

      And just some personal feedback on an awesome tutorial: it would be great if you could include a gentle reminder that running code on multiple GPUs often requires launching a script rather than executing cells directly in a notebook. Sorry if I missed this information already being mentioned in the tutorial.

    • @SebastianRaschka · 1 year ago

      Thanks, and great question. Since regular schedulers don't have any learnable parameters, you can use them as usual (no need to pass them to fabric.setup). But using fabric.setup also doesn't hurt. I added a quick example here: github.com/rasbt/cvpr2023/blob/main/07_fabric-vit-mixed-fsdp-with-scheduler.py
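      Following that answer, a sketch of using a standard PyTorch scheduler alongside Fabric: the scheduler is created from the plain optimizer before fabric.setup and stepped as usual. The model, data, and scheduler settings here are placeholders.

      ```python
      import torch
      from lightning.fabric import Fabric

      model = torch.nn.Linear(10, 2)  # placeholder model
      optimizer = torch.optim.SGD(model.parameters(), lr=0.1)

      # The scheduler stays plain PyTorch; it wraps the optimizer
      # created above and does not go through fabric.setup().
      scheduler = torch.optim.lr_scheduler.StepLR(optimizer, step_size=1, gamma=0.5)

      fabric = Fabric(accelerator="cpu", devices=1)
      fabric.launch()
      model, optimizer = fabric.setup(model, optimizer)

      X, y = torch.randn(8, 10), torch.randint(0, 2, (8,))
      for epoch in range(2):
          optimizer.zero_grad()
          loss = torch.nn.functional.cross_entropy(model(X), y)
          fabric.backward(loss)
          optimizer.step()
          scheduler.step()  # called as in plain PyTorch

      print(scheduler.get_last_lr())  # lr was halved after each epoch
      ```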

    • @SebastianRaschka · 1 year ago

      @nguyenhuuuc2311 Good point. Yeah, notebook (or interactive) environments are generally incompatible with multi-GPU training due to their multiprocessing limitations. Hah, I take it for granted these days, but it's definitely a good idea to mention it as a reminder!

    • @nguyenhuuuc2311 · 1 year ago · +1

      @SebastianRaschka Thanks for spending time on my question and the quick answer with a notebook ❤