Unlocking Local LLMs with Quantization - Marc Sun, Hugging Face
- Published: Nov 3, 2024
This talk will share the story of quantization, its rise in popularity, and its current status in the open-source community. We'll begin by reviewing key quantization papers, such as QLoRA by Tim Dettmers and GPTQ by Elias Frantar. Next, we'll demonstrate how quantization can be applied at various stages of model development, including pre-training, fine-tuning, and inference. Specifically, we'll share our experience in pre-training a 1.58-bit model, show how fine-tuning is achievable using PEFT + QLoRA, and discuss optimizing inference performance with torch.compile or custom kernels. Finally, we'll highlight efforts within the community to make quantized models more accessible, including how the transformers library incorporates state-of-the-art quantization schemes and how to run GGUF models from llama.cpp.
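To make the core idea concrete before the talk's details: quantization maps floating-point weights to low-bit integers plus a scale factor, trading a small amount of precision for large memory savings. The sketch below is a minimal, illustrative absmax (symmetric) int8 quantizer written with NumPy; it is not taken from any of the papers or libraries mentioned above, and the function names are our own.

```python
import numpy as np

def quantize_absmax(x: np.ndarray, bits: int = 8):
    """Symmetric absmax quantization: scale so the largest |value| maps to qmax."""
    qmax = 2 ** (bits - 1) - 1          # e.g. 127 for int8
    scale = np.max(np.abs(x)) / qmax    # one scale per tensor (per-tensor quantization)
    q = np.clip(np.round(x / scale), -qmax - 1, qmax).astype(np.int8)
    return q, scale

def dequantize(q: np.ndarray, scale: float) -> np.ndarray:
    """Recover an approximation of the original floats."""
    return q.astype(np.float32) * scale

# Quantize a toy weight vector and measure the reconstruction error.
weights = np.array([0.5, -1.0, 0.25, 1.0], dtype=np.float32)
q, scale = quantize_absmax(weights)
recovered = dequantize(q, scale)
max_error = np.max(np.abs(weights - recovered))
```

Schemes like GPTQ and the NF4 data type used by QLoRA are far more sophisticated (error-compensating rounding, non-uniform quantiles, per-block scales), but they all build on this basic map-to-integers-plus-scale idea.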