New Tutorial on LLM Quantization w/ QLoRA, GPTQ and llama.cpp, Llama 2

  • Published: 21 Aug 2024
  • LLM Quantization: GPTQ - AutoGPTQ
    llama.cpp - ggml.c - GGUF - C++
    Comparison with HF Transformers in 4-bit quantization.
    Download Web UI wrappers for your heavily quantized LLM to your local machine (PC, Linux, Apple).
    LLM on Apple hardware, w/ M1, M2 or M3 chip.
    Run inference with your LLMs on your local PC, with heavy quantization applied (see the inference sketch after the links below).
    Plus: 8 Web UIs for GPTQ, llama.cpp or AutoGPTQ, ExLlama or GGUF
    koboldcpp
    oobabooga text-generation-webui
    ctransformers
    lmstudio.ai/
    github.com/mar...
    github.com/gge...
    github.com/rus...
    huggingface.co...
    github.com/Pan...
    cloud.google.c...
    huggingface.co...
    h2o.ai/platfor...
    #quantization
    #ai
    #webui
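    A minimal local-inference sketch, assuming the llama-cpp-python bindings (pip install llama-cpp-python) and an already downloaded 4-bit GGUF checkpoint; the model path below is a placeholder, not a file shipped with the video:

    # Sketch: run a heavily quantized GGUF model locally via llama-cpp-python.
    from llama_cpp import Llama

    llm = Llama(
        model_path="./models/llama-2-7b-chat.Q4_K_M.gguf",  # placeholder path to a 4-bit GGUF file
        n_ctx=2048,        # context window size
        n_gpu_layers=-1,   # offload all layers to Metal/CUDA if available; 0 = CPU only
    )

    output = llm(
        "Q: What does 4-bit quantization change about a model? A:",
        max_tokens=128,
        stop=["Q:"],
    )
    print(output["choices"][0]["text"])

    On Apple Silicon (M1/M2/M3), llama.cpp uses its Metal backend, which is why n_gpu_layers can still be set above zero on a MacBook.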

Comments • 22

  • @jacehua7334
    @jacehua7334 11 months ago +1

    Have been busy with work, but it's so great on the weekend to see absolutely great content from you like always!

  • @ctejada-0
    @ctejada-0 11 months ago +2

    Happy to see llama.cpp taking off. Since the beginning of this new wave of AI driven by LLM advancements, I've been rooting for llama.cpp, as it is (in my opinion) the best approach to enable everyone to have their own LLM and to enable a plethora of software solutions (open and closed source) that were never possible before. Thank you for this video focused on it.

    • @code4AI
      @code4AI  11 months ago +1

      Thank you for your comment. Maybe I'll do another video on the latest llama.cpp ...

  • @ViktorFerenczi
    @ViktorFerenczi 11 months ago +2

    Excellent video, as always! Thank you. - It would be nice to have a video comparing AWQ with the quantization methods discussed here.

    • @code4AI
      @code4AI  11 months ago +1

      Activation-aware Weight Quantization (AWQ)? Great idea!

  • @henkhbit5748
    @henkhbit5748 11 months ago

    Great explanation of the different quantization methods. It would be nice if we could compare, for example, Llama 2 7B models in normal, QLoRA 4-bit, GPTQ 4-bit, and GGUF 4-bit formats with different inference questions, with and without RAG...

  • @hoangnam6275
    @hoangnam6275 11 months ago

    You're the best, best content every week

  • @akashkarnatak3014
    @akashkarnatak3014 11 months ago +1

    Okay, so GPTQ is a quantization technique and GGUF is a format to store quantized weights. Can't we quantize a model using the GPTQ algorithm, store it in GGUF format, and run it using llama.cpp?

    • @junzhengge407
      @junzhengge407 5 months ago

      I have the same question😢 need help

  • @ChrisBrock-mh8qq
    @ChrisBrock-mh8qq 6 months ago

    Really Great Videos!

  • @AK-ox3mv
    @AK-ox3mv 5 months ago

    What does the K mean in Q4_K_M?
    What's the difference between q4 and 4-bit? Are they the same thing?

  • @amparoconsuelo9451
    @amparoconsuelo9451 11 months ago

    Can subsequent SFT and RLHF with different, additional, or less content change the character of, improve, or degrade a GPT model?

  • @devyanshrastogi
    @devyanshrastogi 9 months ago +1

    Trust me, after 20 seconds of your intro I was about to skip this video 🤣🤣 The intro was terrific (literally).

  • @spencerfunk6697
    @spencerfunk6697 6 months ago

    need a tutorial on quantizing vision models

  • @yusufkemaldemir9393
    @yusufkemaldemir9393 11 months ago

    Thanks. Does a 4-bit quantized Llama 2 under llama.cpp support backpropagation while running on an M2 MacBook? If yes, would you mind providing a reference notebook?

    • @surajrajendran6528
      @surajrajendran6528 5 months ago

      Quantised models cannot be back-propagated. All training should be done in floating point precision.
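      A minimal sketch of the QLoRA-style workaround from the video title: the 4-bit base weights stay frozen while small LoRA adapters are kept in floating point and receive the gradients. It assumes transformers, peft, and bitsandbytes are installed; the checkpoint name is only an example, and bitsandbytes 4-bit loading generally requires a CUDA GPU, so this does not run on an M2 MacBook as-is:

      # QLoRA-style setup: gradients flow only into the float LoRA adapters,
      # never into the frozen 4-bit base weights.
      import torch
      from transformers import AutoModelForCausalLM, BitsAndBytesConfig
      from peft import LoraConfig, get_peft_model

      bnb_config = BitsAndBytesConfig(
          load_in_4bit=True,
          bnb_4bit_quant_type="nf4",             # NormalFloat4 quantization
          bnb_4bit_compute_dtype=torch.bfloat16,
      )

      base = AutoModelForCausalLM.from_pretrained(
          "meta-llama/Llama-2-7b-hf",            # example checkpoint (gated, needs access)
          quantization_config=bnb_config,
          device_map="auto",
      )

      lora_config = LoraConfig(
          r=16, lora_alpha=32,
          target_modules=["q_proj", "v_proj"],   # attach adapters to attention projections
          lora_dropout=0.05, task_type="CAUSAL_LM",
      )
      model = get_peft_model(base, lora_config)
      model.print_trainable_parameters()         # only the adapter weights are trainable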

  • @gileneusz
    @gileneusz 11 months ago

    0:08 oh... so maybe I'll watch your next video, sorry....

    • @code4AI
      @code4AI  11 months ago +1

      You are the lucky one ...

    • @gileneusz
      @gileneusz 11 months ago +1

      @@code4AI no, no that's just my dream 😢

  • @ernestoflores3873
    @ernestoflores3873 3 months ago