What is Post Training Quantization - GGUF, AWQ, GPTQ - LLM Concepts ( EP - 4 )

  • Published: 27 Oct 2024

Comments • 9

  • @har111100 • 2 months ago

    awesome playlist. Keep posting.

  • @sqlsql4304 • 6 months ago

    Well explained. A lot of concepts got cleared up.

  • @mitejmadan8672 • 6 months ago

    Hey Akhil, could you make this course in such a way that after completing it one can at least apply for an internship at your company?
    If not, then at least make a roadmap mentioning all the keywords one can search for and learn about on the internet. Since I am a full-stack developer, I don't have much idea of the AI landscape.

  • @theuniversityofthemind6347 • 14 days ago

    Hi Akhil, hoping you can help. I have an Alienware m18 R2 with an Intel i9-14900HX, NVIDIA RTX 4090 (24 GB), 64 GB RAM, and 8 TB storage. For extra information, I don't plan to use this for highly intensive tasks like model training; I will mainly be using it for analysing my business documents and writing elaborate 20-minute stories based on a five-step story structure. I wanted to use the 70B model to generate the best possible results for these smaller, less intensive tasks. Based on my system specs, which optimisation method would you recommend: GPTQ, GGUF, or AWQ? And would you have any additional advice on the best way to optimise based on my use-case requirements?

    • @AkhilSharmaTech • 13 days ago

      @theuniversityofthemind6347 Hey bro, I don't recommend doing anything on a personal PC until you know what you're doing.
      Try doing everything on an AWS instance first.

    • @theuniversityofthemind6347 • 13 days ago

      @AkhilSharmaTech Thanks for the reply, Akhil. That's understandable. OK, do you at least think my system would be powerful enough to run a Llama 70B model for the described tasks?

    • @AkhilSharmaTech • 13 days ago

      @theuniversityofthemind6347 Even though the system is quite powerful, I'd still suggest using a 7B model max on your local system.
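
    The advice above (a 70B model won't fit in 24 GB of VRAM even when quantized, while a 7B model will) can be sanity-checked with a back-of-the-envelope memory estimate. This is a rough sketch, not something from the video; the bits-per-weight and overhead figures are typical assumptions for 4-bit GGUF/GPTQ/AWQ-style quantization, not exact values.

    ```python
    # Rough VRAM estimate for running a quantized LLM locally.
    # Assumption: ~4 bits per weight for Q4-style quantization, plus a
    # flat overhead for the KV cache and activations (varies with context).

    def vram_gb(params_billions: float, bits_per_weight: float,
                overhead_gb: float = 2.0) -> float:
        """Approximate memory needed: weight storage + flat runtime overhead."""
        weight_gb = params_billions * bits_per_weight / 8  # 1e9 params * bits / 8 bits-per-byte / 1e9 bytes-per-GB
        return weight_gb + overhead_gb

    # 70B at 4-bit needs ~35 GB for weights alone -- over a 24 GB RTX 4090's limit.
    print(f"70B @ 4-bit: ~{vram_gb(70, 4):.0f} GB")   # ~37 GB
    # 7B at 4-bit fits comfortably.
    print(f"7B  @ 4-bit: ~{vram_gb(7, 4):.1f} GB")    # ~5.5 GB
    ```

    With GGUF and llama.cpp one can partially offload layers to CPU RAM, so a 70B model may technically run on this machine, but token generation would be very slow, which is consistent with the 7B recommendation.
    
    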

  • @rahuldebdas2374 • 4 months ago

    Very fast explanation. Please be slower next time; it's hard to follow.

    • @AkhilSharmaTech • 4 months ago +1

      Will do. Also, try slowing the video down to 0.5x if possible.