How to Convert/Quantize Hugging Face Models to GGUF Format | Step-by-Step Guide

  • Published: 4 Jan 2025

Comments • 7

  • @am2dan
    @am2dan 3 months ago +6

    A couple of differences with the version of llama.cpp as of Sept. 2024:
    - convert.py is replaced by convert_hf_to_gguf.py, which does not take the --vocab-type argument.
    - quantize is now llama-quantize.
    Otherwise, I think everything else is the same. Thanks for this video. It clearly presents exactly the info I was looking for.
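    For reference, the updated flow on a recent llama.cpp checkout looks roughly like this (a sketch; the model directory and output file names are placeholders to swap for your own):

        # Convert a downloaded Hugging Face model directory to GGUF at f16 precision
        python convert_hf_to_gguf.py ./my-hf-model --outfile my-model-f16.gguf --outtype f16

        # Quantize the f16 GGUF down to 4-bit (Q4_K_M is one common choice)
        ./llama-quantize my-model-f16.gguf my-model-Q4_K_M.gguf Q4_K_M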

  • @LBPFrost
    @LBPFrost 6 months ago +2

    Actually glad I watched this video.
    Simple enough that I got it set up twice.
    Cheers mate.
    I just had to run make instead of curl, though (after cd'ing into the repo).
    Ubuntu on WSL2, Windows.
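    For anyone hitting the same thing, the build-from-source route is roughly this (a sketch, assuming an older llama.cpp checkout where the Makefile is still present; newer versions build with CMake instead):

        # Clone and build llama.cpp on Ubuntu (the same steps work under WSL2)
        git clone https://github.com/ggerganov/llama.cpp
        cd llama.cpp
        make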

    • @DigiDecode
      @DigiDecode  6 months ago

      I am glad you found the video useful, thank you & cheers 👍🏻

  • @aaronchantrill7338
    @aaronchantrill7338 24 days ago

    I usually just use git with git-lfs to download the whole folder
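    A minimal sketch of that approach (the repo URL is a placeholder for whichever model you want to pull):

        # Install the large-file hooks once, then clone the full model repo
        git lfs install
        git clone https://huggingface.co/<org>/<model-name>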

  • @karavanamektuplar197
    @karavanamektuplar197 3 months ago

    Hi. How can I convert a language model from PyTorch to ggml on the Hugging Face site? There is a tutorial on ggerganov's GitHub page under the title "convert-pt-to-ggml.py", but I don't fully understand it. I want to convert a Chinese language model to ggml and use it in Subtitle Edit. Can you make a video tutorial for this?
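    For context, convert-pt-to-ggml.py lives in the ggerganov/whisper.cpp repo, and its usual invocation looks roughly like this (a sketch, assuming the model in question is a Whisper-style PyTorch checkpoint, since Subtitle Edit consumes whisper.cpp ggml models; all paths here are placeholders):

        # Convert a Whisper PyTorch checkpoint to ggml format
        # (run from a whisper.cpp checkout; also needs a local clone of openai/whisper)
        python models/convert-pt-to-ggml.py ~/.cache/whisper/medium.pt /path/to/openai-whisper ./output-dir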