Glad to see your video! I'm trying to use my social media chats to train my own 'digital twin', and I'm currently wondering whether 96GB on an M2 Max is enough for my needs, because I want to run the model training and deployment locally. If this plan is feasible, I may also do some local model training in the future on other, more sensitive data, instead of uploading my data to GPTs. The 16GB of memory on the M2 Pro I'm currently using doesn't seem to support this idea very well.
I would prefer buying a Mac that's enough for your normal coding and stuff, and running the AI training on cloud servers instead of your own device. You'll save a lot of money, since you might hardly ever need to train it!
Just ignore my comment if you really need to train large AI models frequently. Otherwise, you can consider my suggestion...
I don't see a convert-pth-to-ggml.py file anywhere in the llama.cpp repository. Was it recently removed? I can't proceed at all, so I'd appreciate any help.
Thanks for your question. Apparently they have recently changed this in the llama.cpp project, and now they have a more general script called convert.py that can take different weight file formats as input and convert any of them to ggml. You can check the details in the llama.cpp GitHub readme, but it should work by running python convert.py (I haven't tried it myself, though).
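For what it's worth, the invocation would probably look something like this (the model path is a placeholder and the flag is my guess from the README; I haven't run it myself, so check the repo for the current usage):

```shell
# Hypothetical usage -- see the llama.cpp README for the exact, current flags.
# Point convert.py at the directory holding the original Llama weights;
# it writes out a single ggml file next to them:
python convert.py models/7B/ --outtype f16
```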
@@enricd So, what should the code look like?
Apologies if I missed it, but did the GPU get used, and if so was shared memory useful? I'm wondering if I should get a Mac Mini with max RAM to run in GPU mode.
Hey no worries! At the end of the video I showed the GPU monitor graph and the CPU one, and everything related to the LLM is running only on the CPU. The GPU is only used for other apps like the screen recording software and so on.
How much RAM does your MacBook have?
24GB, but it was barely using 8GB while running it, with some Chrome tabs open and the screen recording software.
@@enricd With the 13B model?
@@human-pl7kx yes, you can check at the end of this video where I showed the Mac's Activity Monitor with the RAM around 8-9GB: ruclips.net/video/T4mJcz7dRvE/видео.html
@@enricd I cannot run Llama 2 13B on a Mac with 8GB. Looks like I ran out of memory.
@@human-pl7kx Oh interesting... and does it work with the 7B version? Do you have any other apps open using RAM apart from llama.cpp?
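As a rough sanity check for this kind of question, you can estimate the memory a quantized model needs from parameter count times bits per weight. This is only a back-of-envelope sketch (real usage is higher, since llama.cpp also needs RAM for the KV cache, activations, and your other open apps):

```python
def est_model_gb(n_params_billion: float, bits_per_weight: float) -> float:
    """Rough model file size: params * bits / 8 bytes, expressed in GB.

    Ignores KV cache, activations, and runtime overhead, so treat the
    result as a lower bound on the RAM actually needed.
    """
    return n_params_billion * 1e9 * bits_per_weight / 8 / 1e9

# 13B at 4-bit quantization: ~6.5 GB just for the weights,
# which is tight on an 8GB Mac once macOS and other apps are counted.
print(round(est_model_gb(13, 4), 1))  # -> 6.5

# 7B at 4-bit: ~3.5 GB, which fits much more comfortably in 8GB.
print(round(est_model_gb(7, 4), 1))   # -> 3.5
```

This lines up with the thread: the 7B model leaving RAM usage around 8-9GB total on a 24GB machine is plausible, while 13B on an 8GB Mac runs out of memory.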