A Journey with Llama3 LLM: Running Ollama on Dell R730 Server with Nvidia P40 GPU & Web UI Interface

  • Published: 26 Oct 2024

Comments • 27

  • @kenpark4783
    @kenpark4783 5 months ago +1

    I subscribed. My 730 is on the way now and I have my 2 P40s sitting on my desk. I was amazed to find your videos of the same setup I have planned. Thanks for doing these. So far it has been really informative. I'm a Windows guy with only limited exposure to Ubuntu so far, so it has been really nice having a sanity check for that as well.

    • @MukulTripathi
      @MukulTripathi  5 months ago +1

      You'll love the setup. If your 730 has an enterprise iDRAC license, you'll be able to remote start/stop it as well. The cables that go into the P40 and riser are usually cheaply made, so that's something you'll have to keep in mind too.

    • @kenpark4783
      @kenpark4783 5 months ago

      @@MukulTripathi I've been researching the power cable issues and am going to pick a proven vendor if I can't find genuine OEM. I've seen a few references posted on Reddit. Anyway, I'm looking forward to watching the rest of the series. Thanks again!

    • @MukulTripathi
      @MukulTripathi  5 months ago

      I'm glad there's an audience for this stuff! I'm planning to make a series on the research work behind the AI articles I've published.

  • @ThomasBattle-fh8gg
    @ThomasBattle-fh8gg 5 months ago

    So I'm building this system right now and your video is a godsend. Thank you.

  • @QuantumLeapEvent
    @QuantumLeapEvent 5 months ago

    Subscribed! Great stuff, and it inspired me to make a similar setup with an r730xd (after I figure out a way to remove what seems to be a fixed-in-place dual-drive SSD rear flex bay to make space for the 2nd GPU). How are you keeping the system cool and the fan noise minimal with two GPUs and an LLM running? Or what do you suggest for keeping the system as cool as possible? Glad I found you! Thank you for making these videos!

  • @renobodyrenobody
    @renobodyrenobody 13 days ago

    Ok, thanks. More or less what I was experimenting with, and it is nice to see your explanations. That said, I think the video lacks a comparison with using the CPU instead of your GPU. Thanks anyway.

    • @MukulTripathi
      @MukulTripathi  13 days ago +1

      I have a bunch of videos that you'd like if you're interested in server builds. LLMs love CUDA cores, so I build these with Nvidia GPUs. I agree I could have done some comparisons with CPUs, for sure!

  • @saniel_cz
    @saniel_cz 3 months ago

    How many tokens per second did you achieve during inference? I am thinking about buying a P40 because of the VRAM, but I would like to know how fast its inference is. I can't find this anywhere, so your insight would be greatly appreciated :)
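[Editor's note] For anyone wanting to measure this themselves: Ollama's generate API reports `eval_count` (tokens produced) and `eval_duration` (nanoseconds), from which tokens/sec follows directly. A minimal sketch, assuming an Ollama server on its default port `localhost:11434` with a `llama3` model pulled:

```python
# Sketch: measure tokens/sec from Ollama's /api/generate response,
# which includes eval_count (tokens) and eval_duration (nanoseconds).
import json
import urllib.request

def tokens_per_second(eval_count: int, eval_duration_ns: int) -> float:
    """Convert Ollama's token count and nanosecond duration to tokens/sec."""
    return eval_count / (eval_duration_ns / 1e9)

def benchmark(prompt: str, model: str = "llama3") -> float:
    """Send one non-streaming generate request and return tokens/sec.
    Assumes an Ollama server is running at the default local address."""
    body = json.dumps({"model": model, "prompt": prompt, "stream": False}).encode()
    req = urllib.request.Request("http://localhost:11434/api/generate", data=body)
    resp = json.load(urllib.request.urlopen(req))
    return tokens_per_second(resp["eval_count"], resp["eval_duration"])

if __name__ == "__main__":
    print(f"{benchmark('Why is the sky blue?'):.1f} tokens/sec")
```

Running `ollama run llama3 --verbose` in a terminal prints a similar "eval rate" figure without any code.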

  • @stamy
    @stamy 6 months ago

    Very nice video!
    What happens when you download the 70b Llama LLM file and try to put this 40GB file into the 24GB of VRAM on your GPU? Does it work?
    I am asking because I tried to use the 70b LLM on my CPU, and despite having 32GB of RAM it was not enough :)

    • @MukulTripathi
      @MukulTripathi  6 months ago +2

      I'll address this in the next video :) Essentially, you need two P40 GPUs to fit a 70b model. Two P40s give you 48GB of VRAM, and Llama3 70b, at about 40GB, fits on them perfectly.
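[Editor's note] The sizing above can be checked with back-of-the-envelope arithmetic: a quantized model needs roughly (parameters × bits per weight ÷ 8) bytes for weights, plus some headroom for the KV cache and CUDA buffers. A rough sketch, assuming Ollama's default llama3:70b is ~4-bit quantized and a notional 4GB of overhead:

```python
# Rough VRAM estimate for a quantized model.
# Assumptions: ~4.5 effective bits per weight for a Q4-class quant,
# plus a notional 4GB of overhead for KV cache and CUDA buffers.

def model_vram_gb(params_billion: float, bits_per_weight: float,
                  overhead_gb: float = 4.0) -> float:
    """Rough GB of VRAM: weights (params * bits / 8) plus fixed overhead."""
    weights_gb = params_billion * bits_per_weight / 8
    return weights_gb + overhead_gb

# 70B at ~4.5 bits -> ~39GB of weights plus overhead: fits across
# two 24GB P40s (48GB total), but not on a single card.
print(model_vram_gb(70, 4.5))  # prints 43.375
```

The same formula explains why an 8b model runs comfortably on one P40 while 70b needs the pair.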

    • @wlgt3257
      @wlgt3257 6 months ago

      @@MukulTripathi looking forward to it man.

    • @MukulTripathi
      @MukulTripathi  6 months ago

      I am almost done with it. It'll be coming soon :)

  • @sridevmisra
    @sridevmisra 6 months ago

    Informative video!!

  • @danv8086
    @danv8086 4 months ago

    Is this using 1 or 2 GPUs? I did a CPU-only experiment, which was unusable for anything practical, but I've been looking at P40s and P100s.

    • @MukulTripathi
      @MukulTripathi  4 months ago

      I have done two videos: one with a single GPU and another with a dual-GPU setup. With dual GPUs it uses them both.

  • @jacksonpham2974
    @jacksonpham2974 2 months ago

    How do you pass the P40 through to an Ubuntu VM?

    • @MukulTripathi
      @MukulTripathi  2 months ago

      I used ESXi's passthrough feature for it.
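[Editor's note] Once ESXi passthrough is configured, a quick sanity check inside the Ubuntu VM is to confirm the P40 shows up on the PCI bus before installing drivers. A small sketch (the helper and sample output line are illustrative, not from the video):

```python
# Sketch: verify a passed-through NVIDIA GPU is visible inside the VM
# by scanning `lspci` output for NVIDIA controller entries.
import subprocess

def find_nvidia_devices(lspci_output: str) -> list[str]:
    """Return the lspci lines that mention an NVIDIA controller."""
    return [line for line in lspci_output.splitlines() if "NVIDIA" in line]

if __name__ == "__main__":
    out = subprocess.run(["lspci"], capture_output=True, text=True).stdout
    for dev in find_nvidia_devices(out):
        print(dev)
```

If the P40 appears (as a "3D controller"), the driver and CUDA toolkit can be installed; `nvidia-smi` should then report the card.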

  • @mk.host.here.
    @mk.host.here. 4 months ago

    How do you pass the GPU through to a virtual machine?

    • @MukulTripathi
      @MukulTripathi  4 months ago +1

      I show a step-by-step installation and GPU passthrough guide in ESXi in my previous videos. There are two of them; they are an hour long, but jam-packed with information.

    • @mk.host.here.
      @mk.host.here. 4 months ago

      @@MukulTripathi thanks

    • @jacksonpham2974
      @jacksonpham2974 1 month ago

      @@MukulTripathi Can you please give me your video links for that? I need to see the GPU and CUDA for the P40 in an Ubuntu VM.

    • @MukulTripathi
      @MukulTripathi  1 month ago

      Here is the server build playlist:
      ruclips.net/p/PLteHam9e1Fecmd4hNAm7fOEPa4Su0YSIL
      Here is the video in that playlist covering the ESXi VM setup you're looking for:
      ruclips.net/video/BO5YPIToJKo/видео.html

  • @callmebigpapa
    @callmebigpapa 3 months ago

    I have the same setup! Lik'd and Sub'd to see where this channel goes!