Need a heavy GPU machine? Check out this video on setting up an AWS EC2 GPU instance. If you liked this one, check out my video on setting up a full RAG API with Llama3, Ollama, LangChain and ChromaDB - ruclips.net/video/7VAs22LC7WE/видео.html
Bro ❤ great tutorial. Quick and easy
OMG!!! I freaking love you, I've been struggling with deployment on AWS with Llama and you've made it crystal clear. I'll do anything to support your channel. YOU'RE THE BEST!!!
Thanks for the comments
Cannot wait for part two with LangChain! This video was fantastic
What a simple way to set up an Ollama LLM with GPU support in only a few minutes, thanks!
Brilliant! It's that simple only because you explained it simply :). Thank you!
Thanks, glad you enjoyed it!
I'm not sure if you mentioned it in the video or not, but you need to allow traffic to port 11434 in the AWS security group.
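For anyone doing this from a script instead of the console, here's a rough boto3 sketch of the same change (the security group ID, region, and CIDR are placeholders, not from the video; scope the CIDR down to your own IP if you can):

```python
# Hypothetical example: open TCP port 11434 (Ollama's default port) on an
# EC2 security group using boto3. Group ID, region, and CIDR are placeholders.
import boto3

ec2 = boto3.client("ec2", region_name="us-east-1")

ec2.authorize_security_group_ingress(
    GroupId="sg-0123456789abcdef0",  # replace with your security group ID
    IpPermissions=[{
        "IpProtocol": "tcp",
        "FromPort": 11434,
        "ToPort": 11434,
        "IpRanges": [{"CidrIp": "0.0.0.0/0", "Description": "Ollama API"}],
    }],
)
```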
Good catch. Thanks
Thank you so much! Your video helped me a lot. I am looking forward to your next video.
Thanks a lot, man! Great video!
Glad you enjoyed it
Excellent. Thank you very much for sharing.
Thanks a lot for the video!!
Question: Is it possible to start the instance only when a request comes in to the server? It could be useful to limit costs.
I think it is feasible with Kubernetes and Docker, but I would enjoy a video about it :) !
Thanks again, very good video.
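Something like this rough boto3 sketch is what I have in mind - start the stopped instance only when a request arrives, then forward it to Ollama (the instance ID, host, and model are placeholders, not from the video; in practice you'd put this behind a small proxy or Lambda and stop the instance again after an idle timeout):

```python
# Rough sketch (not from the video): start a stopped EC2 instance on demand,
# wait for it to come up, then call the Ollama API on it. IDs and hosts are
# placeholders.
import boto3
import requests

INSTANCE_ID = "i-0123456789abcdef0"  # placeholder instance ID
OLLAMA_HOST = "http://your-ec2-public-dns:11434"  # placeholder host

ec2 = boto3.client("ec2", region_name="us-east-1")

def ensure_running() -> None:
    """Start the instance if it is not running and wait until it is."""
    instance = ec2.describe_instances(InstanceIds=[INSTANCE_ID])[
        "Reservations"][0]["Instances"][0]
    if instance["State"]["Name"] != "running":
        ec2.start_instances(InstanceIds=[INSTANCE_ID])
        ec2.get_waiter("instance_running").wait(InstanceIds=[INSTANCE_ID])

def ask(prompt: str) -> str:
    """Forward a prompt to Ollama, starting the instance first if needed."""
    ensure_running()
    resp = requests.post(
        f"{OLLAMA_HOST}/api/generate",
        json={"model": "llama3", "prompt": prompt, "stream": False},
        timeout=300,
    )
    resp.raise_for_status()
    return resp.json()["response"]

print(ask("Why is the sky blue?"))
```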
How do you add Open WebUI to it, and expose Open WebUI so it's accessible from a MacBook browser?
Thank you. This was helpful
Can you also use the Ubuntu 22.04 image and install CUDA etc. yourself? Why use this deep learning image?
I only select this AMI since it already has the other tools I need, like Python.
@@fastandsimpledevelopment If I understand correctly, you can select the base Ubuntu 22.04 image and install everything yourself: the NVIDIA driver, CUDA, TensorFlow, Python, etc.?
The video was awesome and pretty helpful, but can you cover the security point of view too? Anyone with the IP and port number can access it, so how can we avoid that?
How do you make it scalable?
By itself it is not. You need to add a front end like Nginx and then have several Ollama servers running behind it; that is the only way I am aware of today. There are new releases all the time, so keep track of Ollama updates.
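To make the round-robin idea concrete, here is a toy Python sketch of what the front end would be doing across several Ollama servers (the backend URLs are made up; in production Nginx or an AWS load balancer would handle this):

```python
# Toy sketch of the load-balancing idea above: round-robin requests across
# several Ollama servers. Backend URLs are placeholders; a real deployment
# would let Nginx or an ALB do this.
import itertools
import requests

BACKENDS = itertools.cycle([
    "http://10.0.0.11:11434",  # placeholder Ollama server 1
    "http://10.0.0.12:11434",  # placeholder Ollama server 2
])

def generate(prompt: str, model: str = "llama3") -> str:
    """Send the prompt to the next Ollama backend in the rotation."""
    backend = next(BACKENDS)
    resp = requests.post(
        f"{backend}/api/generate",
        json={"model": model, "prompt": prompt, "stream": False},
        timeout=300,
    )
    resp.raise_for_status()
    return resp.json()["response"]

print(generate("Hello from a load-balanced Ollama!"))
```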
So Ollama detects and uses the GPU automatically?
Yes, if the OS has support and you have an AMD or NVIDIA GPU installed; the latest version does auto-detect. You can also set it to NOT use the GPU in the Ollama config, but by default it auto-detects.
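For what it's worth, you can also control this per request rather than in the config: the Ollama API accepts a num_gpu option (the number of layers offloaded to the GPU), and setting it to 0 keeps the model on the CPU. A small sketch, assuming Ollama is listening on localhost with the llama3 model pulled:

```python
# Small sketch (assumes Ollama running locally with the llama3 model):
# force a CPU-only generation by setting num_gpu to 0, i.e. offload zero
# layers to the GPU for this one request.
import requests

resp = requests.post(
    "http://localhost:11434/api/generate",
    json={
        "model": "llama3",
        "prompt": "Say hi from the CPU.",
        "stream": False,
        "options": {"num_gpu": 0},  # 0 GPU layers => CPU only
    },
    timeout=300,
)
print(resp.json()["response"])
```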
@@fastandsimpledevelopment It detects only NVIDIA GPUs. I tried AWS g4ad (AMD) and g4dn.xlarge (NVIDIA); only the latter worked. This is FYI.
@@adityanjsg99 Thanks for your input. I have not tried anything other than NVIDIA GPUs. I've finally decided to get a few 4090 boards and see how they run. I'm trying to build an on-prem system since there is no affordable cloud solution. I'll expose the LLM API via ngrok, which is not what I wanted :(
good vid!
thanks buddy