My question is: why would you use RunPod and still pay their rate when you could just throw a Llama 405B model (or whatever model) onto an AWS server and deploy it yourself, only being charged for hosting that AWS server? That would probably be cheaper, and is probably what RunPod is doing anyway.
Generally they are doing the same thing, yes. I made this video on RunPod because it is a little simpler to set up compared to AWS, which can be daunting to some people, but I don't disagree with you. I haven't run the numbers to compare, though! Just be careful: the better the model, the higher the cost.
How do RunPod serverless and pods differ in this use case, e.g. in costs? And how can we minimize our costs, e.g. by stopping the pod after usage?
Well, the idea is that if you don't have a local machine that can run models well (if at all), then depending on the model you need, you can 'rent' a cheap server on this platform. The one in my example costs $0.79 per hour while it's up and running; if I stop it, it says it costs $0.0006 per hour. So the cost of holding onto it until you want to run it again, without actually TERMINATING it, is minimal. I will look into scheduling the servers (if it's possible), so that, like in AWS, you can have one run for a certain amount of time per day.
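To make the stop-vs-terminate economics concrete, here is a back-of-the-envelope sketch using the two rates quoted above ($0.79/hr running, $0.0006/hr stopped); the 2-hours-a-day usage pattern is just an illustrative assumption:

```python
# Rough cost comparison for a pod used a couple of hours a day and
# stopped (not terminated) the rest of the time. Rates from the example
# above -- your pod's rates will differ.
RUN_RATE = 0.79     # $/hour while the pod is running
STOP_RATE = 0.0006  # $/hour while stopped (you only pay to hold it)

hours_running_per_day = 2  # assumed usage pattern
daily_cost = (hours_running_per_day * RUN_RATE
              + (24 - hours_running_per_day) * STOP_RATE)
always_on_cost = 24 * RUN_RATE

print(f"stop when idle: ~${daily_cost:.2f}/day (~${daily_cost * 30:.2f}/month)")
print(f"always on:      ~${always_on_cost:.2f}/day (~${always_on_cost * 30:.2f}/month)")
# stop when idle: ~$1.59/day (~$47.80/month)
# always on:      ~$18.96/day (~$568.80/month)
```

So stopping the pod between sessions is where almost all the savings come from.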
Can you do a new guide for text-generation-webui as well, please? TheBloke doesn't work anymore.
When you close the terminal tab, the model stops running. That is not cool; the endpoint should keep working unless I choose to shut down the pod.
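That usually happens because the server process is a child of the terminal session and gets killed when the tab closes. One common workaround (my assumption about how it was started, not something shown in the video) is to launch it detached in its own session, e.g. via `tmux`/`nohup`, or from Python; the `ollama serve` command below is a placeholder for whatever you actually run:

```python
import os
import subprocess

def start_detached(cmd):
    """Launch cmd in its own session so it survives the terminal closing."""
    log = open("server.log", "ab")  # capture output since there's no terminal
    return subprocess.Popen(
        cmd,
        stdout=log,
        stderr=subprocess.STDOUT,
        start_new_session=True,  # new session: not killed with the shell
    )

# Gate behind an env var so importing this file doesn't spawn anything.
if os.environ.get("START_SERVER"):
    proc = start_detached(["ollama", "serve"])  # placeholder command
    print("detached server PID:", proc.pid)
```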
Since you can run a Python file there on RunPod, I'm assuming you can also serve a Gradio UI from there? Kind of like in your YouTube service video. I really appreciate all of your hard work on your channel. One of my favorite ag-centric channels.
Yes you absolutely should be able to do this! Thank you I appreciate it 👍
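A minimal sketch of what that could look like, assuming Gradio is installed on the pod (the handler, `gr.ChatInterface`, and port 7860 are illustrative choices, and the port has to be one the pod actually exposes):

```python
import os

def respond(message, history):
    """Placeholder handler -- swap in a call to your actual model here."""
    return f"You said: {message}"

# Gate the launch behind an env var so importing this file doesn't block.
if os.environ.get("LAUNCH_UI"):
    import gradio as gr
    # Bind 0.0.0.0 so RunPod's proxy can reach it; 7860 must be exposed
    gr.ChatInterface(respond).launch(server_name="0.0.0.0", server_port=7860)
```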
Hi, what is the difference between this method and using vLLM, which I saw in the RunPod Data Centric video? Which way is better?
Is it possible to host the server here, or is RunPod just used for fine-tuning and training models?
you can absolutely host a server here!
Just found out how and got it working: apparently you need to host it on port 80, but I hadn't selected that option when I made the GPU pod.
Ah gotcha I’m glad you got it figured out 👍
This is the sauce! Thanks you! 🙏🏾
Thank you 🙌
Is it possible to use a model on the server and pass it to the local Ollama, so it can be used in any software locally?
Yeah, so I think if you had an API to retrieve something from the runpod.io LLM and then bring it back locally for anything, then absolutely. You would just need the URL for the RunPod pod to send the request to. Hope that made sense. I do plan on making a video where we build something more 'production'-ready.
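As a sketch of that, assuming the pod is running Ollama, the local side is just an HTTP POST to Ollama's `/api/generate` endpoint; the proxy URL below is a placeholder you'd replace with your own pod's URL or public IP:

```python
import json
import urllib.request

# Placeholder -- substitute your pod's proxy URL or public IP and port.
RUNPOD_URL = "https://YOUR-POD-ID-11434.proxy.runpod.net"

def build_payload(prompt: str, model: str = "llama3") -> bytes:
    """JSON body for Ollama's /api/generate endpoint."""
    return json.dumps({"model": model, "prompt": prompt, "stream": False}).encode()

def ask_remote_llm(prompt: str, model: str = "llama3") -> str:
    """Send a prompt to the remote pod and return the model's text reply."""
    req = urllib.request.Request(
        f"{RUNPOD_URL}/api/generate",
        data=build_payload(prompt, model),
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(req) as resp:
        return json.loads(resp.read())["response"]
```

Any local software that can make HTTP requests could call `ask_remote_llm` the same way it would a local Ollama instance.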
Thanks!
You are welcome!