Overview of an Example LLM Inference Setup

  • Published: Oct 27, 2024

Comments • 12

  • @FarhadOmid 2 months ago

    Great work, Jordan! Gonna start scraping together the parts...

  • @rodrimora 2 months ago +2

    I feel jealous of that 8xH100 server. Currently using a 4x3090 at home. I actually use a pretty similar setup, with vLLM for the full-precision models, exllama or llama.cpp for quantized models, and Open WebUI as a frontend.
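
    A minimal sketch of how such a stack is typically queried, assuming vLLM's OpenAI-compatible server is running on localhost:8000 (e.g. started with something like `vllm serve <model>`) and the openai Python package is installed; the model name below is a placeholder, not one taken from the video:

        # Sketch only: assumes a vLLM server exposing the OpenAI-compatible
        # API on its default port 8000. The model name is a placeholder.
        from openai import OpenAI

        client = OpenAI(
            base_url="http://localhost:8000/v1",  # vLLM's OpenAI-compatible route
            api_key="not-needed",  # vLLM ignores the key unless one is configured
        )

        response = client.chat.completions.create(
            model="placeholder-model-name",  # whatever model the server loaded
            messages=[{"role": "user", "content": "Hello!"}],
            max_tokens=128,
        )
        print(response.choices[0].message.content)

    Open WebUI and similar frontends can point at the same base URL, which is what makes swapping backends like vLLM, exllama, or llama.cpp behind one frontend practical.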

    • @MadeInJack 2 months ago +1

      Why would you need more than that? Be glad for what you already have or you won't find happiness :)

    • @ricardocosta9336 2 months ago

      Bitch, I have a P40 and I'm over the moon. Being poor in ML is hard.

  • @fakebizPrez 2 months ago

    Sweet rig. Is that your daily driver? 😀😀

  • @niceshotapps1233 2 months ago +2

    - What are you using it for?
    - ... stuff

    • @0101-s7v 2 months ago

      AI, apparently. (LLM = Large Language Model)

  • @KCM25NJL 2 months ago +2

    The cost of such a setup is circa $500,000... amma get me 2 :)

  • @ZiggyDaZigster 2 months ago

    $30k graphics cards? 8 of them?

  • @nesdi6653 2 months ago

    Word

    • @nesdi6653 2 months ago

      Why not podman, though?