Serving 100s of LLMs on 1 GPU with LoRAX - Travis Addair | Stanford MLSys #84

  • Published: 28 Dec 2024
  • Science

Comments • 8

  • @voncolborn9437
@voncolborn9437 a year ago +2

Great presentation. It is interesting to see the practical side of running a bunch of LLMs. Ops makes it happen. Coming from the old, really old, school of computing with massive multi-user, time-share systems, it is interesting to see how, no matter how much computing changes, aspects of it remain the same. Throughput, latency, caching, and scheduling are still central. All that seems to have changed is the problem domain. We do, indeed, live in interesting times.

  • @conan_der_barbar
@conan_der_barbar a year ago +1

Great talk! Still waiting for the open-source release 👀

  • @Gerald-iz7mv
@Gerald-iz7mv 9 months ago

Hi, do you have any links to benchmarks that measure latency and throughput for different models and frameworks?
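
    (As a minimal, hedged sketch of the kind of measurement asked about here: the snippet below times requests against a generic HTTP generate endpoint and reports median latency and request throughput. The URL and payload shape are assumptions for illustration; LoRAX, vLLM, and TGI each expose their own request formats, and dedicated benchmark harnesses are more rigorous.)

        import time
        import statistics
        import requests

        # Hypothetical endpoint and payload; adjust for your serving framework.
        URL = "http://localhost:8080/generate"
        PAYLOAD = {"inputs": "Hello!", "parameters": {"max_new_tokens": 64}}

        def benchmark(n_requests: int = 20) -> None:
            latencies = []
            start = time.perf_counter()
            for _ in range(n_requests):
                t0 = time.perf_counter()
                resp = requests.post(URL, json=PAYLOAD, timeout=60)
                resp.raise_for_status()
                latencies.append(time.perf_counter() - t0)
            elapsed = time.perf_counter() - start
            # Median per-request latency and aggregate request throughput.
            print(f"p50 latency: {statistics.median(latencies) * 1000:.1f} ms")
            print(f"throughput:  {n_requests / elapsed:.2f} req/s")

        if __name__ == "__main__":
            benchmark()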

  • @suleimanshehu5839
@suleimanshehu5839 a year ago

Please create a video on fine-tuning a MoE LLM with LoRA adapters, such as the Mixtral 8x7B MoE LLM, within your framework.

  • @fastcardlastname3353
@fastcardlastname3353 a year ago

This could change the landscape of multi-agent systems if it delivers on its promise.

  • @mohamedfouad1309
@mohamedfouad1309 a year ago

GitHub link 😅

  • @nithinrao7191
@nithinrao7191 a year ago

    Second

  • @absbi0000
@absbi0000 a year ago

    First