Self-Hosted LLM Agent on Your Own Laptop or Edge Device - Michael Yuan, Second State

  • Published: 7 Nov 2024
  • Don't miss out! Join us at our upcoming conference: Open Source Summit + AI_Dev: Open Source GenAI & ML Summit in Tokyo from October 28-29, 2024. Connect with peers as the community gathers to further the education and advancement of open source and GenAI. Learn more at events.linuxfo...
    As LLM applications evolve from chatbots to copilots to AI agents, the need for privacy, customization, cost control, and value alignment keeps growing. Running open-source LLMs and agents on personal or private devices is a great way to achieve those goals. With the release of a new generation of open-source LLMs, such as Llama 3, the gap between open-source and proprietary LLMs is narrowing fast. In many cases, open-source LLMs already outperform SaaS-based proprietary LLMs. For AI agents, open-source LLMs are not just cheaper and more private; they also allow customization through fine-tuning and RAG prompt engineering on private data.
    This talk shows you how to build a complete AI agent service using an open-source LLM and a personal knowledge base. We will use the open-source WasmEdge + Rust stack for LLM inference, which is fast and lightweight without complex Python dependencies. It is cross-platform and achieves native performance across operating systems, CPUs, and GPUs.
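    The WasmEdge + Rust stack mentioned in the abstract is typically deployed through the LlamaEdge API server. A sketch of such a deployment follows; the model file name, repository URL, and flag values are illustrative assumptions and may differ across LlamaEdge versions:

    ```shell
    # Download a quantized open-source model (file name and repo are illustrative).
    curl -LO https://huggingface.co/second-state/Llama-3-8B-Instruct-GGUF/resolve/main/Meta-Llama-3-8B-Instruct-Q5_K_M.gguf

    # Run the LlamaEdge OpenAI-compatible API server on WasmEdge.
    # --nn-preload wires the GGUF model into the WASI-NN plugin;
    # --prompt-template selects the chat template matching the model family.
    wasmedge --dir .:. \
      --nn-preload default:GGML:AUTO:Meta-Llama-3-8B-Instruct-Q5_K_M.gguf \
      llama-api-server.wasm \
      --prompt-template llama-3-chat \
      --model-name llama-3-8b
    ```

    Because the server speaks an OpenAI-compatible API, existing agent frameworks can point at it without a Python runtime on the device.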
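    The "personal knowledge base" side of the agent boils down to a retrieval step: embed the documents, embed the query, and pick the closest document as context for the prompt. A minimal Rust sketch of that step, assuming embeddings are precomputed (in a real deployment an embedding model served by WasmEdge would produce these vectors; the toy vectors here are made up):

    ```rust
    /// Cosine similarity between two embedding vectors.
    fn cosine(a: &[f32], b: &[f32]) -> f32 {
        let dot: f32 = a.iter().zip(b).map(|(x, y)| x * y).sum();
        let na: f32 = a.iter().map(|x| x * x).sum::<f32>().sqrt();
        let nb: f32 = b.iter().map(|x| x * x).sum::<f32>().sqrt();
        if na == 0.0 || nb == 0.0 { 0.0 } else { dot / (na * nb) }
    }

    fn main() {
        // Tiny "knowledge base": (text, embedding) pairs with toy 3-dim vectors.
        let kb: Vec<(&str, Vec<f32>)> = vec![
            ("WasmEdge runs LLMs via the WASI-NN plugin.", vec![0.9, 0.1, 0.0]),
            ("Llama 3 is an open-source model family.",    vec![0.2, 0.8, 0.1]),
            ("Rust avoids complex Python dependencies.",   vec![0.1, 0.2, 0.9]),
        ];
        // Toy query embedding, closest to the first entry.
        let query = vec![0.85, 0.15, 0.05];

        // Retrieve the most similar document to use as RAG context.
        let best = kb
            .iter()
            .max_by(|a, b| {
                cosine(&query, &a.1)
                    .partial_cmp(&cosine(&query, &b.1))
                    .unwrap()
            })
            .unwrap();

        // Assemble a RAG-style prompt: retrieved context + user question.
        println!("Context: {}", best.0);
    }
    ```

    The retrieved text would then be prepended to the user's question before it is sent to the locally hosted LLM, which is what lets the agent answer from private data without fine-tuning.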
