Arize AI Phoenix: Open-Source Tracing & Evaluation for AI (LLM/RAG/Agent)

  • Published: 22 Aug 2024
  • Welcome to my tutorial on using Phoenix by Arize AI, the open-source AI observability platform that's revolutionizing experimentation, evaluation, and troubleshooting in AI applications. In this video, I’ll walk you through the powerful features of Phoenix, including tracing, evaluation, and inference analysis.
    What You'll Learn:
    1. Tracing: Understand how to trace your LLM application’s runtime using OpenTelemetry-based instrumentation.
    2. Evaluation: Discover how to leverage LLMs to benchmark your application’s performance using response and retrieval evaluations.
    3. Inference Analysis: Learn to visualize inferences and embeddings with dimensionality reduction and clustering to identify drift and performance degradation.
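    The tracing and instrumentation flow described in step 1 could be wired up roughly as below. This is a minimal sketch, not code from the video: it assumes the packages `arize-phoenix`, `openinference-instrumentation-openai`, and `openai` are installed and that `OPENAI_API_KEY` is set, and the exact helper names (`phoenix.otel.register`, `OpenAIInstrumentor`) may differ across Phoenix versions.

    ```python
    # Sketch: launch Phoenix locally and auto-instrument OpenAI calls
    # so each LLM request shows up as an OpenTelemetry trace in the UI.
    def trace_demo():
        import phoenix as px
        from phoenix.otel import register
        from openinference.instrumentation.openai import OpenAIInstrumentor
        from openai import OpenAI

        session = px.launch_app()        # starts the Phoenix UI (default: localhost:6006)
        tracer_provider = register()     # OpenTelemetry tracer provider pointed at Phoenix
        OpenAIInstrumentor().instrument(tracer_provider=tracer_provider)

        client = OpenAI()                # calls below now emit trace spans to Phoenix
        client.chat.completions.create(
            model="gpt-4o-mini",
            messages=[{"role": "user", "content": "Hello, Phoenix!"}],
        )
        return session.url               # open this URL to inspect the trace

    if __name__ == "__main__":
        print(trace_demo())
    ```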
    Why Phoenix?
    Phoenix is vendor- and language-agnostic, with out-of-the-box support for popular frameworks like 🦙 LlamaIndex, 🦜⛓ LangChain, and 🧩 DSPy, and LLM providers like OpenAI and Bedrock. Whether you’re working in a Jupyter notebook, on a local machine, in a containerized deployment, or in the cloud, Phoenix has you covered.
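    The LLM-as-judge evaluation from step 2 (e.g. the hallucination check on retrieved context) might look like the following sketch. It assumes `arize-phoenix-evals`, `pandas`, and `openai` are installed; the template and rails names follow the `phoenix.evals` API at the time of writing and should be checked against your installed version.

    ```python
    # Sketch: grade (input, reference, output) rows for hallucination
    # using an LLM evaluator via Phoenix evals.
    def run_hallucination_eval(df):
        # df is a pandas DataFrame with columns: input, reference, output
        from phoenix.evals import (
            HALLUCINATION_PROMPT_TEMPLATE,
            HALLUCINATION_PROMPT_RAILS_MAP,
            OpenAIModel,
            llm_classify,
        )

        # Rails constrain the judge's answer to the allowed labels
        # (e.g. "hallucinated" / "factual").
        rails = list(HALLUCINATION_PROMPT_RAILS_MAP.values())
        return llm_classify(
            dataframe=df,
            model=OpenAIModel(model="gpt-4o"),   # the judge model
            template=HALLUCINATION_PROMPT_TEMPLATE,
            rails=rails,
            provide_explanation=True,            # keep the judge's reasoning
        )
    ```

    The returned DataFrame carries one label (and explanation) per row, which Phoenix can then log alongside the corresponding traces.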
    Don’t forget to like, comment, and subscribe for more tutorials on AI and machine learning tools. Your support helps me create more content to help you on your AI journey!
    🔔 Hit the bell icon to get notified whenever I post a new video.
    Join Discord: / discord
    Join this channel to get access to perks:
    / @aianytime
    To further support the channel, you can contribute via the following methods:
    Bitcoin Address: 32zhmo5T9jvu8gJDGW3LTuKBM1KPMHoCsW
    UPI: sonu1000raw@ybl
    GitHub: github.com/AIA...
    #arize #ai #aiagents

Comments • 9

  • @sneharoy3566 · A month ago

    Nxt level

  • @yusufersayyem7242 · A month ago

    Thank you for your wonderful efforts Mr. Sonu ❤

  • @muhammedajmalg6426 · A month ago

    Evaluation is all you need, thanks for sharing

    • @AIAnytime · A month ago

      Glad it was helpful!

    • @muhammedajmalg6426 · A month ago

      @@AIAnytime I have sent you a connection request on LinkedIn, please accept; I don't have any connection notes left, thanks

  • @samimbsnl · A month ago

    Nice...............Evaluation is highly needed

  • @IdPreferNot1 · A month ago

    Just tried Langfuse... run standalone in a Docker container... not in detail though. This looks cool with the 'evaluator' for QA correctness and hallucinations. Tried with mini as the eval model... think it missed one... 4o seems good. But the Groq Llama 70B (or the FAISS embeddings and retriever) seems iffy. Come on Llama 400B next week!... and the Groq version, since I don't have a 1TB GPU ;)

  • @user-iu4id3eh1x · A month ago

    Wow this is what I need

  • @atultiwari88 · A month ago

    Hi Sonu, a good tutorial as always. In your opinion, which is the best LLM evaluator?