Видео 289
Просмотров 259 786

Building and Evaluating Agentic RAG with Vectara and Arize

47:05

Introduction to LangGraph and Agent Evaluation

4:44

Phoenix 5.0 - Auth

17:07

Evaluating Agents and Assistants: The AI Conference

24:16

Architecting Trust - Challenges & Strategies for GenAI Safety and Alignment

24:03

Mission Critical: Scale to Conquer the Million User Milestone

28:18

AI Agents Masterclass with Chi Wang, Founder of AutoGen

We’re excited to be joined by Chi Wang, founder of AutoGen, to discuss this multi-agent framework for enabling next-gen AI applications.
Check out AutoGen: github.com/autogen-ai
Check out Phoenix: phoenix.arize.com/
Slides: bit.ly/agents-autogen

Видео

Building and Evaluating Agentic RAG with Vectara and Arize

47:05

Building and Evaluating Agentic RAG with Vectara and Arize

Просмотров 624 часа назад

Join us as we walk through how to build AI assistants using Agentic Retrieval-Augmented Generation (RAG). You’ll learn how to create dynamic AI systems that can adapt their retrieval strategies in real-time. We’ll showcase Vectara-agentic - a Python library for developing powerful AI assistants - and cover how to integrate Arize Phoenix to monitor and evaluate the performance of your RAG-based ...

Introduction to LangGraph and Agent Evaluation

4:44

Introduction to LangGraph and Agent Evaluation

Просмотров 2407 часов назад

LangGraph is a powerful framework to help you build AI agents in a structured format. In this video, Greg shows how you can build agents in LangGraph, and how you can use Arize to observe, trace, and evaluate those agents. 📓 Full write-up: arize.com/blog/langgraph

17:07

Phoenix 5.0 - Auth

Просмотров 8012 часов назад

Phoenix 5.0 Auth demo, covering: • What it looks like to deploy Phoenix with Auth • How to create users • How to create API keys • How to administer a system with authentication Product docs: docs.arize.com/phoenix/setup/authentication Github: github.com/Arize-ai/phoenix

Evaluating Agents and Assistants: The AI Conference

24:16

Evaluating Agents and Assistants: The AI Conference

Просмотров 29219 часов назад

Jason Lopatecki, Co-Founder and CEO of Arize AI, dives into the world of evaluating AI agents and assistants. Lopatecki offers a practical and technical perspective on agent architectures, common deployment patterns, and the challenges of bringing agents from proof-of-concept to production. This talk is essential viewing for AI practitioners looking to bridge the gap between experimental AI age...

Architecting Trust - Challenges & Strategies for GenAI Safety and Alignment

24:03

Architecting Trust - Challenges & Strategies for GenAI Safety and Alignment

Просмотров 5919 часов назад

In this panel, Hira Dangol (Bank of America), Soheil Koushan (Anthropic), Prateek Burman (CCC Intelligent Solutions), and Jonathan Steuck (Innodata) explore the key challenges and strategies for safely integrating large language models (LLMs) into enterprise systems. As LLM technology advances, businesses must address trust, compliance, and alignment with operational goals. Topics discussed inc...

Mission Critical: Scale to Conquer the Million User Milestone

28:18

Mission Critical: Scale to Conquer the Million User Milestone

Просмотров 2119 часов назад

In this panel discussion, industry leaders from Salesforce, Wayfair, and DoorDash, moderated by Aman Khan from Arize, share their experiences and best practices for scaling AI and large language models (LLMs) to serve millions of users. The panel dives into the technical challenges of deploying AI in production, such as infrastructure scaling, latency versus accuracy trade-offs, and the importa...

Building Better AI: Enhancing LLM Safety and Relevance

40:02

Building Better AI: Enhancing LLM Safety and Relevance

Просмотров 3321 час назад

Chapters 00:00 - Building Better AI: Enhancing LLM Safety and Relevance 00:03 - Code-Based Evaluations vs. LLM Evaluations 00:45 - Retrieval Performance and Evaluation in RAG Pipelines 01:44 - Human-Centric Evaluations and Relevance 03:37 - Improvements in Relevancy through Context 05:08 - CICD Tests and Experimentation in LLMs 8:31 - Pre-production vs. Production Evaluation 9:43 - Experimentat...

Google's NotebookLM and the Future of AI-Generated Audio

44:21

Google's NotebookLM and the Future of AI-Generated Audio

Просмотров 13121 час назад

Google's NotebookLM has been making waves, but perhaps not for the reasons originally anticipated. Though its "chat over docs" features are notable, its ability to generate highly realistic-sounding podcasts is what has really caught the attention of the AI community. Join us as we dive into NotebookLM’s unique features, including its ability to generate realistic-sounding podcast episodes from...

AI Agents Masterclass with Jerry Liu and Jason Lopatecki

48:32

AI Agents Masterclass with Jerry Liu and Jason Lopatecki

Просмотров 1,1 тыс.21 час назад

This masterclass is led by Jason Lopatecki (Founder of Arize AI), Jerry Liu (Co-Founder & CEO of LlamaIndex), and John Gilhuly (Developer Advocate at Arize). They delve into LlamaIndex workflows and the future of agent-driven architectures. In this class the experts explore the benefits of event-driven architectures in building intelligent agents and how these approaches compare to traditional ...

7:58

Object Detection Modeling in Arize

Просмотров 5914 дней назад

A quick demo of the object detection modeling and the capabilities Arize has around computer vision. Get a better idea of what’s going on in your CV datasets and what’s underperforming. Dive in with and see what’s possible, including: - Pull out patterns within the embeddings space that might identify different types of objects or locations that are being picked up in the images. - Look at the ...

Building Better AI: Improving Safety and Reliability of LLM Applications

13:09

Building Better AI: Improving Safety and Reliability of LLM Applications

Просмотров 10914 дней назад

Building Better AI: Improving Safety and Reliability of LLM Applications

AI Agent Mastery: Is Your Agent Stuck in a Loop?

53:27

AI Agent Mastery: Is Your Agent Stuck in a Loop?

Просмотров 22014 дней назад

AI Agent Mastery: Is Your Agent Stuck in a Loop?

4:52

Which Eval Model should you use?

Просмотров 21314 дней назад

Which Eval Model should you use?

Exploring OpenAI's o1-preview and o1-mini

46:58

Exploring OpenAI's o1-preview and o1-mini

Просмотров 32221 день назад

Exploring OpenAI's o1-preview and o1-mini

1:01:07

AI Agent Mastery: Evaluating Agents

Просмотров 35921 день назад

AI Agent Mastery: Evaluating Agents

Debug your AI with AI - Arize's AI Agent Search

4:17

Debug your AI with AI - Arize's AI Agent Search

Просмотров 23021 день назад

Debug your AI with AI - Arize's AI Agent Search

AI Agent Mastery: Comparing Agent Frameworks

1:00:53

AI Agent Mastery: Comparing Agent Frameworks

Просмотров 903Месяц назад

AI Agent Mastery: Comparing Agent Frameworks

1:01:28

AI Agent Mastery: Agent Architectures

Просмотров 1,1 тыс.Месяц назад

AI Agent Mastery: Agent Architectures

Breaking Down Reflection Tuning: Enhancing LLM Performance with Self-Learning

29:56

Breaking Down Reflection Tuning: Enhancing LLM Performance with Self-Learning

Просмотров 111Месяц назад

Breaking Down Reflection Tuning: Enhancing LLM Performance with Self-Learning

How to Trace a Groq Application in Phoenix

5:23

How to Trace a Groq Application in Phoenix

Просмотров 158Месяц назад

How to Trace a Groq Application in Phoenix

10:51

How To Set Up CrewAI Observability

Просмотров 344Месяц назад

How To Set Up CrewAI Observability

15:40

Trace a Vercel AI powered Chat App

Просмотров 164Месяц назад

Trace a Vercel AI powered Chat App

10:50

Build and Evaluate an Image Classifier

Просмотров 103Месяц назад

Build and Evaluate an Image Classifier

How Bazaarvoice Navigated the Challenges of Deploying an LLM App

3:28

How Bazaarvoice Navigated the Challenges of Deploying an LLM App

Просмотров 53Месяц назад

How Bazaarvoice Navigated the Challenges of Deploying an LLM App

Trace and Evaluate Haystack Pipelines with Phoenix

9:03

Trace and Evaluate Haystack Pipelines with Phoenix

Просмотров 90Месяц назад

Trace and Evaluate Haystack Pipelines with Phoenix

Prompt Optimization Using Datasets and Experiments

15:36

Prompt Optimization Using Datasets and Experiments

Просмотров 3552 месяца назад

Prompt Optimization Using Datasets and Experiments

Phoenix: Use Annotations to collect Human Feedback from your LLM App

9:06

Phoenix: Use Annotations to collect Human Feedback from your LLM App

Просмотров 2552 месяца назад

Phoenix: Use Annotations to collect Human Feedback from your LLM App

Community Paper Reading: Judging the Judges

41:04

Community Paper Reading: Judging the Judges

Просмотров 1722 месяца назад

Community Paper Reading: Judging the Judges

How Atropos Health Accelerates Research with LLM Observability

2:05

How Atropos Health Accelerates Research with LLM Observability

Просмотров 612 месяца назад

How Atropos Health Accelerates Research with LLM Observability

@aashishrulz 11 месяцев назад
great session
@vivekpadman5248 Год назад
Wonderful channel and great video ❤
@shehanbartholomeusz7511 Год назад
Great content. Learnt a lot. thank you
@ray-qs3nq Год назад
That is great content here! I learned a lot, hope to know more detail of RLS
@robertosala1974 2 года назад
Very interesting point: to have an eye on the accuracy of the knowledge of the model and amend the training of the model where it’s not obvious where it’s failing - this is true in the are I work for at Oracle which is text classification to predict the class (context) of the issues of storage devices
@pktechnik9668 2 года назад
👏 𝚙𝚛𝚘𝚖𝚘𝚜𝚖

Arize AI

Комментарии