Practical Data Considerations for Building Production-Ready LLM Applications

  • Published: 11 Oct 2023
  • Large Language Models (LLMs) are starting to revolutionize how users search for, interact with, and generate new content. Recent stacks and toolkits built around Retrieval-Augmented Generation (RAG) let users build applications such as chatbots with LLMs over their own private data, opening the door to a vast array of applications. However, while setting up a naive RAG stack is easy (a minimal sketch is included after the description below), there is a long tail of data challenges that must be tackled to make such an application production-ready. In this talk, we give practical tips on how to manage data when building a scalable, robust, and reliable LLM software system, and show how LlamaIndex + Ray provide the tools to do so.
    Find the slide deck here: drive.google.com/file/d/1n0SC...
    About Anyscale
    ---
    Anyscale is the AI Application Platform for developing, running, and scaling AI.
    www.anyscale.com/
    If you're interested in a managed Ray service, check out:
    www.anyscale.com/signup/
    About Ray
    ---
    Ray is the most popular open source framework for scaling and productionizing AI workloads. From Generative AI and LLMs to computer vision, Ray powers the world’s most ambitious AI workloads.
    docs.ray.io/en/latest/
    #llm #machinelearning #ray #deeplearning #distributedsystems #python #genai
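
    For reference, a naive RAG setup with LlamaIndex really can be just a few lines. The snippet below is a minimal sketch only: the "data" directory, the query string, and the reliance on default chunking and embedding settings are illustrative assumptions, not the talk's actual code.

      from llama_index import SimpleDirectoryReader, VectorStoreIndex

      # Load every document (e.g. PDFs) from a local folder using the default file readers.
      documents = SimpleDirectoryReader("data").load_data()

      # Build an in-memory vector index with default chunking and embedding settings.
      index = VectorStoreIndex.from_documents(documents)

      # Ask a question over the indexed documents and print the synthesized answer.
      query_engine = index.as_query_engine()
      print(query_engine.query("What are the key findings in these documents?"))

    It is exactly this defaults-everywhere setup that the talk uses as a starting point; the long tail of data work needed to go beyond it is the subject of the session.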

Comments • 1

  • @anantwag19 • 8 months ago

    Those initial 4 lines of code for reading a lot of PDF documents and creating an index really did result in hallucinations and inaccurate answers.