Is Semantic Search All You Need?

Beyond RAG Partitions: Per-User, Per-Chunk Access Policy

Challenges in Structured Document Data Extraction at Scale with LLMs

Stray Kids Answers 30 Questions As Quickly As Possible

Superman - Teaser Trailer Tomorrow

BABYMONSTER - 'Love In My Heart' M/V

LIVE! Nov 13 South Bay Unstructured Data Meetup

Zilliz

Просмотров 161

Добавить в
- Мой плейлист
- Посмотреть позже
Поделиться

Поделиться

HTML-код

Размер видео:

Показать панель управления

Автовоспроизведение

Автоповтор

Опубликовано: 2 фев 2025
5:30 - 6:00 - Welcome/Networking/Registration
6:05 - 6:30 - Dinesh Chandrasekhar, Challenges in Structured Document Data Extraction at Scale with LLMs
6:35 - 7:00 - James Luan, Milvus 2.4
7:05 - 7:30 - Rob Quiros, Beyond RAG Partitions: Per-User, Per-Chunk Access Policy
7:30 - 8:00 - Networking
Tech Talk 1: Challenges in Structured Document Data Extraction at Scale with LLMs
Speaker: Dinesh Chandrasekhar, Unstract
Abstract: All businesses have to deal with unstructured documents at some level. Some have to deal with them at scale. While an LLM-powered approach to this problem is most certainly head and shoulders above traditional machine learning-based approaches, it is not without its challenges. Top concerns being accuracy and cost, which can really begin to hurt at scale.
In this talk, we will look at how Unstract, an open source platform purpose-built for structured document data extraction, solves these challenges. Dealing with 5M+ pages of structured content extraction per month, Unstract uses various techniques to attain accuracy and cost efficiency.
Topics Covered
Introduction to Unstructured Data Processing
Processing Document Data
Extraction Difficulties
Unstract to the rescue
Demo
Tech Talk 2: Milvus 2.4
Speaker: James Luan, VP of Engineering, Zilliz
Tech Talk 3: Beyond RAG Partitions: Per-User, Per-Chunk Access Policy
Speaker: Rob Quiros, CEO & Co-Founder, Caber Systems, Inc.
Abstract: Partitioning vector databases has proven to be a useful tool for privacy and per-tenant isolation. Recent releases of vector db software, including Milvus, have continued to improve partitioning capabilities such as pushing the number of partitions into the millions and providing improved selection of partitions per tenant.
Despite these advances, management overhead increases with the number of partitions. Relative to the capabilities enterprises require and have come to expect from their existing storage systems and databases, there is still a shortfall. New capabilities specific to how vector databases store data and how they are used in RAG applications are needed.
Topics Covered:
- Origins of enterprise requirements for granular access control and policy in storage systems.
Sensitive data identification: data classification versus access control.
The problem data-duplication in enterprise datasets presents when copying permissions from documents to chunks.
How enterprise access requirements can be met with per-user, per-chunk access control
Case study and example implementation.

Комментарии •

Следующие

Автовоспроизведение

Is Semantic Search All You Need?

Is Semantic Search All You Need?

Beyond RAG Partitions: Per-User, Per-Chunk Access Policy

Beyond RAG Partitions: Per-User, Per-Chunk Access Policy

Challenges in Structured Document Data Extraction at Scale with LLMs

Challenges in Structured Document Data Extraction at Scale with LLMs

Stray Kids Answers 30 Questions As Quickly As Possible

Stray Kids Answers 30 Questions As Quickly As Possible

Superman - Teaser Trailer Tomorrow

Superman - Teaser Trailer Tomorrow

BABYMONSTER - 'Love In My Heart' M/V

BABYMONSTER - 'Love In My Heart' M/V

Islam Makhachev DENIES Arman Tsarukyan as toughest opponent👀 'I'll make everyone shut up' | ESPN MMA

Islam Makhachev DENIES Arman Tsarukyan as toughest opponent👀 'I'll make everyone shut up' | ESPN MMA

Evaluating RAG Pipelines Built on Unstructured Data

Evaluating RAG Pipelines Built on Unstructured Data

Bruin: The unified Analytic Platform, for Ingestion, Transformation, Data Quality and Governance

Bruin: The unified Analytic Platform, for Ingestion, Transformation, Data Quality and Governance

Andrew Ng Explores The Rise Of AI Agents And Agentic Reasoning | BUILD 2024 Keynote

Andrew Ng Explores The Rise Of AI Agents And Agentic Reasoning | BUILD 2024 Keynote

Building Production RAG Over Complex Documents

Building Production RAG Over Complex Documents

Stanford Webinar - Large Language Models Get the Hype, but Compound Systems Are the Future of AI

Stanford Webinar - Large Language Models Get the Hype, but Compound Systems Are the Future of AI

RAG Pipelines with Real-Time Data

RAG Pipelines with Real-Time Data

Challenge: Can you Break the LLM?

Challenge: Can you Break the LLM?

Yale NLP/LLM Interest Group - Session 23 Jimin Huang

Yale NLP/LLM Interest Group - Session 23 Jimin Huang

Ilya Sutskever: "Sequence to sequence learning with neural networks: what a decade"

Ilya Sutskever: "Sequence to sequence learning with neural networks: what a decade"

ПРОФЕССИЯ (смешное видео, юмор, приколы, поржать)

ПРОФЕССИЯ (смешное видео, юмор, приколы, поржать)

New Colour Match Puzzle Challenge - Incredibox Sprunki

New Colour Match Puzzle Challenge - Incredibox Sprunki

ДВУХГОЛОВЫЙ ГУСЬ

ДВУХГОЛОВЫЙ ГУСЬ

Горы Бесплатной пиццы

Горы Бесплатной пиццы

Вилка хвастается Варе! Часть 2 #shorts

Вилка хвастается Варе! Часть 2 #shorts

В Украине взяли в плен военных из Татарстана и Башкортостана #война

В Украине взяли в плен военных из Татарстана и Башкортостана #война

У Украины «нет шансов». ВСУ в Курской области. Могут ли США переломить ход конфликта? / Ширяев

У Украины «нет шансов». ВСУ в Курской области. Могут ли США переломить ход конфликта? / Ширяев