How to Apply LLMs on Audio Recordings with Multiple Speakers
- Published: 28 Jun 2024
- Get AssemblyAI API key for this tutorial: www.assemblyai.com/?...
LLMs work wonders on text data, but if you want to use audio or video files instead of text, things get a bit trickier. An easy solution is to transcribe the audio or video files. This works, but a plain transcript loses valuable information, especially in multi-speaker situations: how many people were speaking and who said what.
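As a minimal sketch of that idea, speaker diarization can be enabled when transcribing with the AssemblyAI Python SDK so the transcript keeps "who said what" (the API key and file name below are placeholders):

```python
import assemblyai as aai

# Placeholder key; get yours from assemblyai.com
aai.settings.api_key = "YOUR_ASSEMBLYAI_API_KEY"

# Enable speaker labels so the transcript is split into per-speaker utterances
config = aai.TranscriptionConfig(speaker_labels=True)
transcriber = aai.Transcriber()
transcript = transcriber.transcribe("meeting_recording.mp3", config=config)

# Each utterance carries a speaker label (A, B, ...) plus the spoken text
for utterance in transcript.utterances:
    print(f"Speaker {utterance.speaker}: {utterance.text}")
```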
In this video, we’ll learn how to build a RAG application in 10 minutes that can take multiple speakers into account when answering a question.
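For the RAG side, here is a rough sketch assuming Haystack 2.x with an OpenAI key set in the environment (the video itself uses the AssemblyAI-Haystack integration linked below; the example utterances, question, and model name are illustrative placeholders). The speaker-labeled utterances are indexed as documents so the retriever and the LLM can see speaker information:

```python
from haystack import Document, Pipeline
from haystack.document_stores.in_memory import InMemoryDocumentStore
from haystack.components.retrievers.in_memory import InMemoryBM25Retriever
from haystack.components.builders import PromptBuilder
from haystack.components.generators import OpenAIGenerator

# Speaker-labeled lines, e.g. produced by the transcription snippet above
utterances = [
    "Speaker A: Let's move the launch to next Tuesday.",
    "Speaker B: I can have the demo ready by Monday evening.",
]

# Index each utterance so speaker labels survive retrieval
document_store = InMemoryDocumentStore()
document_store.write_documents([Document(content=u) for u in utterances])

template = """Answer the question using the speaker-labeled transcript below.
{% for doc in documents %}
{{ doc.content }}
{% endfor %}
Question: {{ question }}
Answer:"""

rag = Pipeline()
rag.add_component("retriever", InMemoryBM25Retriever(document_store=document_store))
rag.add_component("prompt_builder", PromptBuilder(template=template))
rag.add_component("llm", OpenAIGenerator(model="gpt-4o-mini"))  # assumes OPENAI_API_KEY is set
rag.connect("retriever.documents", "prompt_builder.documents")
rag.connect("prompt_builder.prompt", "llm.prompt")

question = "Who offered to prepare the demo?"
result = rag.run({
    "retriever": {"query": question},
    "prompt_builder": {"question": question},
})
print(result["llm"]["replies"][0])
```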
Colab notebook: github.com/deepset-ai/haystac...
AssemblyAI-Haystack Integration docs: www.assemblyai.com/docs/integ...
Blog post of this video: haystack.deepset.ai/blog/leve...
00:00 Introduction
00:32 Effect of Speaker Labels
01:49 Libraries and example files
04:43 Transcription Pipeline
07:52 RAG Application
10:34 Results
11:52 Try it out yourself!
▬▬▬▬▬▬▬▬▬▬▬▬ CONNECT ▬▬▬▬▬▬▬▬▬▬▬▬
🖥️ Website: www.assemblyai.com/?...
🐦 Twitter: / assemblyai
🦾 Discord: / discord
▶️ Subscribe: ruclips.net/user/AssemblyAI?...
🔥 We're hiring! Check our open roles: www.assemblyai.com/careers
▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬
#MachineLearning #DeepLearning
Excellent work. I have a request: please make a video about "Authentication of user identity through voice."
Great, I was looking for something like this. 🙏
Great timing!
Thanks for the awesome tutorial!
Is there some way to map Speaker A to a known speaker? I was thinking of something like speaker embeddings? Also, is it possible to use this in a realtime application?
FIRST 🎉
Second 🙂