Speech LLMs: Models that listen and talk back

  • Published: 9 Feb 2025

Comments • 11

  • @SlamShee • 2 days ago

    Awesome content

  • @escesc1 • 3 months ago

    Very interesting video, as usual!

  • @NLPprompter • 3 months ago +1

    Wow, this is exactly what I've been looking for; subscribed instantly. Are you interested in covering more models, such as Kyutai Moshi and hertz-dev? They seem to use different architectures.

    • @EfficientNLP • 3 months ago +1

      Great suggestions! I haven't looked at these two, but they are certainly relevant.

    • @NLPprompter • 3 months ago

      @EfficientNLP Awesome, can't wait for the next video. They are pretty similar, but I think the architecture inside is different; they aren't as smart as the OpenAI Realtime API, though. Also, llama-omni is based on Llama 3, with similar real-time AI conversation.

  • @isaakcarteraugustus1819 • 2 months ago +1

    Can you also make a video about Moshi or Mimi and how they have been trained?
    Edit: maybe also mini-omni2?

    • @EfficientNLP • 2 months ago +1

      Thanks for the suggestion; I will keep it in mind for the next video!

  • @lounes9777 • 2 months ago

    Didn't you check Moshi from Kyutai?

    • @EfficientNLP • 2 months ago +1

      You are correct; this is a relevant model, and the field is evolving rapidly. However, the principles in this video should still apply.

  • @weizhou6544 • 3 months ago

    Can it support RAG?

    • @EfficientNLP • 3 months ago +1

      Neither of the two models in this video has RAG, but it is possible to add a retrieval step prior to generation, since text tokens can be interleaved into speech LLMs.
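
      The reply above describes the idea of prepending retrieved text to a speech LLM's input sequence. A minimal sketch of that idea in Python, using a toy word-overlap retriever and placeholder tokens (all names here, including `retrieve`, `build_prompt`, and the `<ctx>`/`<speech>` markers, are hypothetical illustrations, not the API of either model in the video):

      ```python
      def retrieve(query: str, corpus: dict[str, str], top_k: int = 1) -> list[str]:
          """Toy retriever: rank documents by word overlap with the query."""
          q = set(query.lower().split())
          scored = sorted(
              corpus.items(),
              key=lambda kv: len(q & set(kv[1].lower().split())),
              reverse=True,
          )
          return [text for _, text in scored[:top_k]]

      def build_prompt(audio_tokens: list[int], query_text: str,
                       corpus: dict[str, str]) -> list:
          """Interleave retrieved text (as text tokens) ahead of the user's
          speech tokens, so generation conditions on the retrieved context."""
          context = " ".join(retrieve(query_text, corpus))
          text_tokens = context.split()  # stand-in for a real text tokenizer
          return (["<ctx>"] + text_tokens + ["</ctx>"]
                  + ["<speech>"] + audio_tokens + ["</speech>"])

      corpus = {
          "doc1": "llama omni is a speech llm built on llama 3",
          "doc2": "transformers use attention",
      }
      # Audio token IDs here are arbitrary placeholders for a speech codec's output.
      prompt = build_prompt([101, 102, 103], "what is llama omni", corpus)
      print(prompt)
      ```

      In a real system the retriever would be a vector index and the text would pass through the model's text tokenizer, but the key point from the reply is the sequence layout: retrieved text tokens first, then the speech tokens.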