Linformer: Self-Attention with Linear Complexity (Paper Explained)

Neural Architecture Search without Training (Paper Explained)

SANE2018 | Yu Zhang - Towards End-to-end Speech Synthesis

Atlanta Falcons Highlights in win vs. Philadelphia Eagles | 2024 Regular Season Week 2

10 NEW Costco Deals You NEED To Buy in September 2024

Den of Thieves 2: Pantera (2025) Official Trailer - Gerard Butler, O’Shea Jackson Jr.

End-to-End Adversarial Text-to-Speech (Paper Explained)

Yannic Kilcher

Просмотров 14 тыс.

Добавить в
- Мой плейлист
- Посмотреть позже
Поделиться

Поделиться

HTML-код

Размер видео:

Показать панель управления

Автовоспроизведение

Автоповтор

Опубликовано: 20 сен 2024

Комментарии • 39

@MrAmirhossein1 4 года назад ⁺⁶
Hey Yannic!
just wanted to thank you for the excellent content that you provide.
Keep it up man :)
@HarisGulzar-d9c 6 месяцев назад
Never enjoyed paper explanations this much.
Thanks, Yannic!
@rishabhkumar722 7 месяцев назад ⁺¹
Wow... Why not more TTS papers explanation
@zhivebelarus560 Месяц назад
Yannic, thanks for doing this! Quick question: why instead of fiddling with the aligner they did not start training from smaller samples like one phoneme long and then as loss drops gradually increase the sample length to 2, 3, etc? It seems too much black magic going on in training a tts model. Do you have a suggestion for the most clean architecture that works well? Is there a good review of one step tts models? How can a speaker embedding can be integrated for voice cloning into such model? Sorry for too many questions…
@rvalusa 4 года назад ⁺³
Awesome. Superb explanation. Love the channel and content 👍👏🙂
@kimchi_taco 4 года назад ⁺¹
Thank you! It includes so many ad-hoc. I wonder why it's better than combination of Tacotron+WaveNet?
@motherbear55 3 года назад
Quality wise it’s not better than tacotron (see MOS scores in the paper-tacotron is about 4.5, this approach is about 4.0). But unlike tacotron, it’s not autoregressive, so inference can be much faster.
@alaapdhall8541 4 года назад ⁺³
ah always so fast, I heard the google released pre trained weights for big transfer, could you also make a video on BiT?
@alaapdhall8541 4 года назад
@Mallow Marsh oh ok, I'll go through his videos then
@Haapavuo 2 года назад
Whose videos? The comment was deleted. Thanks.
@revanthadiga329 2 года назад ⁺¹
anyone knows where to find this code implementation
@hannesstark5024 4 года назад ⁺¹
Visual Transformers tomorrow?
@ushasr2821 3 года назад
Great explaination Thank you so much
@avihudekel4709 3 года назад
Great work!
@henkjekel4081 6 месяцев назад
You're the best
@myungchulkang5716 4 года назад ⁺¹
Nice !
@shivamraisharma1474 4 года назад ⁺²
Amazing! Do we have any GitHub code or pretrained model weights available?
@YannicKilcher 4 года назад
I don't think so
@СергейПавлович-г2и 4 года назад ⁺¹
Can I try it somewhere?
@YannicKilcher 4 года назад
Not sure. I've linked their website in the description
@ziqiangshi8167 4 года назад
Awesome.
@DinaEl-Kholy-- 3 года назад
Thank you!!
@bossgd100 4 года назад ⁺¹
Its working in real time ?
@herp_derpingson 4 года назад ⁺⁵
Anything can be real time if you have enough compute
@YannicKilcher 4 года назад ⁺²
I don't think so
@bossgd100 4 года назад
@@herp_derpingson the singularity is far 😵
@koheimatsuura3610 4 года назад
@@YannicKilcher Hi :) why do you think so? this seems non-autoregressive model and I think its inferences are so fast...
@screenapple1660 4 года назад ⁺¹
people want realistic TTS voice that sounds high-quality humans. not robot voice. Robot Voice is usually free. But it's stupid.
Most businesses use high-quality human voice synthesis.
@snippletrap 4 года назад ⁺¹
I think Tacotron sounds better
@bossgd100 4 года назад ⁺¹
First !
@yabdelm 4 года назад ⁺⁵
I absolutely love the content but I vote for not saying "As always if you like this work subscribe" I believe if people are exploring AI videos, they probably know where the subscribe button is, and if they like the videos, they'll probably subscribe. Plus we've heard it a billion times in every video on RUclips ever made. It just becomes noise at a certain point. At this point I’m thinking of training an AI to skip every time someone says that.
Nevertheless, they're your videos, and a personal choice, not a democracy. Feel free to disagree. Don't mean to be mean or anything.
@lakshay510 4 года назад ⁺⁴
Hi but I also don't agree with you, When I am doing any kind of research I just open 10s of tab and start exploring it one by one and sometimes if I get the right content I learn the stuff and leave, Also there are analytics that youtube provide which might show that most of his viewers are not his subscribers.
@yabdelm 4 года назад ⁺¹
Lakshay Chhabra You think the majority of people will subscribe because he reminded them to subscribe? I don’t doubt that that might occur as I really have no way of checking that. I agree that some way of determining that from the analytics would be better.
@siyn007 4 года назад ⁺²
For me I usually have a few trial videos before I subscribe but I must admit being told to subscribe lets me evaluate if I should subscribe instead of just exiting like what Lakshay suggested. I agree with not telling people where the subscribe button is though.
@yabdelm 4 года назад ⁺²
@@siyn007 Oh sorry but I don't think Yannic specified where the subscribe button was. I just meant to point to saying whether or not to subscribe.
I see. Good to know that there's the opposite take there. It's definitely not the end of the world. :D I still love Yannic and his videos.
@YannicKilcher 4 года назад ⁺⁴
This is one of the things that, yes, is slightly annoying, but you'd be surprised how many people who aren't subscribed go "oh yes, I could do that". So I try to give you the high level before I say that so that you can decide to skip the video without having to listen to it :)

Следующие

Автовоспроизведение

Linformer: Self-Attention with Linear Complexity (Paper Explained)

Linformer: Self-Attention with Linear Complexity (Paper Explained)

Neural Architecture Search without Training (Paper Explained)

Neural Architecture Search without Training (Paper Explained)

SANE2018 | Yu Zhang - Towards End-to-end Speech Synthesis

SANE2018 | Yu Zhang - Towards End-to-end Speech Synthesis

Atlanta Falcons Highlights in win vs. Philadelphia Eagles | 2024 Regular Season Week 2

Atlanta Falcons Highlights in win vs. Philadelphia Eagles | 2024 Regular Season Week 2

10 NEW Costco Deals You NEED To Buy in September 2024

10 NEW Costco Deals You NEED To Buy in September 2024

Den of Thieves 2: Pantera (2025) Official Trailer - Gerard Butler, O’Shea Jackson Jr.

Den of Thieves 2: Pantera (2025) Official Trailer – Gerard Butler, O’Shea Jackson Jr.

VIEWER BEWARE... DIGITAL CIRCUS EPISODE 3 IS NEAR!

VIEWER BEWARE... DIGITAL CIRCUS EPISODE 3 IS NEAR!

DALL-E: Zero-Shot Text-to-Image Generation | Paper Explained

DALL-E: Zero-Shot Text-to-Image Generation | Paper Explained

SynFlow: Pruning neural networks without any data by iteratively conserving synaptic flow

SynFlow: Pruning neural networks without any data by iteratively conserving synaptic flow

RepNet: Counting Out Time - Class Agnostic Video Repetition Counting in the Wild (Paper Explained)

RepNet: Counting Out Time - Class Agnostic Video Repetition Counting in the Wild (Paper Explained)

SIREN: Implicit Neural Representations with Periodic Activation Functions (Paper Explained)

SIREN: Implicit Neural Representations with Periodic Activation Functions (Paper Explained)

The Attention Mechanism in Large Language Models

The Attention Mechanism in Large Language Models

Group Normalization (Paper Explained)

Group Normalization (Paper Explained)

When BERT Plays the Lottery, All Tickets Are Winning (Paper Explained)

When BERT Plays the Lottery, All Tickets Are Winning (Paper Explained)

Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention

Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention

NVAE: A Deep Hierarchical Variational Autoencoder (Paper Explained)

NVAE: A Deep Hierarchical Variational Autoencoder (Paper Explained)

НУБ И ПРО ПРОХОДЯТ ПОДВОДНУЮ ТРОЛЛИНГ КАРТУ В МАЙНКРАФТ ! НУБИК И ПРО В ЛОВУШКЕ MINECRAFT

НУБ И ПРО ПРОХОДЯТ ПОДВОДНУЮ ТРОЛЛИНГ КАРТУ В МАЙНКРАФТ ! НУБИК И ПРО В ЛОВУШКЕ MINECRAFT

Включаем рубильник крана спустя 27 лет простоя! Жахнет?!

Включаем рубильник крана спустя 27 лет простоя! Жахнет?!

Bike Vs Tricycle Fast Challenge

Bike Vs Tricycle Fast Challenge

Ромарио стал Ромой

Ромарио стал Ромой

Eco-hero strikes again! ♻️ DIY king 💪🏻

Eco-hero strikes again! ♻️ DIY king 💪🏻

КОГДА ОЧЕНЬ НЕ ЛЮБИШЬ ДЕЛИТЬСЯ:😂😂😂 #пранкнаддругом #юмор #прикол

КОГДА ОЧЕНЬ НЕ ЛЮБИШЬ ДЕЛИТЬСЯ:😂😂😂 #пранкнаддругом #юмор #прикол

ОЧЕНЬ "ХОРОШИЙ" ХОЗЯИН ► Fears to Fathom - Woodbury Getaway #2

ОЧЕНЬ "ХОРОШИЙ" ХОЗЯИН ► Fears to Fathom - Woodbury Getaway #2

МАМА В 16 | 2 СЕЗОН, 3 ВЫПУСК | ИРИНА, САНКТ-ПЕТЕРБУРГ

МАМА В 16 | 2 СЕЗОН, 3 ВЫПУСК | ИРИНА, САНКТ-ПЕТЕРБУРГ