High Fidelity Neural Audio Compression | Paper & Code Explained

How to Build a Deep Learning Machine - Everything You Need To Know

DeepMind Perceiver and Perceiver IO | Paper Explained

Mutliple tornadoes hit Oklahoma on Sunday morning. What we know

Apex Legends: From the Rift Gameplay Trailer

SIDEMEN AMONG US BUT THE WHOLE LOBBIES INFECTED

AudioGen: Textually Guided Audio Generation | Text To Audio | Paper Explained

Aleksa Gordić - The AI Epiphany

Просмотров 8 тыс.

Добавить в
- Мой плейлист
- Посмотреть позже
Поделиться

Поделиться

HTML-код

Размер видео:

Показать панель управления

Автовоспроизведение

Автоповтор

Опубликовано: 5 ноя 2024

Комментарии • 13

@TheAIEpiphany 2 года назад ⁺¹¹
Text To Image, Text To Audio, Text To Video - multimodal research is going strong
@calebmcirvin7708 2 года назад
Multimodal algorithms are fascinating! I've recently started playing around with DALL-E 2, and I'm just having a great time making art I wouldn't be able to create on my own and seeing the ideas others in the community have come up with. Textually guided audio generation definitely sounds like it has useful applications as well - can't wait to see the directions additional research takes!
@swyxTV 2 года назад ⁺⁴
love you for doing all these paper walkthrus man, keep it up
@jianghong6444 Год назад ⁺²
I believe audiogen uses encoder and decoder from encodec, a successor of soundstream made by meta ai, but for some reason, by the time AudioGen is published, encodec is not published yet, and encodec is not referred in the paper. If you look at encodec's paper you find that the LSTM at the 2nd to last layer is their invention
@momirmilutinovic2161 2 года назад ⁺¹
I think I understood the explanation of complex-valued STFTs, so I'll try to back it up with an example of my own.
The problem with using amplitude as a representation of complex numbers is that is purely based on the distance of a given point from the origin. What this means is that 1, -1, i, -i, and the rest of the points on the unit circle (circle with radius 1 and center at origin) are treated as the same number i.e. 1, when they aren't the same. When we use both the real and imaginary parts to represent these complex numbers, they'll become different values. Otherwise, we end up squashing a 2D plane into a 1D line, so some information must be lost.
@fahnub 8 месяцев назад
Thank you for doing this
@convolutionalnn2582 2 года назад ⁺¹
I wanna do research in RL...I have learn most Supervised and Unsupervised Algorithms and is able to implement,know some of the maths and use it.....I am now learning maths for ml book...
1) Should i do Deep Learning course or wait until I could derive the whole maths behind Supervised and Unsupervised Algorithms ?
2) Is your blog How to start RL can be follow by me whose aim is research in RL?
@Neptutron 2 года назад ⁺²
Whoah, thank you for bringing this to my attention, I've been waiting for something like this! How did you first hear of this paper? (I'm looking for ways to keep up to date on things lol)
@TheAIEpiphany 2 года назад ⁺²
Twitter :)) as always
@mehular0ra Год назад
Amazing video! Pls do the code walkthrough too
@johnpope1473 2 года назад
I wonder if augmenting whisper text encoder is going to yield a breakthrough in this space…
@jmoneydroid 2 года назад
Any idea if work is being done on Human Behavior modality? Imagine this could be hugely powerful for both past triggers, as well as forward predictive analysis
@frankhovis 6 месяцев назад
Simple eh?

Следующие

Автовоспроизведение

High Fidelity Neural Audio Compression | Paper & Code Explained

High Fidelity Neural Audio Compression | Paper & Code Explained

How to Build a Deep Learning Machine - Everything You Need To Know

How to Build a Deep Learning Machine - Everything You Need To Know

DeepMind Perceiver and Perceiver IO | Paper Explained

DeepMind Perceiver and Perceiver IO | Paper Explained

Mutliple tornadoes hit Oklahoma on Sunday morning. What we know

Mutliple tornadoes hit Oklahoma on Sunday morning. What we know

Apex Legends: From the Rift Gameplay Trailer

Apex Legends: From the Rift Gameplay Trailer

SIDEMEN AMONG US BUT THE WHOLE LOBBIES INFECTED

SIDEMEN AMONG US BUT THE WHOLE LOBBIES INFECTED

bo6 glitch: AFTER PATCH: BOAT PILE UP GLITCH for terminus island easy camo glitch and XP glitch

bo6 glitch: AFTER PATCH: BOAT PILE UP GLITCH for terminus island easy camo glitch and XP glitch

MusicLM Generates Music From Text [Paper Breakdown]

MusicLM Generates Music From Text [Paper Breakdown]

Why Does Diffusion Work Better than Auto-Regression?

Why Does Diffusion Work Better than Auto-Regression?

How to get started with Graph ML? (Blog walkthrough)

How to get started with Graph ML? (Blog walkthrough)

OpenAI GLIDE (Diffusion) | ML Coding series | Towards Photorealistic Image Generation and Editing

OpenAI GLIDE (Diffusion) | ML Coding series | Towards Photorealistic Image Generation and Editing

OpenAI CLIP: ConnectingText and Images (Paper Explained)

OpenAI CLIP: ConnectingText and Images (Paper Explained)

NeurIPS 2022 - interviews with AI experts and conference walk-through

NeurIPS 2022 - interviews with AI experts and conference walk-through

MusicGen: Simple and Controllable Music Generation Explained

MusicGen: Simple and Controllable Music Generation Explained

VOS: Learning What You Don't Know by Virtual Outlier Synthesis (Paper Explained)

VOS: Learning What You Don't Know by Virtual Outlier Synthesis (Paper Explained)

5 Open Source Generative Music Models You Can't Miss

5 Open Source Generative Music Models You Can't Miss

При живой матери. Выпуск М/Ж от 15.02.2017

При живой матери. Выпуск М/Ж от 15.02.2017

Angel Edgar VS Demon Mortis - Animation

Angel Edgar VS Demon Mortis - Animation

Почему ХАОС - это не СЛУЧАЙНОСТЬ, а ПОРЯДОК? - ТОПЛЕС

Почему ХАОС — это не СЛУЧАЙНОСТЬ, а ПОРЯДОК? — ТОПЛЕС

БАСКЕТБОЛИСТЫ ИГРАЮТ В НАСТОЛЬНЫЙ ТЕННИС #иванабрамов #дедищев #баскетбол #пингпонг #shorts

БАСКЕТБОЛИСТЫ ИГРАЮТ В НАСТОЛЬНЫЙ ТЕННИС #иванабрамов #дедищев #баскетбол #пингпонг #shorts

ВОТ ЧТО МЫ КУПИЛИ НА ALIEXPRESS

ВОТ ЧТО МЫ КУПИЛИ НА ALIEXPRESS

馨馨：老頭太過分了，跑得慢還怨我，不就是吃個飯至於這麼麻煩嗎？ #萌娃#親子#爸爸帶娃#搞笑

馨馨：老頭太過分了，跑得慢還怨我，不就是吃個飯至於這麼麻煩嗎？ #萌娃#親子#爸爸帶娃#搞笑

Результаты выборов США. Что сказал Трамп. Где Харрис? Как следят за выборами из России и Украины

Результаты выборов США. Что сказал Трамп. Где Харрис? Как следят за выборами из России и Украины

Lp. Сердце Вселенной #40 ПЕРВОЕ ЛЕКАРСТВО [С последствиями...] • Майнкрафт

Lp. Сердце Вселенной #40 ПЕРВОЕ ЛЕКАРСТВО [С последствиями...] • Майнкрафт