High Fidelity Neural Audio Compression | Paper & Code Explained

  • Published: Nov 5, 2024

Comments • 11

  • @TheAIEpiphany
    @TheAIEpiphany  2 years ago +8

    I'm back! :)) Murphy's law is real, folks - I had a whole series of things happen to me over the past two or three weeks.
    Also - going forward I'll be focusing much more on practical ML projects than on papers.

  • @NicholasRenotte
    @NicholasRenotte 2 years ago +4

    Yo Aleksa, I just finished reading your blog post on Medium about getting into DeepMind. Loved how detailed you were about learning the nuts and bolts of ML. Just bought the Mathematics for ML book on your recommendation. Keep up the awesome work, you're smashing it!!

    • @TheAIEpiphany
      @TheAIEpiphany  2 years ago +1

      Thanks Nicholas!! :)) Glad to hear that!

  • @Nova-mt6ks
    @Nova-mt6ks 4 months ago

    Hi, guys. The paper mentions two different setups, "non-streamable" and "streamable". They seem to be two different CNN padding schemes? Do you know which parts of the code implement them? Thanks.
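
    A minimal sketch of the two padding schemes, assuming the paper's description: each convolution gets a total padding of kernel size minus stride, placed entirely before the first time step in the streamable (causal) setup and split between both ends in the non-streamable one. The helper names `pad_amounts` and `conv1d` are hypothetical and are not taken from the EnCodec repository.

```python
def pad_amounts(kernel_size, stride, causal):
    """Return (left, right) zero-padding for a 1-D convolution.

    Assumption: total padding is kernel_size - stride. The causal
    (streamable) variant puts all of it on the left; the non-causal
    variant splits it, with the extra sample on the left when odd.
    """
    total = kernel_size - stride
    if causal:
        return total, 0
    right = total // 2
    left = total - right
    return left, right


def conv1d(signal, kernel, stride=1, causal=False):
    """Plain sliding-window convolution (no kernel flip) with zero padding."""
    left, right = pad_amounts(len(kernel), stride, causal)
    padded = [0.0] * left + list(signal) + [0.0] * right
    out = []
    for start in range(0, len(padded) - len(kernel) + 1, stride):
        window = padded[start:start + len(kernel)]
        out.append(sum(w * k for w, k in zip(window, kernel)))
    return out


# A single impulse at time step 3 shows which outputs can "see" it.
impulse = [0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0]
kernel = [1.0, 1.0, 1.0]
causal_out = conv1d(impulse, kernel, causal=True)
noncausal_out = conv1d(impulse, kernel, causal=False)
```

    With causal padding, no output before step 3 reacts to the impulse, so the layer can run on audio as it arrives. With the symmetric split, the output at step 2 already depends on the sample at step 3, which means the layer needs future samples.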

  • @convolutionalnn2582
    @convolutionalnn2582 2 years ago

    I'd like to ask for some advice: if someone wants to do research in CV, would you recommend learning classic CV first or going straight to reading research papers?

  • @SergioBelevskij
    @SergioBelevskij 1 year ago

    Please tell me, what does the phrase "single multiscale spectrogram adversary" mean?

  • @alialzubaidy2591
    @alialzubaidy2591 7 months ago

    Is this method lossy or lossless?

  • @chinthangu929
    @chinthangu929 1 year ago

    Please share the code for this project.

  • @musajonestagiraena8828
    @musajonestagiraena8828 2 years ago

    I'm a stupid liberal arts major, so I don't understand the technical explanations. I'll just ask you directly:
    1) Is EnCodec lossless?
    2) Can it replace DSD or FLAC for super-high-quality audio?
    3) Its GitHub description mentions that the "non-causal 48 kHz model" was trained on music data. Can it then reproduce new kinds of sound that it hasn't learned yet, the way a conventional audio format built on a non-neural algorithm can?

    • @tibs7095
      @tibs7095 2 years ago +1

      1 & 2) It is not trying to be lossless.
      3) It's probably worse for stuff very unlike what it was trained on, especially since it's using codebooks. There's no reason why it couldn't be trained on more varied audio though.
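
      To make the "not lossless" and "codebooks" points concrete, here is a toy sketch of residual vector quantization, the kind of codebook-based quantizer the EnCodec paper describes. The codebook values, the function names, and the 2-D input are all invented for illustration; the real model learns much larger codebooks over latent vectors.

```python
def nearest(codebook, vec):
    """Index of the codebook entry closest to vec (squared Euclidean distance)."""
    return min(
        range(len(codebook)),
        key=lambda i: sum((c - v) ** 2 for c, v in zip(codebook[i], vec)),
    )


def rvq_encode(vec, codebooks):
    """Quantize vec stage by stage; each stage encodes the previous residual."""
    residual = list(vec)
    codes = []
    for codebook in codebooks:
        idx = nearest(codebook, residual)
        codes.append(idx)
        residual = [r - c for r, c in zip(residual, codebook[idx])]
    return codes


def rvq_decode(codes, codebooks):
    """Reconstruct by summing the chosen entry from every codebook."""
    out = [0.0] * len(codebooks[0][0])
    for idx, codebook in zip(codes, codebooks):
        out = [o + c for o, c in zip(out, codebook[idx])]
    return out


# Hypothetical codebooks: a coarse stage followed by a finer correction stage.
CODEBOOKS = [
    [[0.0, 0.0], [1.0, 0.0], [0.0, 1.0], [1.0, 1.0]],
    [[0.0, 0.0], [0.25, 0.0], [0.0, 0.25], [-0.25, 0.0], [0.0, -0.25]],
]

original = [0.9, 0.2]
codes = rvq_encode(original, CODEBOOKS)
reconstructed = rvq_decode(codes, CODEBOOKS)
```

      Only the two small integers in `codes` are stored, and `reconstructed` lands near `original` without matching it exactly. An input far from every codebook entry reconstructs with a larger error, which is the intuition behind point 3.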