Automatic Speech Recognition - An Overview

Поделиться
HTML-код
  • Опубликовано: 24 дек 2024

Комментарии • 97

  • @bharatha1206
    @bharatha1206 7 лет назад +26

    At 42:10 the MRI scan is goosebumps. Fascinating Stuff Madam. The amount of thought that goes into an ASR system is mind boggling.
    It was a great communication of what you guys are working on. thank you.

  • @kdalkafoukis
    @kdalkafoukis 4 года назад +1

    watching the video almost 2 years later with real-time translation and you can see improvements and still some misses, very nice one

  • @girl4632
    @girl4632 Год назад

    Isn't not only about using words as basic unit.
    But about comparison of input data and the stored one asa slight difference between background noise, speed,voices, amplitudes will cause problems in comparison

  • @3bdo3id
    @3bdo3id 11 месяцев назад

    ‏‪1:03:19‬‏
    Graves & Jaitley, "Towards End-to-End Speech Recognition with Recurrent
    Neural Networks" ICML 2014
    --:--:--
    Maas et al, "Lexicon Free Conversional Speech Recognition with Neural Networks", NAACL 2015
    1:09:36
    Chan et al., "Listen, Attend and Spell: A NN for large Vocabulary Conversional Speech Recognition", ICASSP 2016

  • @YouTubist666
    @YouTubist666 2 года назад +1

    Prof. Jyothi is presenting at Microsoft on an Apple laptop.

  • @quasinx5606
    @quasinx5606 6 лет назад +1

    Dear Microsoft Research,
    At 1:09:00 you are at the section "What's next?" and talk about several problems. One of those problems is handling noisy real-life settings with many speakers (e.g., meetings, parties). Is it possible to implement the coktail party algorithm into your system? It may be possible to solve the problem partially.
    Yours sincerely,
    D.

  • @soundpassion-africa2874
    @soundpassion-africa2874 4 года назад +1

    A very intelligent researcher. I am passionate about ASR

  • @muditjain7667
    @muditjain7667 7 лет назад +15

    Very good introduction and overview to ASR. Thanks.

    • @zes7215
      @zes7215 6 лет назад

      nst as gx or not

  • @SpaceSoftSystem
    @SpaceSoftSystem Год назад

    how i can change live videos voice , like English voice to Hindi , how i can change your any video English voice to Hindi voice , i am not able to read caption

  • @farooq8fox
    @farooq8fox 3 года назад +1

    16:20, subtitles have adapted to accent now

  • @SAI-kg6bb
    @SAI-kg6bb 6 лет назад +4

    Very good explanation :) It is very helpful to understand easily for this kind of topics, instead of going through vast research papers. Anyways thank you.

  • @ahmedalsaady-y7s
    @ahmedalsaady-y7s Месяц назад

    pdf of this lecture?

  • @TechVizTheDataScienceGuy
    @TechVizTheDataScienceGuy 4 года назад +1

    Really nice introduction 👍

  • @naveengabriel9368
    @naveengabriel9368 4 года назад

    Where can i get the graph at 1:13:10 title Languages with ASR

  • @shivamsoni9763
    @shivamsoni9763 6 лет назад +1

    Hello Ma'am Can you suggest some material or source related to speech recogniton.
    I want to work on this in my M.Tech thesis.

  • @dikshadhote7786
    @dikshadhote7786 5 лет назад

    heyy....can I get the links of all videos use in this seminar

  • @jamesyang8432
    @jamesyang8432 Год назад

    Vice nice introduction about ASR

  • @kirill_bykov
    @kirill_bykov 4 года назад

    27:42 subtitles are wrong and no way to fix them. I dislike who is not open for contribution.

  • @ItsOkaySandy
    @ItsOkaySandy 2 года назад

    Impressive Presentation Skills

  • @ebaitarh1753
    @ebaitarh1753 3 года назад +1

    thanks very much for this video. i wish to ask, which university is that? i wish to take part in your lectures. Multilingual speech recognition is my topic of research

    • @aryanchauhan8066
      @aryanchauhan8066 3 года назад +1

      Indian institute of technology Bombay best college in india

  • @fugurerme8581
    @fugurerme8581 6 лет назад +1

    Very clearly overview for ASR, pretty good!

  • @DrNudratUniverse
    @DrNudratUniverse 2 года назад

    Wonderful explanation of ASR. Please share the PPT for this presentation with me.

  • @takapaisa5450
    @takapaisa5450 6 лет назад +1

    Can I have the slide used in this video, please...?

    • @maddai1764
      @maddai1764 5 лет назад +2

      www.cse.iitb.ac.in/~pjyothi/cs753/slides/lecture1.pdf

    • @arnavdas3139
      @arnavdas3139 5 лет назад

      @@maddai1764 thanks...

  • @dheerajkurugod
    @dheerajkurugod 5 лет назад

    Excellent talk about ASR by Preethi. Thank you very much

  • @NikkieBiteMe
    @NikkieBiteMe 5 лет назад +13

    This was so insightful and clearly presented! Thank you so much!!! :)

  • @Presserp
    @Presserp 2 года назад

    Not all mistakes were caught. Palin for example said "that entrepreneurial spirit", not "their entrepreneurial spirit."

  • @monkeymonkey7501
    @monkeymonkey7501 5 лет назад

    Got you! Prof Alan Black from CMU is among the audience!

  • @nadjaseeberg1753
    @nadjaseeberg1753 4 года назад

    Very neat and concise overview on ASR. Thanks a lot!

  • @aariskazi9002
    @aariskazi9002 4 года назад

    Can I get the name of the speaker ?

  • @monad_tcp
    @monad_tcp 4 года назад

    Dynamic Belgian Networks ? I guess from context its "Bayesian". I can understand English, but I had to watch this with subtitles on, to test how good CC was.

  • @fridaynightfunkinmoni7600
    @fridaynightfunkinmoni7600 4 года назад +1

    Thank you very much my sir
    please my project for Speech Samples Recognition
    and i need heelp please. Can i get on your emial please

  • @enthdegree
    @enthdegree Год назад +1

    why does she mispronounce phoneme for the first half hour

  • @springeve159
    @springeve159 4 года назад +1

    Thank you so much. ..Such an excellent presentation!

  • @rajasudalaimuthu
    @rajasudalaimuthu 5 лет назад

    Excellent overview of ASR. Thanks you.

  • @kshitijmishra2750
    @kshitijmishra2750 3 года назад

    Absolutely amazing and very insightful session.

  • @nisharshah7167
    @nisharshah7167 5 лет назад

    Good introduction of ASR system.

  • @monad_tcp
    @monad_tcp 4 года назад +3

    51:04, lol, youtube CC has the joke in its model. let us pray vs lettuce spray.

    • @Aditya-te7oo
      @Aditya-te7oo 2 года назад

      Luiz Felipe 😂😂
      Btw, I want to ask a question, what one has to do if someone is interested in it ? I mean what educational background he/she has to have ?

    • @monad_tcp
      @monad_tcp 2 года назад +1

      @@Aditya-te7oo my background is computing science. I guess that and specialization in statistics/data analysis how help.

    • @Aditya-te7oo
      @Aditya-te7oo 2 года назад

      @@monad_tcp
      Okay, thanks for that.
      Btw, it's 5:15 in the morning and I didn't sleep the whole night. 😂😅

  • @saransuriya5638
    @saransuriya5638 4 года назад

    Sister you look like our state people (Tamil) I don't know exactly but I think u r from Tamil Nadu..
    Anyway nice explanation... Preethi akka(sister)...

  • @elizabethsherly7693
    @elizabethsherly7693 6 лет назад +1

    good lecture. But expected something more on End-To-End System with RNN

    • @11hamma
      @11hamma 4 года назад

      ya same

    • @tinku067
      @tinku067 Год назад

      this comment didn't age well, Attention is all you need to change the paradigm to more transformer-based models.

  • @BadriNathJK
    @BadriNathJK 2 года назад

    Wow. Well presented

  • @hadjdaoudmomo9534
    @hadjdaoudmomo9534 6 лет назад

    Great clear presentation

  • @muhammadbinzafar2216
    @muhammadbinzafar2216 6 лет назад

    That's a lot of talks, beautifully explained!

  • @nevinguo5556
    @nevinguo5556 5 лет назад +1

    Very helpful introduction. Thanks!

  • @RAVINDRABACHATE
    @RAVINDRABACHATE 5 лет назад +1

    Shall I get the ppt for reference?
    Very nice lecture. Thank you.

  • @archanasm3402
    @archanasm3402 5 лет назад

    Very good explanation. Thanks a lot

  • @chrissidiras
    @chrissidiras 5 лет назад

    Funny when she says 'clearly speech recognition is not difficult for humans'. It takes at least 1-2 years for humans gain this skill and a decade to be fully developed, if you include speech in noise recognition. It is far from easy.

  • @miitan9421
    @miitan9421 6 лет назад

    Very nicely explained!

  • @muhammadnomankhanassistant3793
    @muhammadnomankhanassistant3793 4 года назад +1

    Excellent presentation, Stay Blessed.

  • @wiamfadel7321
    @wiamfadel7321 3 года назад

    Thank u, and Thanks to the ASR of RUclips that i can understand clearly what u said as i m not english speaker.

  • @anwarshome
    @anwarshome 7 лет назад

    Your awesome, please post more lectures

    • @PragyAgarwal
      @PragyAgarwal 7 лет назад +2

      The recordings aren't available, but you can get all other material here: www.cse.iitb.ac.in/~pjyothi/cs753/

  • @aojing
    @aojing 7 месяцев назад

    When was this talk recorded? It's way too much outdated and definitely not fashion even back in 2016...

    • @ReArNiDcOM
      @ReArNiDcOM 7 месяцев назад

      I am currently learning about ASRs and I am trying to find good resources on modern ASRs so that I can make my own ASR system. This is largely to learn and have a new project on my resume. Do you have any recommendation for resources more relevant than this video?

  • @juniorsilva5713
    @juniorsilva5713 9 месяцев назад

    Great lecture! Thanks a lot! =)

  • @jon4136
    @jon4136 5 лет назад

    needed this for my research sub.

  • @joserene6467
    @joserene6467 6 лет назад +3

    Muito esclarecedor , obrigado!

  • @redwanboukhalfa4264
    @redwanboukhalfa4264 6 лет назад

    Slides link please

    • @maddai1764
      @maddai1764 5 лет назад

      www.cse.iitb.ac.in/~pjyothi/cs753/slides/lecture1.pdf

  • @amarchaudhary5832
    @amarchaudhary5832 6 лет назад

    Nice session....

  • @mymathsclass5980
    @mymathsclass5980 2 года назад

    Well explained.

  • @kenichimori8533
    @kenichimori8533 7 лет назад

    It’s Point Presentation.

  • @prakashchaudhary8491
    @prakashchaudhary8491 4 года назад

    IITB my favourite 💟💟💟

  • @spwim
    @spwim 5 лет назад +2

    this should be subtitled for sure :-D

  • @priyanshusharma4484
    @priyanshusharma4484 Год назад

    18:02

  • @laenetmoloto9716
    @laenetmoloto9716 2 года назад

    31:05

  • @eggstrovaganza
    @eggstrovaganza 5 лет назад +5

    so...at microsoft research they use macbook 🤔🤗

  • @priyanshusharma4484
    @priyanshusharma4484 Год назад

    14:18

  • @kenichimori8533
    @kenichimori8533 7 лет назад

    Vocabulary ✌🏿

  • @Aditya-te7oo
    @Aditya-te7oo 2 года назад

    Liked it.

  • @ifstory
    @ifstory 4 года назад

    Am I missing something - the moderator is using an APPLE computer - Isn't this a Microsoft event? Bill Gates must be going crazy.!!

  • @moalghifari
    @moalghifari 5 лет назад

    Cool laptop

  • @kenichimori8533
    @kenichimori8533 7 лет назад

    Speech white ❤️

  • @李白-f5u
    @李白-f5u 4 года назад

    I was attracted by this women

  • @christopherilayaraja1399
    @christopherilayaraja1399 6 лет назад +1

    I like you preethi!!!!

  • @aylasedai2317
    @aylasedai2317 6 лет назад +1

    While the talk is interesting, I am so sick of hearing "X is sexist" or any other "-ist" or "-ism". Can we just say that the application is too narrowly focused?
    For heaven's sake, it was a toy dog with a magnet, not a woman or elderly person hating machine.
    I

  • @kenichimori8533
    @kenichimori8533 7 лет назад

    Texture Infact co so many likely presentation.
    F(0^=) Prime index Equal Point Proof 0

  • @Tadesan
    @Tadesan 2 года назад

    She's wearing her formal pajamas.