Automatic Speech Recognition - An Overview

Поделиться
HTML-код
  • Опубликовано: 18 ноя 2024

Комментарии • 97

  • @bharatha1206
    @bharatha1206 6 лет назад +25

    At 42:10 the MRI scan is goosebumps. Fascinating Stuff Madam. The amount of thought that goes into an ASR system is mind boggling.
    It was a great communication of what you guys are working on. thank you.

  • @kdalkafoukis
    @kdalkafoukis 4 года назад +1

    watching the video almost 2 years later with real-time translation and you can see improvements and still some misses, very nice one

  • @soundpassion-africa2874
    @soundpassion-africa2874 4 года назад +1

    A very intelligent researcher. I am passionate about ASR

  • @muditjain7667
    @muditjain7667 7 лет назад +15

    Very good introduction and overview to ASR. Thanks.

    • @zes7215
      @zes7215 6 лет назад

      nst as gx or not

  • @YouTubist666
    @YouTubist666 2 года назад +1

    Prof. Jyothi is presenting at Microsoft on an Apple laptop.

  • @NikkieBiteMe
    @NikkieBiteMe 5 лет назад +13

    This was so insightful and clearly presented! Thank you so much!!! :)

  • @dheerajkurugod
    @dheerajkurugod 5 лет назад

    Excellent talk about ASR by Preethi. Thank you very much

  • @SAI-kg6bb
    @SAI-kg6bb 6 лет назад +4

    Very good explanation :) It is very helpful to understand easily for this kind of topics, instead of going through vast research papers. Anyways thank you.

  • @girl4632
    @girl4632 Год назад

    Isn't not only about using words as basic unit.
    But about comparison of input data and the stored one asa slight difference between background noise, speed,voices, amplitudes will cause problems in comparison

  • @monkeymonkey7501
    @monkeymonkey7501 5 лет назад

    Got you! Prof Alan Black from CMU is among the audience!

  • @ItsOkaySandy
    @ItsOkaySandy 2 года назад

    Impressive Presentation Skills

  • @fugurerme8581
    @fugurerme8581 6 лет назад +1

    Very clearly overview for ASR, pretty good!

  • @DrNudratUniverse
    @DrNudratUniverse 2 года назад

    Wonderful explanation of ASR. Please share the PPT for this presentation with me.

  • @jamesyang8432
    @jamesyang8432 Год назад

    Vice nice introduction about ASR

  • @nadjaseeberg1753
    @nadjaseeberg1753 3 года назад

    Very neat and concise overview on ASR. Thanks a lot!

  • @saransuriya5638
    @saransuriya5638 4 года назад

    Sister you look like our state people (Tamil) I don't know exactly but I think u r from Tamil Nadu..
    Anyway nice explanation... Preethi akka(sister)...

  • @quasinx5606
    @quasinx5606 6 лет назад +1

    Dear Microsoft Research,
    At 1:09:00 you are at the section "What's next?" and talk about several problems. One of those problems is handling noisy real-life settings with many speakers (e.g., meetings, parties). Is it possible to implement the coktail party algorithm into your system? It may be possible to solve the problem partially.
    Yours sincerely,
    D.

  • @rajasudalaimuthu
    @rajasudalaimuthu 5 лет назад

    Excellent overview of ASR. Thanks you.

  • @shivamsoni9763
    @shivamsoni9763 6 лет назад +1

    Hello Ma'am Can you suggest some material or source related to speech recogniton.
    I want to work on this in my M.Tech thesis.

  • @TechVizTheDataScienceGuy
    @TechVizTheDataScienceGuy 4 года назад +1

    Really nice introduction 👍

  • @kshitijmishra2750
    @kshitijmishra2750 3 года назад

    Absolutely amazing and very insightful session.

  • @springeve159
    @springeve159 4 года назад +1

    Thank you so much. ..Such an excellent presentation!

  • @monad_tcp
    @monad_tcp 4 года назад

    Dynamic Belgian Networks ? I guess from context its "Bayesian". I can understand English, but I had to watch this with subtitles on, to test how good CC was.

  • @wiamfadel7321
    @wiamfadel7321 3 года назад

    Thank u, and Thanks to the ASR of RUclips that i can understand clearly what u said as i m not english speaker.

  • @Presserp
    @Presserp 2 года назад

    Not all mistakes were caught. Palin for example said "that entrepreneurial spirit", not "their entrepreneurial spirit."

  • @nevinguo5556
    @nevinguo5556 5 лет назад +1

    Very helpful introduction. Thanks!

  • @muhammadbinzafar2216
    @muhammadbinzafar2216 6 лет назад

    That's a lot of talks, beautifully explained!

  • @nisharshah7167
    @nisharshah7167 4 года назад

    Good introduction of ASR system.

  • @juniorsilva5713
    @juniorsilva5713 8 месяцев назад

    Great lecture! Thanks a lot! =)

  • @ahmedalsaady-y7s
    @ahmedalsaady-y7s 6 дней назад

    pdf of this lecture?

  • @elizabethsherly7693
    @elizabethsherly7693 5 лет назад +1

    good lecture. But expected something more on End-To-End System with RNN

    • @11hamma
      @11hamma 4 года назад

      ya same

    • @tinku067
      @tinku067 Год назад

      this comment didn't age well, Attention is all you need to change the paradigm to more transformer-based models.

  • @dikshadhote7786
    @dikshadhote7786 5 лет назад

    heyy....can I get the links of all videos use in this seminar

  • @ebaitarh1753
    @ebaitarh1753 3 года назад +1

    thanks very much for this video. i wish to ask, which university is that? i wish to take part in your lectures. Multilingual speech recognition is my topic of research

    • @aryanchauhan8066
      @aryanchauhan8066 3 года назад +1

      Indian institute of technology Bombay best college in india

  • @archanasm3402
    @archanasm3402 5 лет назад

    Very good explanation. Thanks a lot

  • @SpaceSoftSystem
    @SpaceSoftSystem 10 месяцев назад

    how i can change live videos voice , like English voice to Hindi , how i can change your any video English voice to Hindi voice , i am not able to read caption

  • @farooq8fox
    @farooq8fox 3 года назад +1

    16:20, subtitles have adapted to accent now

  • @muhammadnomankhanassistant3793
    @muhammadnomankhanassistant3793 4 года назад +1

    Excellent presentation, Stay Blessed.

  • @3bdo3id
    @3bdo3id 10 месяцев назад

    ‏‪1:03:19‬‏
    Graves & Jaitley, "Towards End-to-End Speech Recognition with Recurrent
    Neural Networks" ICML 2014
    --:--:--
    Maas et al, "Lexicon Free Conversional Speech Recognition with Neural Networks", NAACL 2015
    1:09:36
    Chan et al., "Listen, Attend and Spell: A NN for large Vocabulary Conversional Speech Recognition", ICASSP 2016

  • @BadriNathJK
    @BadriNathJK 2 года назад

    Wow. Well presented

  • @joserene6467
    @joserene6467 6 лет назад +3

    Muito esclarecedor , obrigado!

  • @hadjdaoudmomo9534
    @hadjdaoudmomo9534 5 лет назад

    Great clear presentation

  • @naveengabriel9368
    @naveengabriel9368 4 года назад

    Where can i get the graph at 1:13:10 title Languages with ASR

  • @takapaisa5450
    @takapaisa5450 6 лет назад +1

    Can I have the slide used in this video, please...?

    • @maddai1764
      @maddai1764 5 лет назад +2

      www.cse.iitb.ac.in/~pjyothi/cs753/slides/lecture1.pdf

    • @arnavdas3139
      @arnavdas3139 5 лет назад

      @@maddai1764 thanks...

  • @jon4136
    @jon4136 5 лет назад

    needed this for my research sub.

  • @chrissidiras
    @chrissidiras 4 года назад

    Funny when she says 'clearly speech recognition is not difficult for humans'. It takes at least 1-2 years for humans gain this skill and a decade to be fully developed, if you include speech in noise recognition. It is far from easy.

  • @anwarshome
    @anwarshome 6 лет назад

    Your awesome, please post more lectures

    • @PragyAgarwal
      @PragyAgarwal 6 лет назад +2

      The recordings aren't available, but you can get all other material here: www.cse.iitb.ac.in/~pjyothi/cs753/

  • @miitan9421
    @miitan9421 6 лет назад

    Very nicely explained!

  • @fridaynightfunkinmoni7600
    @fridaynightfunkinmoni7600 4 года назад +1

    Thank you very much my sir
    please my project for Speech Samples Recognition
    and i need heelp please. Can i get on your emial please

  • @RAVINDRABACHATE
    @RAVINDRABACHATE 5 лет назад +1

    Shall I get the ppt for reference?
    Very nice lecture. Thank you.

  • @aariskazi9002
    @aariskazi9002 4 года назад

    Can I get the name of the speaker ?

  • @mymathsclass5980
    @mymathsclass5980 2 года назад

    Well explained.

  • @spwim
    @spwim 5 лет назад +2

    this should be subtitled for sure :-D

  • @amarchaudhary5832
    @amarchaudhary5832 6 лет назад

    Nice session....

  • @prakashchaudhary8491
    @prakashchaudhary8491 4 года назад

    IITB my favourite 💟💟💟

  • @kenichimori8533
    @kenichimori8533 7 лет назад

    It’s Point Presentation.

  • @kirill_bykov
    @kirill_bykov 4 года назад

    27:42 subtitles are wrong and no way to fix them. I dislike who is not open for contribution.

  • @monad_tcp
    @monad_tcp 4 года назад +3

    51:04, lol, youtube CC has the joke in its model. let us pray vs lettuce spray.

    • @Aditya-te7oo
      @Aditya-te7oo 2 года назад

      Luiz Felipe 😂😂
      Btw, I want to ask a question, what one has to do if someone is interested in it ? I mean what educational background he/she has to have ?

    • @monad_tcp
      @monad_tcp 2 года назад +1

      @@Aditya-te7oo my background is computing science. I guess that and specialization in statistics/data analysis how help.

    • @Aditya-te7oo
      @Aditya-te7oo 2 года назад

      @@monad_tcp
      Okay, thanks for that.
      Btw, it's 5:15 in the morning and I didn't sleep the whole night. 😂😅

  • @eggstrovaganza
    @eggstrovaganza 5 лет назад +5

    so...at microsoft research they use macbook 🤔🤗

  • @redwanboukhalfa4264
    @redwanboukhalfa4264 6 лет назад

    Slides link please

    • @maddai1764
      @maddai1764 5 лет назад

      www.cse.iitb.ac.in/~pjyothi/cs753/slides/lecture1.pdf

  • @enthdegree
    @enthdegree Год назад +1

    why does she mispronounce phoneme for the first half hour

  • @aojing
    @aojing 6 месяцев назад

    When was this talk recorded? It's way too much outdated and definitely not fashion even back in 2016...

    • @ReArNiDcOM
      @ReArNiDcOM 5 месяцев назад

      I am currently learning about ASRs and I am trying to find good resources on modern ASRs so that I can make my own ASR system. This is largely to learn and have a new project on my resume. Do you have any recommendation for resources more relevant than this video?

  • @Aditya-te7oo
    @Aditya-te7oo 2 года назад

    Liked it.

  • @ifstory
    @ifstory 4 года назад

    Am I missing something - the moderator is using an APPLE computer - Isn't this a Microsoft event? Bill Gates must be going crazy.!!

  • @moalghifari
    @moalghifari 5 лет назад

    Cool laptop

  • @priyanshusharma4484
    @priyanshusharma4484 Год назад

    18:02

  • @kenichimori8533
    @kenichimori8533 7 лет назад

    Vocabulary ✌🏿

  • @priyanshusharma4484
    @priyanshusharma4484 Год назад

    14:18

  • @laenetmoloto9716
    @laenetmoloto9716 2 года назад

    31:05

  • @christopherilayaraja1399
    @christopherilayaraja1399 6 лет назад +1

    I like you preethi!!!!

  • @kenichimori8533
    @kenichimori8533 7 лет назад

    Speech white ❤️

  • @Tadesan
    @Tadesan 2 года назад

    She's wearing her formal pajamas.

  • @李白-f5u
    @李白-f5u 4 года назад

    I was attracted by this women

  • @kenichimori8533
    @kenichimori8533 7 лет назад

    Texture Infact co so many likely presentation.
    F(0^=) Prime index Equal Point Proof 0

  • @aylasedai2317
    @aylasedai2317 6 лет назад +1

    While the talk is interesting, I am so sick of hearing "X is sexist" or any other "-ist" or "-ism". Can we just say that the application is too narrowly focused?
    For heaven's sake, it was a toy dog with a magnet, not a woman or elderly person hating machine.
    I