Seeing Voices: 1 - Intro to Spectrograms
HTML-код
- Опубликовано: 13 окт 2021
- Jay explores an incredible visualization method used in speech recognition technology and in the analysis of animal communication. Spectrograms show which frequencies (high-pitch/low pitch) are active in a specific sound. They reveal a lot about the nature of sound, aid artificial intelligence, but more excitingly, inform us about the intelligence of animals we underestimate.
Prairie Dogs: America's Meerkats - Language
• Prairie Dogs: America'...
Wild Dolphins Swimming in HD Compilation
• Wild Dolphins Swimming...
Speech Recognition, Mitch Marcus [slides]
www.seas.upenn.edu/~cis391/Le...
A big thank you for both the video you made. As someone working in ML audio for audio almost one year, this videos made thing clearer for me 😀
Thank you so much. This is very interesting topic. I'm looking forward for the next episode.
Amazing Jay.. Really helpful
Super cool ...just looking for sound exploration ...please make more on this ....also make video about notes in sound
Amazing jalamar!
As usual, you always share with us high quality content. 🙏 Thank you again, is there any notebook or code example?
Excellent intro! can i get the code of the Spectrogram?
Hey what does it mean dealing with mel spectrograms (128,216,3). By using 3 windows length 93ms,46ms and 23ms and in the end they have write 128,216,3 what does 3 shows here??
I am designing an animal communication app using riffusion.