Dear Professor Jürgen, you are a life saver. This is so helpful especially when the class is starting to use Praat even more now! You explain the concept so simply and slowly it's a breath of fresh air truly!
I've been teaching this for years, and it's the first time I see it in a very comprehensive and easy way to understand. especially the comparison with other instruments, which is so useful! Thank you so much!
I've rarely heard such a clear explanation of sound waves as in this video. It couldn't be clearer! I'm now working as a speech therapist with voice patients, and I'm so glad I watched this video carefully.
(0:03) Hello. This e-lecture is an introduction to some of the central aspects of acoustic phonetics: the study of the physical properties of the speech signal. This requires a precise understanding of the nature of sound waves, and the experimental techniques used in the field. In this e-lecture, we will focus on sound waves in general; we will look at simple as well as complex sound waves; and we will discuss the phenomenon of resonance. (0:39) Sound, as you all know, originates from the motion (or vibration) of a sound source, for example from a tuning fork. The result of this vibration is known as a sound wave - in this particular case, a simple sound wave - which can be mathematically modelled as a sine wave. Here are some examples (1:04). (1:12) Most sources of sounds produce complex sets of vibrations. They arise from the combination of a number of simple sound waves, and are referred to as complex sound waves. Speech involves the use of complex sound waves because it results from the simultaneous use of many sound sources in the vocal tract. Here are some examples (1:37), and a vowel (1:44). Okay. (1:49) The vibration of a sound source is normally intensified by the body around it. This intensification is referred to as resonance. Depending on the material and the shape of this body, several resonance frequencies are produced. So, our program is clear: we will first look at simple, then at complex sound waves, and we'll finally discuss the phenomenon of resonance. (2:20) Simple sound waves are regular in motion and are [thus] referred to as periodic. Two properties are central to the measurement of simple sound waves. The frequency of a sound wave is measured in hertz (Hz) - it denotes the number of cycles of a sound wave per second. Here we have one cycle (2:43). Assuming that this here is one second, we need two cycles to fill that second. In other words, we have a frequency of 2Hz. (3:09) The amplitude of a sound wave denotes the maximum displacement of a sound wave in a cycle of movement - that is, the distance from the rest point - and is thus important for the loudness of a sound wave. However, the total sensation of loudness is a combination of frequency and amplitude. For this reason, the term intensity (measured in decibels (Db)) is used to refer to the overall loudness of a sound. (3:45) Having discussed simple sound waves, it is important to note that every sound we hear is not a pure tone, but a complex tone: its waveform is not simple, but complex. Complex waveforms are synthesized from a sufficient number of simple sound waves. There are two types of complex waveforms: complex periodic sound waves, and complex aperiodic sound waves. Speech makes use of both kinds: vowels for example are basically periodic, whereas consonants range from periodic to aperiodic. (4:34) As already mentioned, the vibration of a sound source is normally intensified by the body around it. So, each sound wave (whether simple or complex) consists of a sound source and some sort of resonance. The sound wave created by a sound source - for example by a tuning fork, by the piece of reed in a saxophone, at the orifice of a flute, or (last but not least) by the vocal folds - is a complex sound wave and it is referred to as the fundamental frequency, or F-Nought (F0) - the Americans sometimes use the term F-Zero instead of F-Naught. F-Naught is filtered - that is, it is intensified or damped - by numerous parts of the resonating body (for example, by the body of a saxophone, the body of a flute, or by the vocal tract). The resulting bundles of resonance frequencies (or harmonics) are multiples of F0. In speech, they're called formants and are numbered F1, F2, and so on. (6:09) Let us exemplify this first on the basis of a musical instrument, and then on the basis of speech. On an oboe, F0 is the result of the vibration of the reed. This fundamental frequency is intensified and damped by the resonating body. As a result, a number of harmonics are created as integer multiples of the frequency of F0. So, this is where F0 is created (6:43), and let’s now listen to the sounds and then work out the harmonics. Let’s take the tone A (6:56). Now, A involves a fundamental frequency of 440Hz, then F1 is twice 440Hz (that is 880Hz). F2 is times F0 (that is 1320Hz), and so on and so forth. (7:29) In speech, F0 is the result of vocal fold vibration. Depending on age, sex, and pitch it varies between 50Hz and 500Hz. So, let's write that down - 50Hz to 500Hz. By changing and modifying the shape of the vocal tract (that is the resonating body), the acoustic properties of F0 are altered and this causes different harmonics - or rather clusters of harmonics - to be boosted or smoothed. These clusters - or more precisely their peaks - are referred to as formants. In fact, two main resonance chambers of speech production can be associated with these formants. On the one hand we have the pharyngeal chamber (the pharynx), which creates the first formant (F1), and the second chamber is the oral resonance chamber, which is associated with the second formant, or F2. However, the formant pattern of a speech sound is the outcome of the whole vocal tract working as one resonance system. Furthermore, the harmonics of a sound can often hardly be identified since many sounds involve additional acoustic effects, such as friction noise, bursts, and so on and so forth. So, it is a little bit more complicated than that. (9:16) The interplay between F0 (the fundamental frequency) and its resonance frequencies has come to be known as the source-filter model. So, let's briefly look at the source filter-model. This model associates laryngeal action with vocal tract resonance. At the bottom, we can see down here (9:38) the unmodified laryngeal source wave with its periodic resonance frequencies. And then we have two further displays of the frequency spectrum. In the middle, we see a schematic display of the first three formant peaks: F1, F2, and F3. And at the top we see the modified spectrum with all resonance frequencies. The current setting - this setting that is displayed over here (10:14) - displays the frequency spectrum for the vowel [a]. If we compare these settings with the spectrum for the vowel [i], we can see that the laryngeal frequency spectrum remains unaffected - we have the same fundamental frequency. The frequency spectrum, however, that includes the vocal tract resonances, is different. On the one hand, we can see different values for F1 - now here is a value for [a], and here are the values for [i] -so F1 and F2 are different. Whereas in [a] the first two formats are relatively close together; [i] involves a low F1 value, but a very high value for the second formant. The explanation is quite simple: the oral cavity for [i] which is responsible for the F2 value is much smaller (it’s this little bit here (11:31)) as compared with the oral cavity involved in [a]. So, this explains the low value for F2 in the vowel [a], and the high value for F2 in the vowel [i]. (11:51) Well, it is quite simple I think, and I hope you've understood it. So that's it for now. (11:59) I hope you have now got a first idea about how physicists describe sounds in general, and how speech sounds are produced and described acoustically, and in particular how the vocal tract damps and amplifies the fundamental frequency that is created by means of vocal fold vibration. (12:19) Thanks for your attention.
Professor Jürgen Handke, your explanation is the best! Such clarity and simplicity is not a easy thing! Now suddenly, all the books I've read become clear! Merci beaucoup!
The Virtual Linguistics Campus and The Ling Space are getting me through my graduate degree one video at a time! Very grateful!!
Dear Professor Jürgen, you are a life saver. This is so helpful especially when the class is starting to use Praat even more now! You explain the concept so simply and slowly it's a breath of fresh air truly!
Thanks, just spread the news, you are all welcome in our video channel.
I've been teaching this for years, and it's the first time I see it in a very comprehensive and easy way to understand. especially the comparison with other instruments, which is so useful! Thank you so much!
I've rarely heard such a clear explanation of sound waves as in this video. It couldn't be clearer! I'm now working as a speech therapist with voice patients, and I'm so glad I watched this video carefully.
(0:03) Hello. This e-lecture is an introduction to some of the central aspects of acoustic phonetics: the study of the physical properties of the speech signal. This requires a precise understanding of the nature of sound waves, and the experimental techniques used in the field. In this e-lecture, we will focus on sound waves in general; we will look at simple as well as complex sound waves; and we will discuss the phenomenon of resonance.
(0:39) Sound, as you all know, originates from the motion (or vibration) of a sound source, for example from a tuning fork. The result of this vibration is known as a sound wave - in this particular case, a simple sound wave - which can be mathematically modelled as a sine wave. Here are some examples (1:04).
(1:12) Most sources of sounds produce complex sets of vibrations. They arise from the combination of a number of simple sound waves, and are referred to as complex sound waves. Speech involves the use of complex sound waves because it results from the simultaneous use of many sound sources in the vocal tract. Here are some examples (1:37), and a vowel (1:44). Okay.
(1:49) The vibration of a sound source is normally intensified by the body around it. This intensification is referred to as resonance. Depending on the material and the shape of this body, several resonance frequencies are produced. So, our program is clear: we will first look at simple, then at complex sound waves, and we'll finally discuss the phenomenon of resonance.
(2:20) Simple sound waves are regular in motion and are [thus] referred to as periodic. Two properties are central to the measurement of simple sound waves. The frequency of a sound wave is measured in hertz (Hz) - it denotes the number of cycles of a sound wave per second. Here we have one cycle (2:43). Assuming that this here is one second, we need two cycles to fill that second. In other words, we have a frequency of 2Hz.
(3:09) The amplitude of a sound wave denotes the maximum displacement of a sound wave in a cycle of movement - that is, the distance from the rest point - and is thus important for the loudness of a sound wave. However, the total sensation of loudness is a combination of frequency and amplitude. For this reason, the term intensity (measured in decibels (Db)) is used to refer to the overall loudness of a sound.
(3:45) Having discussed simple sound waves, it is important to note that every sound we hear is not a pure tone, but a complex tone: its waveform is not simple, but complex. Complex waveforms are synthesized from a sufficient number of simple sound waves. There are two types of complex waveforms: complex periodic sound waves, and complex aperiodic sound waves. Speech makes use of both kinds: vowels for example are basically periodic, whereas consonants range from periodic to aperiodic.
(4:34) As already mentioned, the vibration of a sound source is normally intensified by the body around it. So, each sound wave (whether simple or complex) consists of a sound source and some sort of resonance. The sound wave created by a sound source - for example by a tuning fork, by the piece of reed in a saxophone, at the orifice of a flute, or (last but not least) by the vocal folds - is a complex sound wave and it is referred to as the fundamental frequency, or F-Nought (F0) - the Americans sometimes use the term F-Zero instead of F-Naught. F-Naught is filtered - that is, it is intensified or damped - by numerous parts of the resonating body (for example, by the body of a saxophone, the body of a flute, or by the vocal tract). The resulting bundles of resonance frequencies (or harmonics) are multiples of F0. In speech, they're called formants and are numbered F1, F2, and so on.
(6:09) Let us exemplify this first on the basis of a musical instrument, and then on the basis of speech. On an oboe, F0 is the result of the vibration of the reed. This fundamental frequency is intensified and damped by the resonating body. As a result, a number of harmonics are created as integer multiples of the frequency of F0. So, this is where F0 is created (6:43), and let’s now listen to the sounds and then work out the harmonics. Let’s take the tone A (6:56). Now, A involves a fundamental frequency of 440Hz, then F1 is twice 440Hz (that is 880Hz). F2 is times F0 (that is 1320Hz), and so on and so forth.
(7:29) In speech, F0 is the result of vocal fold vibration. Depending on age, sex, and pitch it varies between 50Hz and 500Hz. So, let's write that down - 50Hz to 500Hz. By changing and modifying the shape of the vocal tract (that is the resonating body), the acoustic properties of F0 are altered and this causes different harmonics - or rather clusters of harmonics - to be boosted or smoothed. These clusters - or more precisely their peaks - are referred to as formants. In fact, two main resonance chambers of speech production can be associated with these formants. On the one hand we have the pharyngeal chamber (the pharynx), which creates the first formant (F1), and the second chamber is the oral resonance chamber, which is associated with the second formant, or F2. However, the formant pattern of a speech sound is the outcome of the whole vocal tract working as one resonance system. Furthermore, the harmonics of a sound can often hardly be identified since many sounds involve additional acoustic effects, such as friction noise, bursts, and so on and so forth. So, it is a little bit more complicated than that.
(9:16) The interplay between F0 (the fundamental frequency) and its resonance frequencies has come to be known as the source-filter model. So, let's briefly look at the source filter-model. This model associates laryngeal action with vocal tract resonance. At the bottom, we can see down here (9:38) the unmodified laryngeal source wave with its periodic resonance frequencies. And then we have two further displays of the frequency spectrum. In the middle, we see a schematic display of the first three formant peaks: F1, F2, and F3. And at the top we see the modified spectrum with all resonance frequencies. The current setting - this setting that is displayed over here (10:14) - displays the frequency spectrum for the vowel [a]. If we compare these settings with the spectrum for the vowel [i], we can see that the laryngeal frequency spectrum remains unaffected - we have the same fundamental frequency. The frequency spectrum, however, that includes the vocal tract resonances, is different. On the one hand, we can see different values for F1 - now here is a value for [a], and here are the values for [i] -so F1 and F2 are different. Whereas in [a] the first two formats are relatively close together; [i] involves a low F1 value, but a very high value for the second formant. The explanation is quite simple: the oral cavity for [i] which is responsible for the F2 value is much smaller (it’s this little bit here (11:31)) as compared with the oral cavity involved in [a]. So, this explains the low value for F2 in the vowel [a], and the high value for F2 in the vowel [i].
(11:51) Well, it is quite simple I think, and I hope you've understood it. So that's it for now.
(11:59) I hope you have now got a first idea about how physicists describe sounds in general, and how speech sounds are produced and described acoustically, and in particular how the vocal tract damps and amplifies the fundamental frequency that is created by means of vocal fold vibration.
(12:19) Thanks for your attention.
Professor Jürgen Handke, your explanation is the best! Such clarity and simplicity is not a easy thing! Now suddenly, all the books I've read become clear!
Merci beaucoup!
Danke!
Wow! The way you explain the content in detail and with visuals helps tremendously!! I actually am understanding what I am reading now. Thank you!!
Thank you for your lovely explanation Prof.
Very clear presentation, thanks.
perfect explaination!
Very useful and clear explanation. Thanks!
❤❤
more of this please
شكرا جزيلا
Thank you! This was very clear and helpful!
I guess that means "Thank you very much!" - if so, you're welcome.
Thank you so much
Thanks!!
Thank you very much, God bless you professor :)
Thank you so much 😻😻
Sir how harmonics are produced by the vocal folds at the same time
Fundamental vibration and series of harmonics
why it can not be downloaded??
like sooo