Demystifying the Fourier Transform: The Intuition

Valerio Velardo - The Sound of AI

41K views

Published: 30 Jul 2024
I explain how the Fourier Transform works. I avoid getting into the mathematical intricacies (for now!). Instead, I focus on the intuition using a visual approach. The Fourier Transform is a fundamental tool in audio signal processing: it extracts information from audio data by transforming a signal from the time domain to the frequency domain.
Slides:
github.com/musikalkemist/Audi...
Code:
github.com/musikalkemist/Audi...
Join The Sound Of AI Slack community:
valeriovelardo.com/the-sound-...
Interested in hiring me as a consultant/freelancer?
valeriovelardo.com/
Follow Valerio on Facebook:
/ thesoundofai
Connect with Valerio on Linkedin:
/ valeriovelardo
Follow Valerio on Twitter:
/ musikalkemist
Science

Comments • 85

@gildorodrigues4108 3 years ago ⁺³⁵
This is the best Fourier Transform class I've ever watched!!
Thank you very much Valerio!
@ValerioVelardoTheSoundofAI 3 years ago
Thanks Gildo!
@EmmiFitz 1 month ago
These videos are helping me so much in understanding audio processing! Thank you so much! This has so much value to me.
@DZTech-tz1pe 3 months ago
Two years ago, one year before my graduation as a computer science engineer, I spent a semester trying to understand the FFT from lectures. No surprise, it didn't work, but now it all makes perfect sense. Thank you for all your effort.
@ValerioVelardoTheSoundofAI 3 months ago
Glad I could help :)
@user-ni2fo1uh2l 1 year ago
What a beautiful class! You are doing an amazing job, Valerio! I am grateful, and thanks a lot for all the effort you put into this lovely lecture series.
@siddhantgupta-voiceofai3956 4 years ago
I really appreciate the effort you must have put in, Valerio. Great content! Keep up the good work. Your channel is one of the best on audio machine learning out there.
@ValerioVelardoTheSoundofAI 4 years ago
Thanks Siddhant!
@phunghm 1 year ago
Love all the valuable videos you've made. Amazing, applicable insights.
@JawadQ1 3 years ago
Your explanations are great. This is one of my favorite channels.
@ValerioVelardoTheSoundofAI 3 years ago
Thanks a lot JawadQ1 :)
@ThomasBallatore 3 years ago ⁺⁴
I just binge-watched the whole series over New Year's and I'm just blown away: incredible content and presentation. Thanks Valerio! All the videos are great, but I found this one on the Fourier Transform just magical. Keep up the excellent work... it is very appreciated 🎉
@ValerioVelardoTheSoundofAI 3 years ago ⁺¹
Thanks a lot Thomas!
@patrick.portes 1 year ago
From Brazil. You're the man in audio processing knowledge! Thanks a lot!
@Erosis 3 years ago ⁺³
Simplest and most intuitive explanation of FFT I've seen. Fantastic job!
@ValerioVelardoTheSoundofAI 3 years ago
Thank you Austin!
@sathyanarayananvittal7832 6 months ago
Very clear video for this tough concept. Thank you so much for this. Every book I picked up for deep learning covered only images, so audio was magic to me until I watched your videos.
@mirfalltleiderkeinnameein7223 2 years ago ⁺¹
I am really impressed by how well you explained it. This helps to put concepts and intuitions from different sources together, especially since you introduced the FT as a maximization problem.
@ValerioVelardoTheSoundofAI 2 years ago
Thanks!
@venkatesanr9455 4 years ago
Awesome explanation, and eagerly waiting for your next video as usual. Cheers!
@ValerioVelardoTheSoundofAI 4 years ago ⁺¹
Thank you Venkatesan! Next video will be out on Thursday, as usual ;)
@shubham6867 3 years ago ⁺³
I must say I really love the way you teach; it helps me work better on my audio recognition project.
@ValerioVelardoTheSoundofAI 3 years ago ⁺¹
Thank you Shubham!
@akch96 1 year ago
I always found the FT very tricky, but you explained it really well. Thank you!
@davemoore5858 8 months ago
Where were you when I was at radio school a million years ago??
Great tutorial!! Thanks
@kewtomrao 3 years ago
Just one word....INCREDIBLE!
@lfg12 3 months ago
Awesome job - thanks so much for putting this together.
@MathStatsMe 22 days ago
Excellent video. Thank you for the explanation.
I do have a concern about your sine wave formula; maybe somebody has already mentioned this, or maybe you talked about it and I missed it. If you want phi to equal the phase shift in your sine function, the horizontal scale factor (2pi•f) must appear as a coefficient on the binomial (t - phi). Horizontal dilation is applied before horizontal translation (otherwise the dilation changes the translation), and transformations written next to the variable appear in the opposite order (the last transformation sits closest to the variable). The way you have written your sine function here, the true phase shift would be phi/(2pi•f).
Contrary to most physics texts, I find it more straightforward mathematically to let phi represent the true phase shift and write the argument of the sine function as 2pi•f(t - phi). Some physics and math texts do this, but not many. Having taught college algebra many times, I think this format is more consistent with the logic of transformations of parent functions, and therefore more conducive to a deeper understanding of the geometry of the sine wave. But, to each their own, I suppose.
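The point about the two conventions can be checked numerically. The sketch below assumes the sine is written as sin(2pi•f•t - phi), which is how the comment above reads it; the frequency, phase value, and variable names are made up for illustration and are not taken from the video's code.

```python
import numpy as np

f = 3.0                      # assumed frequency in Hz
phi = 0.8                    # assumed phase offset in radians
t = np.linspace(0, 1, 2000)  # one second of time stamps

# Convention A: phi is subtracted from the whole argument
wave_a = np.sin(2 * np.pi * f * t - phi)

# Convention B: the shift sits inside the bracket with t, so it is a time in seconds
time_shift = phi / (2 * np.pi * f)
wave_b = np.sin(2 * np.pi * f * (t - time_shift))

# Both expressions describe the same curve
same_curve = np.allclose(wave_a, wave_b)
```

Under convention A the curve moves to the right by phi/(2pi•f) seconds, so phi is an angle rather than a time; under convention B the number inside the bracket is the shift in seconds.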
@evrenbingol7785 3 years ago
Man, I have read so many FT explanations. I know how the "Fast" part is based on log and the idea of recursion and the matrix reduction and all that, to use in Mel frequencies and so on, but I have never seen an explanation as good as this.
This makes complete sense. This is awesome; instead of AI/ML, it almost makes me want to go back to college and do a PhD in math (maybe I am pushing it).
@ValerioVelardoTheSoundofAI 3 years ago
Thank you!
@scottk5083 3 years ago
Thank you for this! Your content is amazing
@rohanyadav8762 3 years ago ⁺¹
Thanks Valerio, finally the Fourier transform makes sense to me.
@ValerioVelardoTheSoundofAI 3 years ago
Nice!
@vidyagopal3431 2 years ago
No words. Well articulated.
@mohammadrezapourtorkan8595 2 years ago
This video is fantastic! Thanks a lot.
@doyourealise 2 years ago
Amazing video, sir. Subscribed!
@benjamingonzalez9216 2 years ago
You rock, man. Thank you so much!
@pascalstirtztel2448 2 years ago
Great work. Thank you very much.
@UtkalSinha 1 month ago
Thank you so much !!
@kabeerjaffri4015 2 years ago
Great video
@fujinafiul6044 2 years ago
Loved it
@LeeMyeongjae 3 years ago
Thank you!!
@user-co6pu8zv3v 1 year ago
Thank you :)
@i_am-ki_m 2 years ago
Interesting way to interpret the classical FT view (25:05)!
@emanalsuradi4969 6 months ago
Hi Valerio,
This playlist is AWESOME! Thanks for your efforts. I do have a question about the code. When we zoomed in on the waveform with
plt.plot(t[10000:10400], signal[10000:10400])
why did you choose sample number 10000 as the starting point?
@jefflawrence243 1 year ago
Excellent!
@amitbenhur3722 2 years ago
Could someone please explain why Valerio defined frequency = np.linspace(0, sr, len(magnitude))?
Why do the values run up to the sampling rate, and why is the number of steps the length of the magnitude array?
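One way to look at the question above: an FFT of N samples returns N bins, and bin k corresponds to the frequency k * sr / N, so the bins run from 0 up to just below the sampling rate. The sketch below is only an illustration under assumed values (a 1000 Hz sampling rate and a 50 Hz test tone, neither taken from the video); it compares the linspace axis from the comment with the exact bin frequencies.

```python
import numpy as np

sr = 1000                             # assumed sampling rate in Hz
t = np.arange(sr) / sr                # one second of sample times
signal = np.sin(2 * np.pi * 50 * t)   # hypothetical 50 Hz test tone

magnitude = np.abs(np.fft.fft(signal))
N = len(magnitude)

# Axis from the comment: N points from 0 to sr, endpoint included
frequency = np.linspace(0, sr, N)

# Exact bin frequencies: bin k sits at k * sr / N, so the last bin is sr - sr/N
exact = np.arange(N) * sr / N

# Strongest bin in the first half of the spectrum
peak_bin = int(np.argmax(magnitude[: N // 2]))
largest_gap = np.max(np.abs(frequency - exact))
```

Here the peak lands in bin 50, which is exactly 50 Hz on the `exact` axis. The linspace axis differs from it by at most one bin width (sr / N), which is why it works as a plotting axis; passing `endpoint=False` to `np.linspace` would reproduce the exact bins.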
@arunasjunevicius533 2 years ago
You rock, man. I've been coding and reading about FFTs here and there, but never understood the basic principle. It does raise a question, though: how do animals process complex sounds? Does it matter that we have two ears? Do we have an internal FFT? ;)
@shaunakranade9517 1 year ago
Thanks for the amazing video. I just had a question about the part of the Python code where you explained setting the range of frequencies for the FT. You stated that the range of frequencies on the x axis is between 0 and the sampling rate. Shouldn't it be between 0 and the Nyquist frequency? It would be great if anyone else could clear up this minor question as well. Thanks!
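Both views in the question above are consistent for a real-valued signal: the full FFT has bins from 0 up to just below the sampling rate, but everything above the Nyquist frequency (sr / 2) mirrors what lies below it. A minimal sketch, assuming a 1000 Hz sampling rate and a made-up two-tone signal:

```python
import numpy as np

sr = 1000                                  # assumed sampling rate in Hz
t = np.arange(sr) / sr
# Hypothetical test signal: a 50 Hz tone plus a quieter 120 Hz tone
signal = np.sin(2 * np.pi * 50 * t) + 0.5 * np.sin(2 * np.pi * 120 * t)

magnitude = np.abs(np.fft.fft(signal))
N = len(magnitude)

# For a real signal, bin k and bin N - k carry the same magnitude
mirrored = np.allclose(magnitude[1:], magnitude[1:][::-1])

# rfft keeps only the non-redundant bins, from 0 Hz up to the Nyquist frequency
half = np.abs(np.fft.rfft(signal))
half_freqs = np.fft.rfftfreq(N, d=1 / sr)
```

Plotting `half` against `half_freqs` shows the same information as the full plot, with the x axis stopping at sr / 2.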
@Watake125 3 years ago ⁺¹
Hello, incredible series, very clear explanations so far.
I just have a question, though maybe I will find an answer in future videos.
I understand that we can go from the frequency domain back to the time domain. But don't we lose the time information?
I mean, I understand that we can easily decompose and recreate a short sound, but is that also the case for a long audio clip?
Since we have no time information, will the output of the inverse FT be just one constant signal, with no silence between two separate notes?
Thanks in advance for your help :)
@AlexAlex-fo9gt 2 years ago
I agree, I have the same question. If a pianist plays a musical scale, we get a spectrum of sounds. But if we reproduce the sound from this spectrum, we get the 7 notes sounding simultaneously, which is a chord, not a scale.
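A small experiment may help with the two questions above. The magnitude spectrum alone does not say when each note sounds, but the complex FFT output also carries a phase for every frequency, and the inverse FFT uses both. The sketch below uses two made-up notes (440 Hz and 660 Hz) played in either order; the sampling rate and all names are assumptions for illustration.

```python
import numpy as np

sr = 8000                                  # assumed sampling rate in Hz
t = np.arange(sr // 2) / sr                # half a second per note
note_a = np.sin(2 * np.pi * 440 * t)       # hypothetical first note
note_b = np.sin(2 * np.pi * 660 * t)       # hypothetical second note

melody_ab = np.concatenate([note_a, note_b])   # A, then B
melody_ba = np.concatenate([note_b, note_a])   # B, then A

ft_ab = np.fft.fft(melody_ab)
ft_ba = np.fft.fft(melody_ba)

# The two orderings share the same magnitude spectrum...
same_magnitude = np.allclose(np.abs(ft_ab), np.abs(ft_ba))
# ...but their phases differ, and that is where the ordering is encoded
same_phase = np.allclose(np.angle(ft_ab), np.angle(ft_ba))

# The inverse FFT of the full complex spectrum restores the original order
rebuilt = np.fft.ifft(ft_ab).real
```

So a reconstruction from magnitude and phase together gives back the scale, note after note. Only a reconstruction that throws the phase away would lose the ordering.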
@davemystic1485 2 years ago
When we apply the Fourier transform to a sine wave with a certain frequency and phase, can we get a negative area? If yes, how is it represented in the frequency spectrum?
@diamondcutterandf598 1 year ago
Is it possible for the magnitude of the Fourier transform to be negative?
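As a rough check of the two questions above: the magnitude is the absolute value of a complex number, so it cannot be negative. Flipping the sign of a sine wave shows up as a phase change of pi radians instead. A minimal sketch with assumed values (a 5 Hz tone sampled at 1000 Hz):

```python
import numpy as np

sr = 1000                              # assumed sampling rate in Hz
t = np.arange(sr) / sr
wave = np.sin(2 * np.pi * 5 * t)       # hypothetical 5 Hz tone
flipped = -wave                        # same tone, turned upside down

ft_wave = np.fft.fft(wave)
ft_flipped = np.fft.fft(flipped)

k = 5                                  # bin that holds the 5 Hz component
mag_wave = np.abs(ft_wave[k])
mag_flipped = np.abs(ft_flipped[k])

# Phase difference between the two tones at that bin, in radians
phase_gap = np.abs(np.angle(ft_flipped[k] / ft_wave[k]))
```

Both magnitudes come out equal, and the phase gap is pi: the sign flip is recorded in the phase, not as a negative magnitude.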
@shandi1241 1 year ago
9:26 Shouldn't the X axis be half the length of the signal? The FFT is mirrored about its midpoint, so the right half of the plot is redundant.
@atharvbhagya4317 1 year ago
Hello sir, what is the relative phase, and how do you obtain it when doing the inverse FFT? You only calculated the magnitude and frequency when doing the FFT.
Is the relative phase the optimal phase that maximises the area you calculated when doing the FFT? Do we store the phase somewhere, for example in an array, and use it for the inverse FFT later?
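In NumPy, at least, the phase does not need to be stored separately: `np.fft.fft` returns one complex number per frequency bin, and `np.abs` and `np.angle` split it into magnitude and phase. The sketch below uses an assumed 5 Hz sine with a made-up phase offset `theta`; it is an illustration, not the video's code.

```python
import numpy as np

sr = 1000                                    # assumed sampling rate in Hz
t = np.arange(sr) / sr
theta = 0.3                                  # hypothetical phase offset in radians
signal = np.sin(2 * np.pi * 5 * t - theta)

ft = np.fft.fft(signal)
magnitude = np.abs(ft)     # one magnitude per frequency bin
phase = np.angle(ft)       # one phase per frequency bin, in radians

# Magnitude and phase together rebuild the complex spectrum, and so the signal
rebuilt = np.fft.ifft(magnitude * np.exp(1j * phase)).real

# With every phase set to zero, the inverse FFT gives a different waveform
zero_phase = np.fft.ifft(magnitude).real
```

One caveat on conventions: NumPy measures phase relative to a cosine, so for this sine the value in bin 5 is -theta - pi/2 rather than theta itself.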
@Waffano 1 year ago
Just want to make sure my understanding is correct: at 30:21, we can basically drop the "max" part of the d_f function if we just plug in the optimal phase value, right?
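The reading in the comment above can be tried out with a brute-force search. The sketch assumes that d_f is the largest area under the product of the signal and a sinusoid of frequency f, taken over all phases, and that the phase is expressed as a fraction of a cycle, as other comments here describe. The test signal, the `area` helper, and the phase grid are hypothetical.

```python
import numpy as np

sr = 1000                                          # assumed sampling rate in Hz
t = np.arange(sr) / sr
signal = 0.8 * np.sin(2 * np.pi * (5 * t - 0.2))   # hypothetical test signal
f = 5                                              # frequency being tested, in Hz

def area(phase):
    """Approximate the integral of signal(t) * sin(2*pi*(f*t - phase)) over t."""
    return np.sum(signal * np.sin(2 * np.pi * (f * t - phase))) / sr

# Try 1000 candidate phases between 0 and 1 (fractions of a cycle)
phases = np.linspace(0, 1, 1000, endpoint=False)
areas = np.array([area(p) for p in phases])

best_phase = phases[np.argmax(areas)]   # the "arg max": the optimal phase
d_f = areas.max()                       # the "max": the magnitude
d_f_plugged = area(best_phase)          # area evaluated at the optimal phase

# For comparison: the FFT magnitude at the same frequency, scaled by 1/sr
fft_magnitude = np.abs(np.fft.fft(signal)[f]) / sr
```

Under these assumptions `d_f` and `d_f_plugged` are the same number, so once the optimal phase is known the max can be replaced by evaluating the area at that phase. The value also agrees with the scaled FFT magnitude.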
@bellafrade 4 years ago
Thank you
@ValerioVelardoTheSoundofAI 4 years ago
You're welcome :)
@Nnemka 9 months ago
My signals look very different once I started creating the sine wave, which also affected the phase value. Is this normal?
I didn't really get a smooth correlation as stated.
@anikbasubhaumik6006 1 year ago
Can you please suggest a book on audio processing for absolute beginners? I want to learn audio processing for ML applications.
@Waffano 2 years ago
Can someone tell me why, at 14:55, we multiply the sine wave function by 0.5? It seems to control the max and min amplitude of the resulting sine wave. Is there a reason why this is 0.5, or can it just be set to 1?
@KingQuetzal 1 year ago
0.5 is the phase of the sine wave as a fraction of a cycle, where 2π = 1, π = 0.5, and 0 = 0.
@Waffano 1 year ago
@@KingQuetzal Thanks! But what is the phase variable for, then? :)
@KingQuetzal 1 year ago
@@Waffano Weird, I must have looked at the wrong time. Yeah, 0.5 is the amplitude multiplier. A normal sine wave goes from -1 to +1, while the one generated at 15:28 goes from -0.5 to +0.5.
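The difference between the amplitude multiplier and the phase can be seen in a few lines. This sketch assumes the sine is generated as sin(2π(f·t - phase)), with the phase given as a fraction of a cycle; the frequency and phase values are made up.

```python
import numpy as np

sr = 1000                     # assumed sampling rate in Hz
t = np.arange(sr) / sr
f = 2                         # assumed frequency in Hz
phase = 0.25                  # assumed phase, as a fraction of one cycle

# The phase only slides the wave along the time axis
unit_wave = np.sin(2 * np.pi * (f * t - phase))   # swings between -1 and +1

# The multiplier only scales the height of the wave
half_wave = 0.5 * unit_wave                       # swings between -0.5 and +0.5
```

Setting the multiplier to 1 simply gives the full-height wave; the value 0.5 looks like a plotting choice rather than something the transform requires.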
@bangladeshisingaporevlog9273 3 years ago
I am waiting for the next video.
@ValerioVelardoTheSoundofAI 3 years ago ⁺¹
I've already published a few in the series since this one!
@bangladeshisingaporevlog9273 3 years ago
@@ValerioVelardoTheSoundofAI I watched all the previous videos.
@ValerioVelardoTheSoundofAI 3 years ago ⁺¹
@@bangladeshisingaporevlog9273 What I meant is that this isn't the last one! There are new ones already...
@user-ry2lw7uh2p 2 years ago
Hi, what is the unit of the magnitude?
@pranaykothari9870 1 year ago
4:12 After the Fourier transform, the X axis is frequency, but what is the Y axis?
@Waffano 1 year ago
It's the magnitude. In other words: the area under the curve you get by combining s(t) with the sinusoid at a specific frequency.
@MohanKumar-gj9th 3 years ago
The notebook is not working on Google Colab?
@ValerioVelardoTheSoundofAI 3 years ago
Not sure what's not working, but I haven't tried this in Colab.
@chaimarebah115 2 years ago
I hope you start a series on just Python, please.
@zsun0188 3 years ago
It's annoying that only the left side of my headphones is working. Is it just me?
@DiogoCocharro 3 years ago
28:30 What is 'd' in the equation? Great video by the way!
@DiogoCocharro 3 years ago
Sorry, the answer came a few seconds later... magnitude. :P
@ProfSoft 2 years ago
34:50
Very, very good example, but don't do it again please, professor :)
@sudheerv6941 3 years ago
Please add a 144p video quality option.