This was extremely helpful!! Between my 3 econometrics textbooks (Griffiths, Greene, and Wooldridge), the information on MA models was sparse. This really cleared up the mindset behind this model!
Thank you so much for explaining this so well! My professor and textbook explain this concept very mathematically which is hard to understand for beginners, they should really give a simple example and then dive into the details as you did.
Gemini 1.5 Pro: This video is about moving average model in time series analysis. The speaker uses a cupcake example to explain the concept. The moving average model is a statistical method used to forecast future values based on past values. It is a technique commonly used in time series analysis. The basic idea of the moving average model is to take an average of the past observations. This average is then used as the forecast for the next period. There are different variations of moving average models, and the speaker introduces the concept with moving average one (MA1) model. In the video, a grad student is used as an example. The grad student needs to bring cupcakes to a professor's dinner party every month. The number of cupcakes the grad student should bring is the forecast. The professor is known to be crazy and will tell the grad student how many cupcakes he thinks were wrong each month. This is the error term. The moving average model is used to adjust the number of cupcakes the grad student brings based on the error term from the previous month. The coefficient is a weight given to the error term. In the example, the coefficient is 0.5, meaning the grad student will adjust the number of cupcakes he brings by half of the error term from the previous month. For example, if the grad student brings 10 cupcakes in the first month, and the professor says the grad student brought 2 too many, then the grad student will bring 9 cupcakes in the second month (10 cupcakes - 0.5*2 error term). The video shows how the moving average model works through a table and graph. The speaker also mentions that there are other variations of moving average models, such as moving average two (MA2) model, which would take into account the error terms from two previous months.
Great videos, thank you! I have a question. Period 1 value is our mean value but we don't know what is mean since we just started from point 0. How to calculate residual then? We know the true observation and we don't know the mean. Is it just a guess? But when we use any statistical package it does not ask us to input guess mean value.
I still don't think this makes sense to me why is incorporating past error somehow gives us better prediction in the future in this case. Since this crazy professor will randomly choose an acceptable # of cupcakes, your past error shouldn't help in better predicting in the future.
Event though the professor selects a different number every time, at the end the average is stable. Assume you have a time series of images. Images, due to the unstable environment they're taken in or all other factors that manipulate images nature, are not always the same, although they are taken from the same scene. So, what is the goal here ?to find the mutual information in the images and ignore the noises. These noises are how crazy professor is , and the importance of error, which we can handle by its coefficient. By handling these factors, we can get close to recognising the mutual information. Remember, these are unsupervised models. There are no lable to rely on.
Thanks this is a really clear explanation. My only question is when you are calculating your f_t column, why are you including the error from the current time period? Shouldn't you only be including the 0.5*e-t-1?
Observation: 5:32 Its always centered at 10 because the errors mean was 0 (per 1:02) and error was multiplied by Φ, which will have have a mean of 0. Feeling a little awkward commenting multiple times. Just trying to understand more by thinking aloud, and that someone may correct my understanding. :) Great videos!
If a physics student is reading this, just wanna share my intution that this is exactly like a control system . whatever error our model is getting, it is moving to cover it , little bit like PI controller in Electrical engineering :) not sure if it clicks to anyone
You have amazing content, and it is very well explained. I love trading and thankfully I was introduced to this awesome trader Brian Branum. He has a trading system that is truly splendid. I enjoy good wins trading with him because he always manages my risks properly. Hence, I always receive my profits into my bitcoin wallet.
the idea is that you're trying to predict the next value. you get told what the next value is by the professor. if its random then there is no signal in there & the results are still meaningless
How is the average moving though? It was fixed for each prediction! Wouldn't it have to be recalculated each time for it to be moving? Also we didn't seem to use anything related to the error being normally distributed... is there a reason for that? why was it mentioned in the first place?
How come some MA(1) formulas have x_t = mu + (phi1) error_t + (phi2) error_t-1..... If you predicting at time t then how would you know error at time t (error_t), why are some formulas like this?
This is a great explanation but in many equation they also add the current error (epsilon_t). I just don't get how are we supposed to know our current error if we are trying to forecast a value. Do we simply neglect that current equation for forecasting?
I really like your videos. They work very well for me, someone without any background in time series. However, this one is somewhat confusing. You are demonstrating the concept of *moving average* with an example where the average stays the same. I get that the estimate moves around, but that is due to the error variance, right? The average itself is not moving anywhere. Both mu and mu_epsilon are assumed to be constant, so what's moving here?
Where does the noise in the equation come from? In our data we only have time on the x axis and Y as the target variable. There is no error term. What I mean to ask is does the MA model first regress y on y lag terms like the AR model and then calculate error between the actual and predicted y terms? Then regress y against the calculated error terms(residuals)?
The error is a white noise coming from random shocks whose distribution is iid~(0,1). Ftting the MA estimates is more complicated than it is in autoregressive models (AR models), because the lagged error terms are not observable. This means that iterative non-linear fitting procedures need to be used in place of linear least squares. Hope this helps :).
i wish my professor had explained it exactly like u just did
Thank you very much for making a vague concept so clear.
This was extremely helpful!! Between my 3 econometrics textbooks (Griffiths, Greene, and Wooldridge), the information on MA models was sparse. This really cleared up the mindset behind this model!
Thank you so much for making this fun video! Makes so much more sense now (after struggling through my not-so-crazy professor's stats class)
God Bless You! I needed a fast way to get some concepts on time series forecasting and you saved me.
Easy, Fast, Complete.
I really don't know how to thank you for that great demonstration! I've been trying to understand MA process for years!
Thank you so much for your very intelligent explanation to this model!!! i felt so confused about this model before.
Thank you Sir. You have a great way of explaining things, something I sadly rarely find from my coding/statistics teachers.
Wow! Great explanation. The professor´s example was very intuitive. Thanks for the content!
Never seen a better explanation of MA models. Immediate subscription!
Same here! I knew I would suscribe after 1 minute in the video. Very clear and very useful video. Thank you very much.
ALWAYS GRATEFUL, THANK YOU FOR THE WONDERFUL CONTENT
So simple yet easy to understand. Thank you!
Finally ❤️ a video with an applicable and relevant example ❤️🙏
I was stuck where is the “error" term coming from. Now I know... it is the error from the past. You explained! I wish you were my professor.
Oh damm!! this is wonderful, Simplified and explained pretty nicely. Keep spreading you knowledge!!
Thank you! Will do!
Simple and clear explanation, thank you !
Great explanation! I've learned everything that I looked for. Thank you.
Fantastic, got too caught up in the math in my macroeconometrics course and had no idea what these things actually were. Super helpful conceptually
Finally understood this, thank you so much. Highly recommend!
Thanks man. You're doing a suberb job.
OMG, this is brilliant , amazing ,wonderful ,thank you
Great explanation. Keep up the good work!
Brilliant explanation, thank you!
a year trying to understand this, and I ve just needed 15 minutes thx!!
Perfect explanation! Thank you!
I was terrified for the mathematical symbols, but you made it so easy to understand! thank you!
Simple Explanation is a Talent - Thanks for this
I love this video, so simple but effective
Thank you so much for explaining this so well! My professor and textbook explain this concept very mathematically which is hard to understand for beginners, they should really give a simple example and then dive into the details as you did.
Glad it helped!
Thank you very much! Such a clear explanation!
This was the best video on MA. The crazy prof made our life easier 😂😂😂
You are spectacularly GOOD in the explanation of the ARIMA! Cheers
I appreciate that!
This men's explanation is way better than those profs at University.
this is really helpful and so easy to understand!!!
Great video! Thanks for sharing!
Nice example super easy to understand the concept!
Amazing explanation man
Extremely well explained
Thanks!!! Perfect explanation :)
Thank you so much, I have been reading this concept in an Econometric book...but this is easy to comprehend
Glad it was helpful!
Greatly explain!!! Thanks
This explanation gives better understanding why do we need avoid unit root in Time Series predictions
Gemini 1.5 Pro: This video is about moving average model in time series analysis. The speaker uses a cupcake example to explain the concept.
The moving average model is a statistical method used to forecast future values based on past values. It is a technique commonly used in time series analysis.
The basic idea of the moving average model is to take an average of the past observations. This average is then used as the forecast for the next period. There are different variations of moving average models, and the speaker introduces the concept with moving average one (MA1) model.
In the video, a grad student is used as an example. The grad student needs to bring cupcakes to a professor's dinner party every month. The number of cupcakes the grad student should bring is the forecast. The professor is known to be crazy and will tell the grad student how many cupcakes he thinks were wrong each month. This is the error term.
The moving average model is used to adjust the number of cupcakes the grad student brings based on the error term from the previous month. The coefficient is a weight given to the error term. In the example, the coefficient is 0.5, meaning the grad student will adjust the number of cupcakes he brings by half of the error term from the previous month.
For example, if the grad student brings 10 cupcakes in the first month, and the professor says the grad student brought 2 too many, then the grad student will bring 9 cupcakes in the second month (10 cupcakes - 0.5*2 error term).
The video shows how the moving average model works through a table and graph. The speaker also mentions that there are other variations of moving average models, such as moving average two (MA2) model, which would take into account the error terms from two previous months.
you are just amazing
LOVE IT. Thank you.
Of course!
Great videos, thank you! I have a question. Period 1 value is our mean value but we don't know what is mean since we just started from point 0. How to calculate residual then? We know the true observation and we don't know the mean. Is it just a guess? But when we use any statistical package it does not ask us to input guess mean value.
Manyt thanks for your clear explanation of the mathematical moving average formula
of course!
Exceptionally useful videos for actuarial exams. Thanks for helping me pass🙂(hopefully)
Explained with the Cup Cakes it makes perfect sense, thumbs up!
Perfect!
Excellent explanation
God Bless you.
thanks! Really helpful
Amazing explaination
Thanks you so much.
THANK YOU SO MUCH
Great video. I think the calculation of the 3rd row is wrong. It should've been 9+0.5 = 9.5
No.. Constant term is 10 not 9
Great Presentation...
Glad you liked it!
thank you so much
Amazzzziiiiinnngggggg
Thank you❤❤❤
Thank you for the video, how should we choose the 0.5 coefficient in front of the error term from last period in the regression model?
you are too good
Hi, great explanation! One question, how do you guess the mu value (the average cupcake you bring) for the fist time?
Perfect.
THANK YOU!!
You're welcome!
Thank you. Love your video tutorials! Just one question: shouldn't the curve at 5'58'' be f_t? And c(10,9,10.5,10,11) be f_(t-1)?
Had I watched your series earlier would have saved me $3000 :(
Wonderful example.
thanks!
Let's use an example that is sligtly more natural to us -- so here's this crazy professor. :D
So not natural.. it is why you are so good in teaching
Thank you so much for making this video. I am so frustrate to understand this concept :(
sameee
thank you so so much
You're welcome!
Great video. Do you always start with the mean as your first guess for f hat? Also, how do you fit an MA(q) model?
I still don't think this makes sense to me why is incorporating past error somehow gives us better prediction in the future in this case. Since this crazy professor will randomly choose an acceptable # of cupcakes, your past error shouldn't help in better predicting in the future.
I think the student naively believes the crazy professor will stick to his prior t-1 position (the student is unaware of the professor's craziness)
Everything in time series assumes that you can use past info to predict future info
Event though the professor selects a different number every time, at the end the average is stable. Assume you have a time series of images. Images, due to the unstable environment they're taken in or all other factors that manipulate images nature, are not always the same, although they are taken from the same scene. So, what is the goal here ?to find the mutual information in the images and ignore the noises. These noises are how crazy professor is , and the importance of error, which we can handle by its coefficient. By handling these factors, we can get close to recognising the mutual information. Remember, these are unsupervised models. There are no lable to rely on.
Thanks this is a really clear explanation. My only question is when you are calculating your f_t column, why are you including the error from the current time period? Shouldn't you only be including the 0.5*e-t-1?
Observation: 5:32 Its always centered at 10 because the errors mean was 0 (per 1:02) and error was multiplied by Φ, which will have have a mean of 0.
Feeling a little awkward commenting multiple times. Just trying to understand more by thinking aloud, and that someone may correct my understanding. :)
Great videos!
I saw the same thing, think it was just his mistake in calculation
Hey amazing Content Bravo !
Can you add to that a video talking about random walk ?
That would be great .
Awesome
how is it possible you can explain this stuff so easily!
If a physics student is reading this, just wanna share my intution that this is exactly like a control system . whatever error our model is getting, it is moving to cover it , little bit like PI controller in Electrical engineering :) not sure if it clicks to anyone
THANK you
You're welcome!
God-like!
Thanks 🙏
You have amazing content, and it is very well explained. I love trading and thankfully I was introduced to this awesome trader Brian Branum. He has a trading system that is truly splendid. I enjoy good wins trading with him because he always manages my risks properly. Hence, I always receive my profits into my bitcoin wallet.
Trading without a broker or even a mentor is among the surest ways to lose everything in your trading account.
I would like to know this guy. At the very least, speak with him personally.
If you can get me a link to speak with him, I'd greatly appreciate it.
This trading strategy helps to manage losses/trades better.
I am certain you will have a positive learning and trading experience with him
Well explained ❤
Thank you 🙂
Thanks goddd
cool !!!
How do we know what the "error" is there is if there is no "true value" given a random realization of data.
the idea is that you're trying to predict the next value. you get told what the next value is by the professor. if its random then there is no signal in there & the results are still meaningless
Really good explaination!
Maybe I'm stupid for asking this...
If one was to write an MA filter, how do you determine M?
How is the average moving though? It was fixed for each prediction! Wouldn't it have to be recalculated each time for it to be moving?
Also we didn't seem to use anything related to the error being normally distributed... is there a reason for that? why was it mentioned in the first place?
Exactly right, I am also having same query, Average not moving
Did you get any other source where this explained clearly
How come some MA(1) formulas have x_t = mu + (phi1) error_t + (phi2) error_t-1..... If you predicting at time t then how would you know error at time t (error_t), why are some formulas like this?
What does it mean when the MA(1) estimated parameter = 1? For AR(1) that would mean there's a unit root. Any particular corollary for MA models?
Does MA model assume et (lagged residuals) are pure white noise ? Mean =0, constant variance , and no autocorrelation of residuals ?
This is a great explanation but in many equation they also add the current error (epsilon_t). I just don't get how are we supposed to know our current error if we are trying to forecast a value. Do we simply neglect that current equation for forecasting?
Hi. The mean of et is not 0. For time interval 5, you need to write -1.
You can see how the crazy professor gets hungrier month by month
I really like your videos. They work very well for me, someone without any background in time series. However, this one is somewhat confusing. You are demonstrating the concept of *moving average* with an example where the average stays the same. I get that the estimate moves around, but that is due to the error variance, right? The average itself is not moving anywhere. Both mu and mu_epsilon are assumed to be constant, so what's moving here?
Where does the noise in the equation come from? In our data we only have time on the x axis and Y as the target variable. There is no error term. What I mean to ask is does the MA model first regress y on y lag terms like the AR model and then calculate error between the actual and predicted y terms? Then regress y against the calculated error terms(residuals)?
The error is a white noise coming from random shocks whose distribution is iid~(0,1). Ftting the MA estimates is more complicated than it is in autoregressive models (AR models), because the lagged error terms are not observable. This means that iterative non-linear fitting procedures need to be used in place of linear least squares. Hope this helps :).
Sir please make videos on restricted Boltzmann machine