Great Video! However, there appear to be a number of missing steps. Maybe that was in a prior video but you never said so. For example, where did the transition matrix come from? You mention very briefly about from past data, but never connected what that means or how it’s done. Similar for rest of the process. Maybe a follow up “show me” video would be in order.
I don’t think this video is correct like at all since a hidden markov works with 2 matrix’s, working back and forth to calculate the probability of a observable sequence given the hidden sequence And he only has 1 matrix
Really great content! If you don’t mind, may be there is another way to use ML and NN in financial market: you can not only predict future returns and prices, you can also predict how different factors affect price fluctuations right at the moment?
@ritvikmath I have a trading system that I'm currently adding hidden markov models to. Any chance I can run some ideas by you? It has to do with programmatically detecting which price distribution regime changes using a hidden markov model.
Maybe I've missed something: what exactly is the latent variable you're trying to approximate? If, as you've described, the sentiment values of positive, negative, and neutral correspond to distinct normal distributions with respective means, then this is simply a linear model. It would be hidden if the three sentiment values corresponded to the three, respective distributions and the latent variable was the stock price.
can you do a notebook demo on how to fit hmm and do inference on out-of-sample data? maybe include these - fit returns on other distribution other than gaussian (single, multi-variate case) - set initial state transition matrix ; like higher value to diagonal entries (since regimes rarely changes) - show us how the input(length of observation) might change the inference value
Can you explain how you traded based off this model specifically? Did you predict sentiment for the next day and sell/buy/hold based on that for a day with a rolling window of 1 ?
basically the model outputs hidden states as integers {0,1,2}, each with a corresponding mean for its normal distribution. We map the integers to sentiments via ranking the means.
How did you arrive at the transition probabilities for sentiment? Like you said, isnt it hidden? How about the distributions? Would be really helpfull to get the code for these videos. This is cool, but we dont even know how the RNN was implemented.
So once the HMM is fitted we have 3 hidden states encoded as 0, 1, 2 in the code. We associate the one with the smallest mean as negative sentiment, etc.
How did you train this? How does it map to sentiment if you are just putting Hidden units to 3? As an aside, I think you should start adding jupyter notebooks to your videos like Karpathy
Are you considering commission fees? In my experience that makes the return hard to go to positive at all because the fees make it a negative sum game and so with a negative bias which gives loss on long term with high probability.
@@ritvikmath Thank you. I recommend you to do a statistical T test on your profit return values (negative and positive values over time) to see if its mean is significantly different from zero to the positive direction. If so, then you know your model has some real power. This helps to see out of the fog of random noise. Also the non normality of the distribution of profits must be corrected to get a better picture. Since the T test converges to be Mean / SE as the number of values grows, you can use this as a simple metric, but I recommend you to use the absolute cubic differences instead of the squared one when calculating SE, this can account for the non normality in a simpler way. I can derive this for you if interested. Keep up the good work. Cheers.
What about changing the time scale of your model. Currently your "delta t" is 1 day, but what about e.g. the sentiment over the course of a week/ month/ year
Pretty interesting. If you build a transition matrix between the states rising trend - neutral - falling trend, then the probabilities of transitions will be very different from the proposed matrix of transitions between sentiments. The probability of observing a growing trend tomorrow, if we saw a growing trend today, will be about 78%, if it was neutral, then it will remain neutral about 60%, if there was a falling trend, then the falling trend will disappear about 81%. The only problem is with the cost of transition (for example, from a growing market to a neutral market). It can be so high that the mathimatical expectation becomes close to zero, despite such remarkable probabilities.
Do you really stock trade? I think these 1k experiments are just paper experiments, correct? Do you have any specific view on the trading? If you do, what kind of prediction technique would you use considering all possible variants?
These are actual experiments and I do trade. Of course you can argue why not just run them as paper trading and that’s a totally valid point. It just personally makes me more invested (no pun intended) if I put down real money
Interested concept. How did you estimate transition (and emission) probabilities? Obviously the most likely sequence of hidden states could be inferred with the viterbi algorithm (covered in one of your other videos). I'm assuming you used the baum welch algorithm to estimate state transition probabilities (but this was not explained).
Is it possible to predict volatility of an asset using Markov model? As volatility is mean reverting, so should the return distribution curves be narrower?
Maybe you saw some article of guy who says machine learning cannot predict finances. All what we can do is optimize models like Fama and French model to find more important factors affecting stocks prices. Or for example use machine learning to optimize weights of given assets in the portfolio. What do you think? Article with name "Why machine learning can’t predict the prices of stocks or cryptocurrencies: the limits to applying machine learning in empirical asset pricing". Its easy to find I dont want to advertise it. Just wanted to know your thoughts on the subject
The Fama and French model is essentially a straightforward application of Ordinary Least Squares (OLS) on factors to analyze a portfolio of asset returns. Since OLS is a machine learning algorithm, I fail to see your point. Perhaps you're skeptical about complex models with too many parameters, suggesting that in essence, the parameters cannot be accurately estimated?
@@cv462-l4x "All what we can do is optimize models like ..." isnt this your claim? article you suggested says the opposite "Traditional empirical asset pricing models like CAPM and Fama-French have limitations in accurately capturing systematic risks and explaining stock returns. These models may be over-parameterized, mis-specified, or unable to capture non-linear relationships among factors."
It’s true, you cannot “predict stock prices”. Stock prices in an efficient market are semi-martingales: it’s mostly random walk. Why is it random? Because everything that can be predicted has already been predicted and priced into the asset. The price of all assets at any given time already incorporates all available information. That’s why no firm on wall street tries to “predict future prices”. They either sell financial services to companies, provide a market, or other services (or in case of HFT try to arbitrage millisecond differences in prices between markets). This is the danger of data science/machine learning without domain expertise. Or even a basic understanding of the domain.
@@cumibakar10Right, if I'm not mistaken what quants do at most is manage risk in portfolios by trying to accurately model the components of some commodity rather than predicting any specific future state. If the market is efficient then you can't beat anyone and if it isn't, you aren't the one to find out either.
Great video - thanks for sharing. I think it would be great to see what the average pct. change of the SP500 was (as a whole, not as an index) to check for general market movement bias. Great to see the comparison of the RNN and random approach. Also I never knew a coffee in the States was 8$!! Outrageous
0.02% return in a day is 7.5% annual return. Pretty much market index 0.2% daily is a 107% annual return on your investment (you money doubles every year), that's something. 0.4% is something like 330% return per year. There's no way these returns are consistent.
Great explanation. It would be interesting to follow the predictions of the models on a regular basis just to see what happens when the market shifts. Ie, does the model predict that... or more than likely, can not make such predictions. Thanks for the video.
Great Video! However, there appear to be a number of missing steps. Maybe that was in a prior video but you never said so. For example, where did the transition matrix come from? You mention very briefly about from past data, but never connected what that means or how it’s done. Similar for rest of the process. Maybe a follow up “show me” video would be in order.
up !
The matrix came from data from march 2023 to march 2024, 8:30
I don’t think this video is correct like at all since a hidden markov works with 2 matrix’s, working back and forth to calculate the probability of a observable sequence given the hidden sequence
And he only has 1 matrix
Awesome series, keep up the good work!
Thanks, will do!
Really great content! If you don’t mind, may be there is another way to use ML and NN in financial market: you can not only predict future returns and prices, you can also predict how different factors affect price fluctuations right at the moment?
@ritvikmath I have a trading system that I'm currently adding hidden markov models to. Any chance I can run some ideas by you? It has to do with programmatically detecting which price distribution regime changes using a hidden markov model.
Maybe I've missed something: what exactly is the latent variable you're trying to approximate? If, as you've described, the sentiment values of positive, negative, and neutral correspond to distinct normal distributions with respective means, then this is simply a linear model. It would be hidden if the three sentiment values corresponded to the three, respective distributions and the latent variable was the stock price.
Is the code for this in your RUclips repo on github?
No
can you do a notebook demo on how to fit hmm and do inference on out-of-sample data?
maybe include these
- fit returns on other distribution other than gaussian (single, multi-variate case)
- set initial state transition matrix ; like higher value to diagonal entries (since regimes rarely changes)
- show us how the input(length of observation) might change the inference value
Good suggestion!
I agree with him, a practical tutorial would be very useful!
We love your videos dude!
Love you all 🙏
Thanks Ritvik! Great content as usual
My pleasure!
Can you explain how you traded based off this model specifically? Did you predict sentiment for the next day and sell/buy/hold based on that for a day with a rolling window of 1 ?
Yes exactly
Intriguing Content as usual! Btw, how did you come you with the sentiment?
basically the model outputs hidden states as integers {0,1,2}, each with a corresponding mean for its normal distribution. We map the integers to sentiments via ranking the means.
How did you arrive at the transition probabilities for sentiment? Like you said, isnt it hidden?
How about the distributions?
Would be really helpfull to get the code for these videos. This is cool, but we dont even know how the RNN was implemented.
So once the HMM is fitted we have 3 hidden states encoded as 0, 1, 2 in the code. We associate the one with the smallest mean as negative sentiment, etc.
@@ritvikmath Thank you. That is for the distributions i believe.
But how about coming up with the sentiment transition probabilities?
@@rexlaurus5894I would imagine you could just do (constrained) gradient descent on the training data?
How did you train this? How does it map to sentiment if you are just putting Hidden units to 3? As an aside, I think you should start adding jupyter notebooks to your videos like Karpathy
The model outputs numerical hidden states of 0,1,2 and mu,sigma for each. We map the numerical hidden states to sentiments by ascending mu
"ascending mu?" do you have any sentiment data to validate this on? @@ritvikmath
Are you considering commission fees? In my experience that makes the return hard to go to positive at all because the fees make it a negative sum game and so with a negative bias which gives loss on long term with high probability.
Yes they are included
@@ritvikmath Thank you. I recommend you to do a statistical T test on your profit return values (negative and positive values over time) to see if its mean is significantly different from zero to the positive direction. If so, then you know your model has some real power. This helps to see out of the fog of random noise. Also the non normality of the distribution of profits must be corrected to get a better picture. Since the T test converges to be Mean / SE as the number of values grows, you can use this as a simple metric, but I recommend you to use the absolute cubic differences instead of the squared one when calculating SE, this can account for the non normality in a simpler way. I can derive this for you if interested. Keep up the good work. Cheers.
@@ritvikmath Amazing, I thought this is a gross return purely from the price diference!
What about changing the time scale of your model. Currently your "delta t" is 1 day, but what about e.g. the sentiment over the course of a week/ month/ year
Cool suggestion!
Pretty interesting. If you build a transition matrix between the states rising trend - neutral - falling trend, then the probabilities of transitions will be very different from the proposed matrix of transitions between sentiments. The probability of observing a growing trend tomorrow, if we saw a growing trend today, will be about 78%, if it was neutral, then it will remain neutral about 60%, if there was a falling trend, then the falling trend will disappear about 81%. The only problem is with the cost of transition (for example, from a growing market to a neutral market). It can be so high that the mathimatical expectation becomes close to zero, despite such remarkable probabilities.
Do you really stock trade? I think these 1k experiments are just paper experiments, correct? Do you have any specific view on the trading? If you do, what kind of prediction technique would you use considering all possible variants?
These are actual experiments and I do trade. Of course you can argue why not just run them as paper trading and that’s a totally valid point. It just personally makes me more invested (no pun intended) if I put down real money
@@ritvikmath And are the fees calculated in these returns?
Yes any fees are included in the final return calculation
Interested concept. How did you estimate transition (and emission) probabilities? Obviously the most likely sequence of hidden states could be inferred with the viterbi algorithm (covered in one of your other videos). I'm assuming you used the baum welch algorithm to estimate state transition probabilities (but this was not explained).
Awesome! How did you derive the values in the transition matrix? Did you just measure the probabilities over a given timeframe?
Very nice! Do you calculate the HMM states and probabilities on the price data or the price return data?
Where did the 0.4% for RNN come from suddenly?
Great video . Can you do some Multivariate analysis data series with different business case scenarios
can you run a different test on this strategy: buy 5 stock and short 5 stocks ($500 long + $500 short), allocated by historical volatility?
Interesting idea!
Is it possible to predict volatility of an asset using Markov model? As volatility is mean reverting, so should the return distribution curves be narrower?
So, how is it different than technical analysis
what did S&P500 return on that day?
Maybe you saw some article of guy who says machine learning cannot predict finances. All what we can do is optimize models like Fama and French model to find more important factors affecting stocks prices. Or for example use machine learning to optimize weights of given assets in the portfolio. What do you think? Article with name "Why machine learning can’t predict the prices of stocks or cryptocurrencies: the limits to applying machine learning in empirical asset pricing". Its easy to find I dont want to advertise it. Just wanted to know your thoughts on the subject
The Fama and French model is essentially a straightforward application of Ordinary Least Squares (OLS) on factors to analyze a portfolio of asset returns. Since OLS is a machine learning algorithm, I fail to see your point. Perhaps you're skeptical about complex models with too many parameters, suggesting that in essence, the parameters cannot be accurately estimated?
@poisonza you can google the article I mentioned. Its big enough. That's not my opinion. That's the subject of discussion
@@cv462-l4x "All what we can do is optimize models like ..." isnt this your claim?
article you suggested says the opposite
"Traditional empirical asset pricing models like CAPM and Fama-French have limitations in accurately capturing systematic risks and explaining stock returns. These models may be over-parameterized, mis-specified, or unable to capture non-linear relationships among factors."
It’s true, you cannot “predict stock prices”. Stock prices in an efficient market are semi-martingales: it’s mostly random walk. Why is it random? Because everything that can be predicted has already been predicted and priced into the asset. The price of all assets at any given time already incorporates all available information.
That’s why no firm on wall street tries to “predict future prices”. They either sell financial services to companies, provide a market, or other services (or in case of HFT try to arbitrage millisecond differences in prices between markets).
This is the danger of data science/machine learning without domain expertise. Or even a basic understanding of the domain.
@@cumibakar10Right, if I'm not mistaken what quants do at most is manage risk in portfolios by trying to accurately model the components of some commodity rather than predicting any specific future state. If the market is efficient then you can't beat anyone and if it isn't, you aren't the one to find out either.
Great video - thanks for sharing. I think it would be great to see what the average pct. change of the SP500 was (as a whole, not as an index) to check for general market movement bias. Great to see the comparison of the RNN and random approach. Also I never knew a coffee in the States was 8$!! Outrageous
Thanks! And appreciate the suggestions.
How many trades are you making during the test?
Does Deep Learning acheive this by default?
0.02% return in a day is 7.5% annual return. Pretty much market index
0.2% daily is a 107% annual return on your investment (you money doubles every year), that's something.
0.4% is something like 330% return per year.
There's no way these returns are consistent.
Definitely not
@@ritvikmath To make things more authentic, I think you should make some backtesting over years. Just to be more clear with your audience 😃
Great Video! working on exact model and this helps to validate mine
great video!
Thanks!
my sentiment on this video: thumbs up!
Triple Barrier Labeling or some of the other work by Prof. de Prado
Great explanation. It would be interesting to follow the predictions of the models on a regular basis just to see what happens when the market shifts. Ie, does the model predict that... or more than likely, can not make such predictions.
Thanks for the video.
Great suggestion!
@@ritvikmathmaybe do a weekly session on the results of previously implemented model. Good idea for useful and interesting content
I'm living in Italy where a coffee costs less than two dollars so I choked a bit ahaha
😂
Interesting
Indeed