Thanks for a very informative explanation. This seems like a bit of a step up in complexity from earlier videos, so I suspect some viewers of earlier ones might not make it to the end of this one. I think the Quiz answers are A B B. Presumably this Probsparse approach is useful in other situations (image processing springs to mind) as well as time sequences.
In the final video are you going show an example when you feed data into the model and the interpret the output. It would be good to see any prepressing of the data to get it in the right format to feed into the model. I'm keen to use this model for a timeseries forecasting exercise 8 timesteps ahead.
Thanks for a very informative explanation. This seems like a bit of a step up in complexity from earlier videos, so I suspect some viewers of earlier ones might not make it to the end of this one. I think the Quiz answers are A B B. Presumably this Probsparse approach is useful in other situations (image processing springs to mind) as well as time sequences.
In the final video are you going show an example when you feed data into the model and the interpret the output. It would be good to see any prepressing of the data to get it in the right format to feed into the model. I'm keen to use this model for a timeseries forecasting exercise 8 timesteps ahead.
@4:38 are you sure d_q is the number of total time steps? I think it's supposed to be the dimension of the query & key.
Hey I am not able to access the links you have provided under Math courses and other related courses
A