Using XGBoost for Time Series Forecasting in Python ❌ XGBoost for Stock Price Prediction Tutorial

Поделиться
HTML-код
  • Опубликовано: 27 ноя 2024

Комментарии • 64

  • @DecisionForest
    @DecisionForest  4 года назад +2

    Hi there! If you want to stay up to date with the latest machine learning and deep learning tutorials subscribe below. Thank you for your support!
    ruclips.net/user/decisionforest

  • @dusanbosnjakovic6588
    @dusanbosnjakovic6588 3 года назад +3

    Just so I understand: This method 1) doesn't de-trend the data or make it stationary, 2) uses just the current price as the only feature (no rolling window aggregates), correct?

  • @vivekrai1974
    @vivekrai1974 2 года назад +2

    10:15 Explanation of Walk Forward Method

  • @weiyang2116
    @weiyang2116 3 года назад +3

    Great video.. How do you cater for the case where there's a new major event like the pandemic, because the old trained model wouldn't work anymore? Thanks!

  • @Spartanboy123
    @Spartanboy123 3 года назад +2

    Great Information. Thank you for your time!!
    Once you find the rmse for your train/test model. How can you predict for next 2 years?

  • @wojtek8396
    @wojtek8396 3 года назад +1

    why do you put val into np.array? Why do not you use encoders and scaler before testing?

  • @limyong8823
    @limyong8823 Год назад

    Excellent explaination of XGBoost and it's slimplifed things.

  • @hasithahiranrajapaksa5611
    @hasithahiranrajapaksa5611 3 года назад +1

    hey,i have a question. can we measure the r2Score of this? I got a negative r2score. Can I know what is the reason?

  • @ddrikee2546
    @ddrikee2546 Год назад

    Do you have relevant experience in machine learning survival analysis? For example xgbse , scikit survival and pycox package in python. I don't know how the data format is handled in this case, also time dependent ROC, calibration and DCA.

  • @StoicRichie
    @StoicRichie 2 года назад

    I didn't understand the val object,why did u reshape it.Plz give a clarity.
    Thanks

  • @vuduy9763
    @vuduy9763 3 года назад

    you save my day!! thank you from VietNam

  • @hartejhaer8851
    @hartejhaer8851 3 года назад +1

    Can you show the end prediction results please? Would be very useful as a reference point :)

  • @morecharacterswithamix9067
    @morecharacterswithamix9067 2 года назад

    Nice! I have a time series problem where the data is at daily level for 3 years and we forecast for daily level 2 years out. Currently I use Prophet which works well. Prophet takes care of spike events (holidays) and weekly / yearly seasonality. Can I use XGBoost to do this? Your example here is forecasting one step out which will be like forecasting one day out for my case.

  • @Borzacchinni
    @Borzacchinni Год назад

    Great demonstration!

  • @brandaphoon2502
    @brandaphoon2502 3 года назад +1

    I am a student and this helped me greatly! Is there a video for incorporating technical indicators or other features to have a more accurate model?

    • @DecisionForest
      @DecisionForest  3 года назад

      Happy it helped. I haven't created any video with technical indicators as features as it's pretty easy to extend my current videos with them. Let me know what results you get if you try it out. My results so far have been pretty poor with standard indicators although I wasn't surprised :)

  • @charlotteramos2267
    @charlotteramos2267 3 года назад +1

    You're a great teacher!! Thanks for the lesson, ir really helped me!! 🙂

    • @DecisionForest
      @DecisionForest  3 года назад

      Thank you so much Charlotte, glad it was useful.

  • @dishydez
    @dishydez 2 года назад

    Great video! Another tool in the ML toolbox. Is this different to the Timeseries Cross validation?

  • @stephenyates1730
    @stephenyates1730 3 года назад

    Thanks for sharing.... Keep getting an error at the last line after print(rmse)
    ... ValueError: Input contains NaN, infinity or a value too large for dtype('float64').

  • @anurag931983
    @anurag931983 2 года назад

    I read in a research paper to predict time series we may use multiple trajectories. Any insight on what these trajectories are in time series and how to calculate these?

  • @scienceforyoung
    @scienceforyoung 3 года назад

    Good video. Is there a way to predict the unknown future? or how can we adapt the function to this approach?
    Thank you.

  • @malcolmburr6602
    @malcolmburr6602 4 года назад

    @DecisionForest Great Video! Is it common for the model to take over 15 minutes? The validation walkthrough part to be exact. Mine has been running for a good amount of time.

    • @DecisionForest
      @DecisionForest  4 года назад

      Thanks Malcolm! Yes, the walk forward validation and any type of validation can take quite a long time depending on computing power.

  • @roywit7836
    @roywit7836 4 года назад

    This looks great thanks :) just subbed today, loving your channel

    • @DecisionForest
      @DecisionForest  4 года назад +1

      Thank you for the support! Glad you find the content helpful.

  • @salehalatwah9002
    @salehalatwah9002 3 года назад

    looks good , the question is how can you predict into unknown future ? how do you give it number of days to keep looping on training updating prediction and retraining

    • @DecisionForest
      @DecisionForest  3 года назад

      That's exactly what we're doing here with walk forward validation, we keep retraining and predicting. That's why walk forward validation is so important for time series.

  • @jimmyrico5364
    @jimmyrico5364 4 года назад

    Good video, thanks for sharing this!
    Could you maybe indicate why can't you after fitting the model just pass the full X_test set and use the predict method available from XGBoost?

    • @DecisionForest
      @DecisionForest  4 года назад

      Glad it was helpful. Well that’s the trick and why walk forward validation is useful. why would you ignore new data when you have it, of course you could but you want to get the best possible results.

  • @ragendhusr5791
    @ragendhusr5791 3 года назад

    Thankyou for sharing. I got error in
    pred = xgb_predict(history, test_X[0]) and
    X, y = train[:, :-1], train[:, -1].
    The error message was "IndexError: too many indices for array: array is 0-dimensional, but 2 were indexed".
    Could you please tell me how to resolve this error?

  • @malleswararaomaguluri6344
    @malleswararaomaguluri6344 3 года назад

    I have 450 points for train data, need to predict for next 10 samples, how can we do train test split

  • @Wolf-hh4rv
    @Wolf-hh4rv 3 года назад

    Just read a raft of papers telling me that tree based methods are not easily adapted to time series… if you give them the simplest model ie extrapolation, can’t do. Dude you do this for a living.

    • @DecisionForest
      @DecisionForest  3 года назад

      of course there are, and there are others claiming the opposite. We start with a theory, test it in practice and based on the results we decide what is good and what isn't. Papers are incredibly useful as they help you avoid wasting time but in the end what works in the real world matters.

  • @varungupta4228
    @varungupta4228 3 года назад

    Hi!
    Great video. But this was only the validation of the model. Suppose we want to forecast it further. How do we do that?

    • @DecisionForest
      @DecisionForest  3 года назад

      Cheers Varun! Well after you are satisfied with the results, you train on the full dataset.

    • @varungupta4228
      @varungupta4228 3 года назад

      @@DecisionForest Hi Thanks for replying. So once we do a walk forward validation on the train test split, I finalize my model. I use that model on the whole dataset. I am not able to understand what will the model return at that step?

  • @emeline894
    @emeline894 3 года назад

    Thank you for this very helpful video. Much appreciated!

  • @lollmao249
    @lollmao249 2 года назад

    i tried to sign up to get the code but failed why ?

  • @shadialmoayed1262
    @shadialmoayed1262 4 года назад

    Thanks, that was helpful.. can you provide more videos on XAI (Shap, Lime, etc..) and how to inerpret models such as XGBoost, RandomForest, Deep learning NN models and so on

    • @DecisionForest
      @DecisionForest  4 года назад +1

      Glad it was helpful. I wanted to do a video on SHAP, I actually created the notebook just that it takes forever to run, it's extremely computationally expensive especially with high dimensional datasets. I'd encourage you to use LIME as it's more practical for real world scenarios even if SHAP may be more interesting on paper.

  • @prosimulate
    @prosimulate 4 года назад +1

    You’re great Sir!

    • @DecisionForest
      @DecisionForest  4 года назад

      You’re kind, thank you! Glad it was helpful.

  • @yunoda2
    @yunoda2 3 года назад

    2:52 what is the purpose of debuggers ?

  • @brianchaplin278
    @brianchaplin278 11 месяцев назад

    Nice explanation

  • @saketnarendra8864
    @saketnarendra8864 4 года назад

    can you give the link to the dataset

  • @saurabhkumar289
    @saurabhkumar289 3 года назад

    Amazing content thanks sir

  • @sunilnarwaria7452
    @sunilnarwaria7452 3 года назад

    Great content

  • @rrmusic970
    @rrmusic970 3 года назад

    This tutorial video is crystal clear to me up until 13:59 minute, then i lost completly...
    What is the meant of 1.7967091070446082 ???
    Is it the prediction of the next hour close price??? Or have i miss something???
    i mean, how to print out that predictions for the net 1 hour close price???

    • @DecisionForest
      @DecisionForest  3 года назад

      Thanks, glad it was helpful. That is the RMSE, the root mean squared error, the metric that we want to optimize in this case. You need to optimise for small values of this metric. Hope this helped.

  • @didierleprince6106
    @didierleprince6106 5 дней назад

    Merci (:

  • @sunaxes
    @sunaxes Год назад

    Your model is using one hour's value to predict the next? This is ridiculously useless model haha. But the explanations are great!

  • @sodiqrafiu9072
    @sodiqrafiu9072 4 года назад

    How can we get the dataset, please?

    • @DecisionForest
      @DecisionForest  4 года назад +2

      Just updated the download file to contain the data.

  • @aarondelarosa3146
    @aarondelarosa3146 Год назад

    You forgot the plot.

  • @piotr780
    @piotr780 3 года назад

    model with one variable xD