Data Science with Marco
Data Science with Marco
  • Видео 37
  • Просмотров 120 554
Podcast - TimeGPT, predicting the future, and more
Links 🔗
Full episode available here: ruclips.net/video/TbMBXKuU8hU/видео.html
Master time series forecasting with my online course: www.datasciencewithmarco.com/offers/zTAs2hi6/checkout
I had a great talk with @JackRoycroftSherry on time series, how to predict them, what works and what does not. Of course, we talked about TimeGPT, what it means for the field of forecasting.
We also diverge into NLP, as a lot of parallels can be made between time series and natural language, but the models for one don't always work well for the other!
Просмотров: 368

Видео

Anomaly detection in time series with Python | Data Science with Marco
Просмотров 30 тыс.Год назад
A hands-on lesson on detecting outliers in time series data using Python. Full source code: github.com/marcopeix/youtube_tutorials/blob/main/YT_02_anomaly_detection_time_series.ipynb Dataset can be found here: github.com/numenta/NAB/blob/master/data/realAWSCloudwatch/ec2_cpu_utilization_24ae8d.csv Labels can be found here: github.com/numenta/NAB/blob/master/labels/combined_labels.json Chapters:...
Feature selection in machine learning | Full course
Просмотров 23 тыс.Год назад
Full source code on GitHub: github.com/marcopeix/youtube_tutorials/blob/main/YT_01_feature_selection.ipynb Introduction - 0:00 Initial code setup - 2:19 Variance threshold - 11:04 Variance threshold (code) - 13:02 Filter method - 19:39 Filter method (code) - 21:27 RFE - 29:08 RFE (code) - 30:42 Boruta - 37:12 Boruta (code) - 41:21 Thank you - 46:35 A full course on feature selection in machine ...
Should you aim for data science or data engineering? | Data Science Q&A #1
Просмотров 236Год назад
Weekly recap of the questions I answered about data science! Question 1: Why is SQL important when Python and R exist? (0:00) Question 2: How common is R in the industry compared to Python (0:38) Question 3: Should I aim for data science or data engineering? (1:35) Question 4: I have crossed the beginner stage of data science. How do I go deeper? (2:30) Question 5: Should I add certificates to ...
ARMA Model - Time Series Analysis in Python and TensorFlow
Просмотров 9 тыс.3 года назад
ARMA Model - Time Series Analysis in Python and TensorFlow
Autoregressive Process - Applied Time Series Analysis in Python and TensorFlow
Просмотров 2,1 тыс.3 года назад
Autoregressive Process - Applied Time Series Analysis in Python and TensorFlow
Moving Average Process - Applied Time Series Analysis in Python and TensorFlow
Просмотров 1,9 тыс.3 года назад
Moving Average Process - Applied Time Series Analysis in Python and TensorFlow
Random Walk Model - Applied Time Series Analysis in Python and TensorFlow
Просмотров 4,1 тыс.3 года назад
Random Walk Model - Applied Time Series Analysis in Python and TensorFlow
Stationarity and Differencing - Applied Time Series Analysis in Python and TensorFlow
Просмотров 1,5 тыс.3 года назад
Stationarity and Differencing - Applied Time Series Analysis in Python and TensorFlow
Autocorrelation and White Noise - Applied Time Series Analysis in Python and TensorFlow
Просмотров 5 тыс.3 года назад
Autocorrelation and White Noise - Applied Time Series Analysis in Python and TensorFlow
Basic Statistics - Applied Time Series Analysis in Python and TensorFlow
Просмотров 1,3 тыс.3 года назад
Basic Statistics - Applied Time Series Analysis in Python and TensorFlow
What Are Time Series - Applied Time Series Analysis in Python and TensorFlow
Просмотров 2,1 тыс.3 года назад
What Are Time Series - Applied Time Series Analysis in Python and TensorFlow
Data Science Portfolio Project: Regression #2 | Data Science with Marco
Просмотров 2,1 тыс.4 года назад
Data Science Portfolio Project: Regression #2 | Data Science with Marco
Data Science Portfolio Project: Regression #1 | Data Science with Marco
Просмотров 4,8 тыс.4 года назад
Data Science Portfolio Project: Regression #1 | Data Science with Marco
Unsupervised Learning | PCA and Clustering | Data Science with Marco
Просмотров 10 тыс.4 года назад
Unsupervised Learning | PCA and Clustering | Data Science with Marco
Suppor Vector Machine (SVM) in Python | Data Science with Marco
Просмотров 8264 года назад
Suppor Vector Machine (SVM) in Python | Data Science with Marco
Decision Trees | Data Science with Marco
Просмотров 6374 года назад
Decision Trees | Data Science with Marco
Resampling and Regularization | Data Science with Marco
Просмотров 1,2 тыс.4 года назад
Resampling and Regularization | Data Science with Marco
Classification in Python | logistic regression, LDA, QDA | Data Science With Marco
Просмотров 9 тыс.4 года назад
Classification in Python | logistic regression, LDA, QDA | Data Science With Marco
Linear Regression in Python | Data Science with Marco
Просмотров 2,5 тыс.4 года назад
Linear Regression in Python | Data Science with Marco

Комментарии

  • @TheSerbes
    @TheSerbes День назад

    I will be making an hourly passenger count forecast using LSTM time series model with 6-7 parameters. Can I choose the parameters as you did here?

  • @TheSerbes
    @TheSerbes День назад

    I want to make LSTM time series, what should I do for this? I think the situation is different for time series. Would I be wrong if I use what you did? There is both trend and seasonality in the series.

  • @pedro_tonom
    @pedro_tonom 9 дней назад

    Amazing video and excelent didatic. Congrats for the great quality, helped me a lot!

  • @purecheese9012
    @purecheese9012 21 день назад

    Seriously this channel is amazing, you deserve so many more subscribers man!

  • @exstream_play9144
    @exstream_play9144 24 дня назад

    Wait, just realized you are such a small RUclipsr. Thought you would have at least 200,000 subscribers with this quality video. Explaining everything in depth and very understandable with very helpful and educational videos!

  • @lecturesfromleeds614
    @lecturesfromleeds614 Месяц назад

    Marco's the man!

  • @mandefrolegesse5748
    @mandefrolegesse5748 Месяц назад

    Very interesting explanation and clear to understand. I was looking for this kind of tutorial. Subscribed👍

  • @ax5344
    @ax5344 Месяц назад

    I like the logic of this video. You showed the baseline, then three additional methods, then compare them in the end. Thanks a lot for sharing the technique. The feature/target matrix is also very helpful. My question is the principle or concept behind the filter method, RFE, and boruta. Is it possible to do a video on them?

  • @nabeel_kaleel
    @nabeel_kaleel Месяц назад

    subscribed

  • @mauroSfigueira
    @mauroSfigueira 2 месяца назад

    Hugely informative and educational content. Many feature engineering videos are not that instructive.

  • @code2compass
    @code2compass 2 месяца назад

    Hello!! quick question, why is the threshold 3.5 any reason please?

  • @babakheydari9689
    @babakheydari9689 2 месяца назад

    It was great! Thanks for sharing your knowledge. Hope to see more of you.

  • @eduardoaraujo9466
    @eduardoaraujo9466 3 месяца назад

    Do you have LinkedIn? Could I follow you? : )

  • @abhinavreddy6451
    @abhinavreddy6451 3 месяца назад

    Please do more Data science-related content, It was very helpful I searched everywhere for feature selection videos and finally landed on this video and this was all I needed, the content is awesome and the explanation is as well!

  • @tanyaalexander1460
    @tanyaalexander1460 3 месяца назад

    I am a noob to data science and feature selection. Yours is the most succinct and clear lesson I have found... Thank you!

  • @nikhildoye9671
    @nikhildoye9671 4 месяца назад

    I thought feature selection is done before model training. Am I wrong?

  • @user-jf6dd2iy2s
    @user-jf6dd2iy2s 4 месяца назад

    Thank you so much, you are my life saver !!

  • @user-qz3nx4xy8c
    @user-qz3nx4xy8c 4 месяца назад

    how about random cut forest ?

  • @joaovict007
    @joaovict007 4 месяца назад

    Very interesting content, thank you!

  • @imadsaddik
    @imadsaddik 4 месяца назад

    Thank you for sharing

  • @michaelmecham6145
    @michaelmecham6145 4 месяца назад

    Sensational video, thank you so much!

  • @littlepigywigy
    @littlepigywigy 4 месяца назад

    nice and clear

  • @tongji1984
    @tongji1984 5 месяцев назад

    Dear Marco Thank you.😀

  • @isabelahorta3063
    @isabelahorta3063 5 месяцев назад

    Hi! Do you recomend any video for pattern-wise anomaly detection?

    • @datasciencewithmarco
      @datasciencewithmarco 5 месяцев назад

      I don't know any, but you can look at the library TOAD for anonaly detection in time series. They do pattern-wise detection if I remember well

  • @user-ou5yb7uk3g
    @user-ou5yb7uk3g 5 месяцев назад

    Hello Marco, thank you so much for such a great video. Can you please make a video on anomaly detection for time series data using pycaret.

  • @eladiomendez8226
    @eladiomendez8226 5 месяцев назад

    Awesome video

  • @mrthwibble
    @mrthwibble 5 месяцев назад

    Excellent video, however I'm preoccupied trying to figure out if having wine as a gas would make dinner parties better or worse. 🤔

  • @scott6571
    @scott6571 6 месяцев назад

    Thank you! It's helpful!

  • @nerwax450
    @nerwax450 6 месяцев назад

    Merci Marco pour le partage !

  • @hoanhvong
    @hoanhvong 6 месяцев назад

    🎉 thank you a lot

  • @user-ms4et7nv9p
    @user-ms4et7nv9p 6 месяцев назад

    Hi Marco!! Thank you so much for making great videos on "Anomaly detection". Great Great work! Please keep sharing! 🙏🙏🙏🙏

  • @alfathterry7215
    @alfathterry7215 7 месяцев назад

    in Variance threshold technique, if we use Standard scaler instead of Minmax scaler, the variance would be the same for all variables.... does it means we can eliminate this step and just use standars scaler?

  • @oluwasegunodunlami7360
    @oluwasegunodunlami7360 7 месяцев назад

    Wow, this video is really helpful, a lot of interesting methods were shown. Thanks a lot. I like to ask you to make a future video covering how you perform feature engineering and model fine tuning 1:49

  • @dorukucar7105
    @dorukucar7105 8 месяцев назад

    pretty helpful!

  • @Loicmartins
    @Loicmartins 8 месяцев назад

    Thank you very much for your work!

  • @maythamsaeed533
    @maythamsaeed533 8 месяцев назад

    very helpful video and easy way to explain the content. thanks alot

  • @gouthamkarakavalasa4267
    @gouthamkarakavalasa4267 8 месяцев назад

    Anomaly detection is unsupervised, how did you get to if a point is anomaly or not, even before training the model ?

    • @datasciencewithmarco
      @datasciencewithmarco 8 месяцев назад

      The dataset is labeled. That way, we can measure the performance of each anomaly detection methods.

    • @devanshtalapa1416
      @devanshtalapa1416 4 месяца назад

      We got a few positive labels in cross validation

  • @parinkittipongdaja4684
    @parinkittipongdaja4684 8 месяцев назад

    Very clear in very short VDO!!!!

  • @claumynbega1670
    @claumynbega1670 8 месяцев назад

    Thanks for this valuable work. Helps me learning the subject.

  • @samuelliaw951
    @samuelliaw951 8 месяцев назад

    Really great content! Learnt a lot. Thanks for your hard work!

  • @chiragsaraogi363
    @chiragsaraogi363 8 месяцев назад

    This is an incredibly helpful video. One thing I noticed is that all features are numerical. How do we approach feature selection with a mix of numerical and categorical features? Also, when we have categorical features, do we first convert them to numerical features or first do feature selection. A video on this would be really helpful. Thank you

    • @haleematajoke4794
      @haleematajoke4794 7 месяцев назад

      You will need to convert the categorical features into numerical format by using label encoding which automatically converts it to numerical values or custom mapping where u can manually assign ur preferred values to the features. I hope it helps

    • @haleematajoke4794
      @haleematajoke4794 7 месяцев назад

      You will have to do the conversion before feature selection because machine learning models only learn from numerical data

  • @jannoona123
    @jannoona123 9 месяцев назад

    Hi Marco! I'm working on a project and this has a lot of components I need. I noticed the specification of the data said that it was being recorded every 5 minutes, could you create a tutorial on how to retrieve a stream of live data and pass it to the algorithm in a somewhat real-time fashion? I hope this is similar to what I understood from your data collection in the video

    • @jiretkatharpi1099
      @jiretkatharpi1099 8 месяцев назад

      Hi I wanted to work on the same thing, did you get anything?

  • @paramvirsaini2806
    @paramvirsaini2806 9 месяцев назад

    Great explanation. Easy hands-on as well!!

  • @harrishvar7677
    @harrishvar7677 9 месяцев назад

    Hey !, Is it possible to identify and flag anomalies within a continuous numerical attribute?

    • @datasciencewithmarco
      @datasciencewithmarco 9 месяцев назад

      If by continuous, you mean at a very high frequency, then yes, I don't see why not!

    • @harrishvar7677
      @harrishvar7677 9 месяцев назад

      Thanks !, If possible, can you make a video on that, it would be really helpful !@@datasciencewithmarco

  • @strawstraw1237
    @strawstraw1237 9 месяцев назад

    Should we use the pc scores as the input for k-means or the original dataset?

  • @pabloarriagadaojeda6452
    @pabloarriagadaojeda6452 10 месяцев назад

    hey Marco!! This is the first time I've watched one of your videos, and after 5 minutes of starting the video, I quickly went through your entire channel, looking at your content. It's AMAZING! Thank you for all your efforts to share your knowledge with the community. A hug from Chile!!

    • @datasciencewithmarco
      @datasciencewithmarco 10 месяцев назад

      Thanks Pablo for the kind words! Really appreciate it!

    • @belkmouf
      @belkmouf 7 месяцев назад

      You rock bro🎩 off to you.

  • @nathaliegrand4018
    @nathaliegrand4018 10 месяцев назад

    Excellent presentation. Very clear explanation. Would be great to have more info on the impact of the context and wich one of the methods is expected to work best in wich context.

  • @ananya_kathak
    @ananya_kathak 11 месяцев назад

    What is the accuracy?

    • @datasciencewithmarco
      @datasciencewithmarco 11 месяцев назад

      Here, accuracy is really not a good idea, because there are so few anomalies. A simple baseline could achieve 99% accuracy, even though there is no "learning". That's why we use the confusion matrix here to see if we can actually identify anomalies.

  • @pooraniayswariya997
    @pooraniayswariya997 Год назад

    Can you teach how to do MRMR feature selection in ML?