Train Test Split with Python Machine Learning (Scikit-Learn)

Поделиться
HTML-код
  • Опубликовано: 19 окт 2024
  • In this Python Machine Learning Tutorial, we take a look at how you can split a data set through train test split in scikit learn.
    This is a great method for prepping your data before you run a model.
    Interested in discussing a Data or AI project? Feel free to reach out via email or simply complete the contact form on my website.
    📧 Email: ryannolandata@gmail.com
    🌐 Website & Blog: ryannolandata....
    🍿 WATCH NEXT
    Scikit-Learn and Machine Learning Playlist: • Scikit-Learn Tutorials...
    Feature Scaling: • Python Feature Scaling...
    Random Forest Classifier: • Random Forest Algorith...
    Ordinal Encoder: • Ordinal Encoder with P...
    MY OTHER SOCIALS:
    👨‍💻 LinkedIn: / ryan-p-nolan
    🐦 Twitter: / ryannolan_
    ⚙️ GitHub: github.com/Rya...
    🖥️ Discord: / discord
    📚 *Data and AI Courses: datacamp.pxf.i...
    📚 *Practice SQL & Python Interview Questions: stratascratch....
    WHO AM I?
    As a full-time data analyst/scientist at a fintech company specializing in combating fraud within underwriting and risk, I've transitioned from my background in Electrical Engineering to pursue my true passion: data. In this dynamic field, I've discovered a profound interest in leveraging data analytics to address complex challenges in the financial sector.
    This RUclips channel serves as both a platform for sharing knowledge and a personal journey of continuous learning. With a commitment to growth, I aim to expand my skill set by publishing 2 to 3 new videos each week, delving into various aspects of data analytics/science and Artificial Intelligence. Join me on this exciting journey as we explore the endless possibilities of data together.
    *This is an affiliate program. I may receive a small portion of the final sale at no extra cost to you.

Комментарии • 29

  • @RyanAndMattDataScience
    @RyanAndMattDataScience  2 месяца назад

    Hey guys I hope you enjoyed the video! If you did please subscribe to the channel!
    Join our Data Science Discord Here: discord.com/invite/F7dxbvHUhg
    If you want to watch a full course on Machine Learning check out Datacamp: datacamp.pxf.io/XYD7Qg
    Want to solve Python data interview questions: stratascratch.com/?via=ryan
    I'm also open to freelance data projects. Hit me up at ryannolandata@gmail.com
    *Both Datacamp and Stratascratch are affiliate links.

  • @tengsolomon
    @tengsolomon Год назад +2

    Thanks so much for your video. So simple and easy to follow.

  • @rifqimaruf
    @rifqimaruf 6 месяцев назад

    im starting learning machine learning cause my duty on college, this video explain with ease, thank you Ryan, keep it up.

  • @gajendrakc813
    @gajendrakc813 7 дней назад

    Thank you Ryan. Learning so much from you.

    • @RyanAndMattDataScience
      @RyanAndMattDataScience  7 дней назад +1

      No problem join our discord also! We will be hosting trainings and office hours in the future

  • @onurdatascience
    @onurdatascience Год назад +1

    Important topic, great content!

  • @darks_
    @darks_ 4 месяца назад

    Thanks a lot! Small question, what should I do if I want to have a stratified splitting with the same database?

  • @kimaudreymagan484
    @kimaudreymagan484 17 дней назад

    Thank you!

  • @N1246-c2f
    @N1246-c2f 8 месяцев назад

    Thanks this makes so much sense! I'm running a multiple regression on some stock data but my r2 value is coming out pretty low.. do you know how i can improve the model? or do u have a vid on it?

    • @RyanAndMattDataScience
      @RyanAndMattDataScience  8 месяцев назад

      No problem and ye check out my Kaggle projects. I go over different techniques. Try different models and hyper parameters. Optuna also may help

  • @henry-o8i
    @henry-o8i 7 месяцев назад

    Thanks for the great content. I wonder if you discuss data leakage in your later videos/project. I was confused on when I should do train_test_split in a project. like should i do the pre-processing data first or train_test_split first

    • @RyanAndMattDataScience
      @RyanAndMattDataScience  7 месяцев назад +1

      Hey may cover this way later this year. Focusing on Ai vids next few months

  • @frankdearr2772
    @frankdearr2772 7 месяцев назад

    great topic thanks 👍

  • @henry-o8i
    @henry-o8i 7 месяцев назад

    Ryan - Great content- thanks but wonder if you can provide a road map for the playlists. I think that will be really helpful.

    • @RyanAndMattDataScience
      @RyanAndMattDataScience  7 месяцев назад

      the playlist is in order + has a few projects along the way. I do plan on adding to it later this year

    • @henry-o8i
      @henry-o8i 7 месяцев назад

      Thank You. Thanks for the great content.- I been giving up on studying data science after attending bootcamp 2 years ago. I found your videos been really helpful for me to refresh/studying data science again.@@RyanAndMattDataScience

  • @jencinas8586
    @jencinas8586 5 месяцев назад

    Hello , do you recommend learning sql first ,before starting with ML ?

  • @epicmemesandanime329
    @epicmemesandanime329 Месяц назад

    at 3:01
    why y=df["HOF"]?

    • @gajendrakc813
      @gajendrakc813 5 дней назад

      That is assigning HOF column as y. X ( Rest of the columns ) is input and y is output. We are using X ( rest of the columns ) to determine y ( output). Hope that made sense.

  • @subhanjalpant8824
    @subhanjalpant8824 28 дней назад

    Where is the data????