Train Test Split with Python Machine Learning (Scikit-Learn)
HTML-код
- Опубликовано: 19 окт 2024
- In this Python Machine Learning Tutorial, we take a look at how you can split a data set through train test split in scikit learn.
This is a great method for prepping your data before you run a model.
Interested in discussing a Data or AI project? Feel free to reach out via email or simply complete the contact form on my website.
📧 Email: ryannolandata@gmail.com
🌐 Website & Blog: ryannolandata....
🍿 WATCH NEXT
Scikit-Learn and Machine Learning Playlist: • Scikit-Learn Tutorials...
Feature Scaling: • Python Feature Scaling...
Random Forest Classifier: • Random Forest Algorith...
Ordinal Encoder: • Ordinal Encoder with P...
MY OTHER SOCIALS:
👨💻 LinkedIn: / ryan-p-nolan
🐦 Twitter: / ryannolan_
⚙️ GitHub: github.com/Rya...
🖥️ Discord: / discord
📚 *Data and AI Courses: datacamp.pxf.i...
📚 *Practice SQL & Python Interview Questions: stratascratch....
WHO AM I?
As a full-time data analyst/scientist at a fintech company specializing in combating fraud within underwriting and risk, I've transitioned from my background in Electrical Engineering to pursue my true passion: data. In this dynamic field, I've discovered a profound interest in leveraging data analytics to address complex challenges in the financial sector.
This RUclips channel serves as both a platform for sharing knowledge and a personal journey of continuous learning. With a commitment to growth, I aim to expand my skill set by publishing 2 to 3 new videos each week, delving into various aspects of data analytics/science and Artificial Intelligence. Join me on this exciting journey as we explore the endless possibilities of data together.
*This is an affiliate program. I may receive a small portion of the final sale at no extra cost to you.
Hey guys I hope you enjoyed the video! If you did please subscribe to the channel!
Join our Data Science Discord Here: discord.com/invite/F7dxbvHUhg
If you want to watch a full course on Machine Learning check out Datacamp: datacamp.pxf.io/XYD7Qg
Want to solve Python data interview questions: stratascratch.com/?via=ryan
I'm also open to freelance data projects. Hit me up at ryannolandata@gmail.com
*Both Datacamp and Stratascratch are affiliate links.
Thanks so much for your video. So simple and easy to follow.
No problem. I try to keep all my vids simple and straight to the point
im starting learning machine learning cause my duty on college, this video explain with ease, thank you Ryan, keep it up.
Np check out my other ML vids
Thank you Ryan. Learning so much from you.
No problem join our discord also! We will be hosting trainings and office hours in the future
Important topic, great content!
Thanks!
Thanks a lot! Small question, what should I do if I want to have a stratified splitting with the same database?
Thank you!
No problem
Thanks this makes so much sense! I'm running a multiple regression on some stock data but my r2 value is coming out pretty low.. do you know how i can improve the model? or do u have a vid on it?
No problem and ye check out my Kaggle projects. I go over different techniques. Try different models and hyper parameters. Optuna also may help
Thanks for the great content. I wonder if you discuss data leakage in your later videos/project. I was confused on when I should do train_test_split in a project. like should i do the pre-processing data first or train_test_split first
Hey may cover this way later this year. Focusing on Ai vids next few months
great topic thanks 👍
Thank you
Ryan - Great content- thanks but wonder if you can provide a road map for the playlists. I think that will be really helpful.
the playlist is in order + has a few projects along the way. I do plan on adding to it later this year
Thank You. Thanks for the great content.- I been giving up on studying data science after attending bootcamp 2 years ago. I found your videos been really helpful for me to refresh/studying data science again.@@RyanAndMattDataScience
Hello , do you recommend learning sql first ,before starting with ML ?
Sql and Python which I have vids on my channel
at 3:01
why y=df["HOF"]?
That is assigning HOF column as y. X ( Rest of the columns ) is input and y is output. We are using X ( rest of the columns ) to determine y ( output). Hope that made sense.
Where is the data????
Github
on his github
@@casonpark link?