How to Build Your First Decision Tree in Python (scikit-learn)

Поделиться
HTML-код
  • Опубликовано: 2 окт 2024
  • Are you intrigued by the power of decision-making in machine learning?
    By the end of this tutorial, you'll have a solid grasp of Decision Trees, be capable of implementing them in Python, and understand their role in various machine learning projects.
    What you'll discover:
    The fundamentals of Decision Trees: How they make decisions and create splits
    Hands-on coding: Building Decision Trees in Python using popular libraries
    Pruning and preventing overfitting: Strategies for optimizing Decision Tree performance
    Interested in discussing a Data or AI project? Feel free to reach out via email or simply complete the contact form on my website.
    📧 Email: ryannolandata@gmail.com
    🌐 Website & Blog: ryannolandata....
    🍿 WATCH NEXT
    Scikit-Learn and Machine Learning Playlist: • Scikit-Learn Tutorials...
    KNN Classification: • How to Build Your Firs...
    Logistic Regression: • Hands-On Machine Learn...
    Support Vector Machine: • Mastering Support Vect...
    MY OTHER SOCIALS:
    👨‍💻 LinkedIn: / ryan-p-nolan
    🐦 Twitter: / ryannolan_
    ⚙️ GitHub: github.com/Rya...
    🖥️ Discord: / discord
    📚 *Data and AI Courses: datacamp.pxf.i...
    📚 *Practice SQL & Python Interview Questions: stratascratch....
    WHO AM I?
    As a full-time data analyst/scientist at a fintech company specializing in combating fraud within underwriting and risk, I've transitioned from my background in Electrical Engineering to pursue my true passion: data. In this dynamic field, I've discovered a profound interest in leveraging data analytics to address complex challenges in the financial sector.
    This RUclips channel serves as both a platform for sharing knowledge and a personal journey of continuous learning. With a commitment to growth, I aim to expand my skill set by publishing 2 to 3 new videos each week, delving into various aspects of data analytics/science and Artificial Intelligence. Join me on this exciting journey as we explore the endless possibilities of data together.
    *This is an affiliate program. I may receive a small portion of the final sale at no extra cost to you.
  • НаукаНаука

Комментарии • 29

  • @RyanAndMattDataScience
    @RyanAndMattDataScience  Месяц назад

    Hey guys I hope you enjoyed the video! If you did please subscribe to the channel!
    Join our Data Science Discord Here: discord.com/invite/F7dxbvHUhg
    If you want to watch a full course on Machine Learning check out Datacamp: datacamp.pxf.io/XYD7Qg
    Want to solve Python data interview questions: stratascratch.com/?via=ryan
    I'm also open to freelance data projects. Hit me up at ryannolandata@gmail.com
    *Both Datacamp and Stratascratch are affiliate links.

  • @Daniellagnaux
    @Daniellagnaux 4 месяца назад +2

    Amazing! You explained the task concisely and clearly! Thank you very much!

  • @LeandruFleidl
    @LeandruFleidl Месяц назад

    Wonderful, this video saved me hours of reading documentation. Thank you very much 👍

  • @fatihahasus1069
    @fatihahasus1069 3 месяца назад +1

    13 is the number of features right? so if I have 60 columns 0:60?

  • @Nothing-fc3xo
    @Nothing-fc3xo 5 месяцев назад +1

    what the heck is this??😢 I’m literally taking my first AI course and my prof demanded such project like this .
    she didn’t even explained or taught us Python first

    • @RyanAndMattDataScience
      @RyanAndMattDataScience  5 месяцев назад +1

      Have a lot of Python vids and working on a beginner series so hopefully it helps

  • @glenkamai2027
    @glenkamai2027 Месяц назад

    good tutorial but you are explaining concepts shallowly men

  • @henry-o8i
    @henry-o8i 6 месяцев назад

    Thanks, Ryan. You are the best. Quick question- does it matter if i use the standard scaler for the data. If so, do i perform it before train test split or after? Also, i think it may be best if you put this in front of the Random Forest on your playlist. Thanks again

    • @RyanAndMattDataScience
      @RyanAndMattDataScience  6 месяцев назад

      I'll move this infront of it now, thanks. I've been working on revamping the playlists and desxcriptions this week. Preprocess your data before you split btw

  • @SamuelOgazi
    @SamuelOgazi 7 месяцев назад +2

    This was so helpful and straight to the point.
    Tbh, I got the logic from other channels but the implementation here was a breeze.
    I am dragging my friends here.
    God bless!

  • @SC-jd4gw
    @SC-jd4gw 4 месяца назад

    Thanks so much for your video but i have a question, I follow everything you did but when i do the print(classification_report(y_test, y_pred)) i have 7 rows, not only two.
    Why did this happen?

  • @telmagiovana6006
    @telmagiovana6006 4 месяца назад

    Thanks for the video! I have a question, in this part that we're talking about the importance of each feature(11:47), is it calculated by the gini?

  • @montanaapproves1044
    @montanaapproves1044 6 месяцев назад

    Hey man, I'm quite new to machine learning and I would like to know what IDE are you using in this video?

  • @andrewhuang1452
    @andrewhuang1452 3 месяца назад

    Where is the link of the csv

  • @aryan8020
    @aryan8020 3 месяца назад

    Great video and explanation, didn't believe that you only have 8k subs...

  • @michaelangelomerza3966
    @michaelangelomerza3966 10 месяцев назад

    Hi. I'm still learning python and may I ask. How will you add another data on that? For example I want to predict a new player if he will be among the HOF. My input will be only one. Shall I import a new CSV file containing that data then put it on X_test, and y_test? Thank you.

    • @RyanAndMattDataScience
      @RyanAndMattDataScience  10 месяцев назад

      Once you have a model built you can predict on it with inputs.

    • @michaelangelomerza3966
      @michaelangelomerza3966 10 месяцев назад

      Is it possible to output multiple results for one input? Im currently trying to build a College Course Recommender and the Inputs are based on the student grades, strand, hobbies/likes then output multiple possible courses that fits the inputs?
      @@RyanAndMattDataScience

  • @oleblack
    @oleblack 6 месяцев назад

    Great video. Thanks man , this video helped me with my Final assessment

  • @abdullah.montasheri
    @abdullah.montasheri 11 месяцев назад

    can you share the notebook file?

    • @RyanAndMattDataScience
      @RyanAndMattDataScience  11 месяцев назад +1

      I plan on doing a bulk upload to GitHub in the future for all the videos