Comparing machine learning models in scikit-learn

Поделиться
HTML-код
  • Опубликовано: 19 окт 2024

Комментарии • 596

  • @dataschool
    @dataschool  3 года назад +10

    Having problems with the code? I just finished updating the notebooks to use *scikit-learn 0.23* and *Python 3.9* 🎉! You can download the updated notebooks here: github.com/justmarkham/scikit-learn-videos

    • @emmittkyrie2965
      @emmittkyrie2965 3 года назад

      Sorry to be off topic but does anyone know of a method to get back into an Instagram account..?
      I was stupid forgot the password. I would love any tricks you can give me!

    • @lyleelias8595
      @lyleelias8595 3 года назад

      @Emmitt Kyrie Instablaster :)

    • @emmittkyrie2965
      @emmittkyrie2965 3 года назад

      @Lyle Elias Thanks for your reply. I got to the site thru google and im trying it out atm.
      Seems to take quite some time so I will get back to you later with my results.

  • @rajatpai5048
    @rajatpai5048 5 лет назад +3

    Really simple to understand. Doesn't make it seem like "its a library thing, library does it for ya". Thank you for doing this

    • @dataschool
      @dataschool  5 лет назад

      You're very welcome! Thanks for your kind words!

  • @siddharthkotwal8823
    @siddharthkotwal8823 8 лет назад +63

    That's some killer delivery, you didn't waste a word! Great tutorial!

  • @suhailchougle7315
    @suhailchougle7315 4 года назад +5

    This is by far the best Sci-kit Learn tutorial on RUclips. I can say this because I have seen almost every tutorial and this covers everything starting from scratch.I knew how all the algorithms work but what I needed was how do I implement those algorithms from loading the data set to all terminologies to checking the accuracy and what not and this series has everything I was looking for ,thank you so much for this.Really appreciate it.

    • @dataschool
      @dataschool  3 года назад +1

      Wow! Thank you so much for your kind words! :)

  • @WanderingJoy
    @WanderingJoy 5 лет назад +7

    "models that overfit have learned the noise in the data rather than the signal" - yes, well said!

    • @dataschool
      @dataschool  5 лет назад

      Glad it was helpful to you!

  • @pabloalonso5440
    @pabloalonso5440 8 лет назад +38

    Dear Kevin. To me your videos are a reference, as those of Mr Andrew Ng. Very good job! Thank you very much from Spain :)

  • @LordBadenRulez
    @LordBadenRulez 8 лет назад +11

    I like the pace of these videos. You speak really slow and clear which helps your viewer to digest the information on the fly. Loving your work!

    • @dataschool
      @dataschool  8 лет назад +4

      Thanks for the feedback! I'm really glad to hear that my presentation of the material works well for you. Good luck with your education!

    • @CausticCatastrophe
      @CausticCatastrophe 7 лет назад

      Yeah, the slow pace is generally great, though personally I view these at 1.25 speed. Still clear at that rate too. :)

  • @uppubhai
    @uppubhai 8 лет назад +51

    This is such a gem for beginners .Thank you very much Kevin

  • @priyanshugupta7591
    @priyanshugupta7591 4 года назад +2

    Your way of delivery is exceptional. I have never seen somebody teaching so well like you. I made me interested in ML Thanks bro...God bless U

  • @miguelgutierrez2902
    @miguelgutierrez2902 4 года назад +3

    Uno de los mejores manuales sobre "Machine learning" que he visto. Gracias por ofrecernos la oportunidad de aprender. Además, tu pronunciación es perfecta para hispanohablantes

  • @dataschool
    @dataschool  6 лет назад +19

    *Note:* This video was recorded using Python 2.7 and scikit-learn 0.16. Recently, I updated the code to use Python 3.6 and scikit-learn 0.19.1. You can download the updated code here: github.com/justmarkham/scikit-learn-videos

    • @dataschool
      @dataschool  6 лет назад +3

      You're very welcome!

    • @aquaman788
      @aquaman788 4 года назад

      @@dataschool can we have a lecture about Tensorflow?

  • @kushsheth4801
    @kushsheth4801 3 года назад +1

    loving this series man just started out with ML and DS understanding everything

    • @dataschool
      @dataschool  3 года назад

      That's excellent to hear!

  • @KhalilMuhammad
    @KhalilMuhammad 9 лет назад +17

    This is a brilliant tutorial -- I love everything about it. Thanks.

    • @dataschool
      @dataschool  9 лет назад +1

      +Khalil Muhammad Wow, thank you! I really appreciate your kind words!

  • @FULLCOUNSEL
    @FULLCOUNSEL 7 лет назад

    I thank God I landed on your videos. I see things clearer than ever. You are a gifted tutor. God bless you sir.

    • @dataschool
      @dataschool  7 лет назад

      Wow, thanks so much for your incredibly kind comments!

  • @beansgoya
    @beansgoya 6 лет назад

    The last two videos are the best ones I’ve seen someone explain scikit learn’s predictions. Every other video jumps straight to the full analysis but in reality, you can predict in as little as 4 lines of code. Great job!

  • @frankacito8076
    @frankacito8076 6 лет назад

    Your teaching style is outstanding. As someone who has used R in the past, I really appreciate the clarity of your explanations and demonstrations.

  • @daokou1851
    @daokou1851 8 лет назад

    A student from CN jumping across the Great Wall learned this excellent class. Thx.

    • @dataschool
      @dataschool  8 лет назад

      Awesome! You're very welcome!

  • @susmit03
    @susmit03 7 лет назад +103

    for i in range(1, 10001) :
    print(“THANK YOU VERY MUCH")

    • @dataschool
      @dataschool  7 лет назад +5

      HA! Love it! You're very welcome :)

    • @susmit03
      @susmit03 7 лет назад +3

      Data School Thank you.
      Your reply shows your passion for programming.
      Keep up the good work of teaching.

    • @dataschool
      @dataschool  7 лет назад +2

      Thanks! :)

    • @dioszegizoltan4493
      @dioszegizoltan4493 7 лет назад +4

      No ,this is the correct one:
      while 1 == 1:
      print("THANK YOU VERY MUCH")

    • @shawndonaldson5674
      @shawndonaldson5674 7 лет назад +7

      while True: print("THANK YOU VERY MUCH")

  • @pranavjoshi7021
    @pranavjoshi7021 7 лет назад

    This video series sets such a high standards for Content, Context and Delivery of Machine Learning training ! Its a winner for all those who are starting to learn Machine Learning !! Thank you so much for your efforts Kevin !!!

    • @dataschool
      @dataschool  7 лет назад

      Wow, thank you so much for your very kind comment! I really appreciate your support!

  • @NirajKumar-hq2rj
    @NirajKumar-hq2rj 8 лет назад

    Excellent teaching !!! I am required to set up competency around advance analytic involving ML/DS (since I am coming from DWH and BI practice) in my organization, so I wanted to learn and practice. Now , I feel like taking this as a full time profession and become Data Scientist. It's so much fun and exciting work, such video has made it lot easier. Thank you !!!

    • @dataschool
      @dataschool  8 лет назад

      Awesome! So glad to hear! Thanks for your kind words, and good luck on your educational journey :)

  • @DionMJulien
    @DionMJulien 7 лет назад

    I couldn't agree more with berry jordaan.
    The way you deliver the content of a quite complex topic naturally guides me to want to learn more about machine learning.
    Thank you very much

    • @dataschool
      @dataschool  7 лет назад

      Thank you so much for your comment - you're very welcome!

  • @galustbayburcyan1083
    @galustbayburcyan1083 7 лет назад +1

    I was looking for ML tutorials and can say that your videos are simply the best.Thanks a lot

    • @dataschool
      @dataschool  7 лет назад

      Wow, thank you so much! What a nice comment!

  • @juancastillo2249
    @juancastillo2249 8 лет назад

    Wow I must say your teaching style is amazing. Very organized, thorough and easy to follow. Thanks for your time, and keep making great videos! I wish more professors were like you at my school.

    • @dataschool
      @dataschool  8 лет назад

      +Juan P Castillo What a nice comment! Thank you so much for your generous words! I'm glad the series has been helpful to you :)

  • @frankgiardina205
    @frankgiardina205 4 года назад +1

    Kevin this series is excellent, you are able to really simplify the topic to make it easy to learn Thanks

  • @fubar0sid
    @fubar0sid 8 лет назад +3

    Most notable take aways from the video:
    - "Plotting testing accuracy vs model complexity is a very useful way to tune any parameters that relate to model complexity."
    - "Once you have chosen a model and it's optimal parameters and are ready to make predictions on out of sample data, it's important to re train your model on all of the available training data."
    - Repeating the train/test split process multiple times in a systematic way using k fold cross_validation

    • @dataschool
      @dataschool  8 лет назад

      Great summary! I approve :)

  • @kamruzzamantanim2055
    @kamruzzamantanim2055 6 лет назад

    My confident level is super high to learn Machine Learning after seeing this video. Your every word is very clear and correct. Thank you very much.

  • @SomeIndoGuy
    @SomeIndoGuy 8 лет назад +3

    Great series, honestly it's the most easily understandable lecture about one of the complicated topic in computer science.
    Love the flow of the video, the tempo of the complexity, really easy to follow. I have several comments to improve in my opinion:
    1. When you point out on specific parts of the screen, it would be great to not just use the cursor but also a more visually impactful feedback (there are tools for this)
    2. Would love to get a repeated definition of the specific terms (such as model complexity, what does that mean? The higher the value of n_neighbors the more complex it is? what does it mean to be complex?)
    3. I understand that this is an introduction class, but it would be really helpful to show the industry's best practices (advanced series?)
    Great work, I subscribed, and liking all of your videos.

    • @dataschool
      @dataschool  8 лет назад

      +SomeIndoGuy Thanks for your very kind comments, as well as your feedback!
      Regarding model complexity, this is an excellent essay on the bias-variance tradeoff (a critical machine learning topic) that touches on model complexity: scott.fortmann-roe.com/docs/BiasVariance.html

  • @BrianMoyer-kq2gl
    @BrianMoyer-kq2gl Год назад

    This was one of the best videos on the topic that I've found. Thank you for being so succinct and breaking this down so clearly!

  • @RajeshSriMuthu
    @RajeshSriMuthu 5 лет назад +1

    OMG finally found a ML tutor who is awesome.... i cant skip any seconds in your videos, even each words are informative

    • @dataschool
      @dataschool  5 лет назад

      Thanks so much for your kind words! I truly appreciate it!

  • @cbdave79
    @cbdave79 8 лет назад +1

    Thank you for these videos! they are well made and clear. I don't think i understood ML until sitting through your videos.

    • @dataschool
      @dataschool  8 лет назад

      Thanks for your kind comment! That's so nice to hear.

  • @vikramsamal85
    @vikramsamal85 7 лет назад

    I have been reading from a lot of source but till date this series is the best! I wish there much more videos and reference which will take us to the advanced level!

    • @dataschool
      @dataschool  7 лет назад

      Thanks so much for your kind comment!

  • @personsname0
    @personsname0 6 лет назад

    Best video series I've come across on sklearn! I tried a few other channels before this and was left feeling like I still had no idea what was going on, but after only 5 of your videos I already feel way more confident that I can actually get into it, cheers!

    • @dataschool
      @dataschool  6 лет назад

      Awesome! Thanks for your kind comments, and good for you! :)

  • @MridulBanikcse
    @MridulBanikcse 7 лет назад

    Thanks Sir , for giving effort to make these videos . Being a beginner I find these resources extremely helpful .

  • @muhammadbilalahmad4888
    @muhammadbilalahmad4888 7 лет назад

    Thank you Kevin for sharing well organized, normal speed video lectures on scikit learn. These videos are very helpful to teach ML in python to graduate students. The links in the resources are also very valuable. You deserve appreciations. I would suggest to upload lectures ML with R.

    • @dataschool
      @dataschool  7 лет назад

      You're very welcome! I'm glad to hear the videos have been helpful to you! I'm focused on Python these days, so I don't anticipate making any videos on R - sorry!

  • @hasslefreelabs
    @hasslefreelabs 6 лет назад

    I'm watching this tutorial from last few days..Very very precise and accurate content.. it made me to rewind watch many times..! great..!

    • @dataschool
      @dataschool  6 лет назад

      Awesome! Glad it's helpful to you!

  • @louisebuijs3221
    @louisebuijs3221 3 года назад +1

    Thanks so much for all these videos! Im doing an internship at a really nice group but they're letting me figure out most of the stuff by myself so this is super useful!

  • @TheGautamj
    @TheGautamj 3 года назад

    Man, he just makes it so easy to learn.
    Wish we had half as good teachers as him in school.

    • @dataschool
      @dataschool  3 года назад +1

      Thank you so much Gautam!

  • @robindong3802
    @robindong3802 6 лет назад +1

    I have to say this is another great lesson by Kevin. Thank you very much indeed.

    • @dataschool
      @dataschool  6 лет назад

      Thanks! I'm glad my videos are helpful to you!

  • @anshusharma-hn3ef
    @anshusharma-hn3ef 6 лет назад

    Thank you so much for putting up this series. I was looking for something basic yet comprehensive and something easy to follow. This is being very helpful to me . Thanks.

    • @dataschool
      @dataschool  6 лет назад

      That's great to hear! You are very welcome.

  • @PankajMishra-ey3yh
    @PankajMishra-ey3yh 8 лет назад +1

    I started after you put a video on how to make submission on kaggle on my request,I did well in last contest and finished 144 in leader board :) All credit goes to you

    • @dataschool
      @dataschool  8 лет назад +1

      Amazing!!! That's great to hear! :)
      For others who might be interested, this is my video about creating Kaggle submissions: ruclips.net/video/ylRlGCtAtiE/видео.html

  • @sharonwaithira
    @sharonwaithira 6 лет назад +1

    I love the series so far. I have learned so much. Thank you for creating these. They are quite easy to follow.

    • @dataschool
      @dataschool  6 лет назад

      Awesome, that's great to hear!

  • @jamesrajendranVideo
    @jamesrajendranVideo 7 лет назад

    Awesome.....Highly effective communication......So for the best of Machine Learning videos......very grateful to the author. The flow and methodology makes Machine Learning look so simple which in fact is quite complex for beginners like me.

    • @dataschool
      @dataschool  7 лет назад

      Thanks so much for your kind comment! I'm glad to hear the machine learning videos have been helpful to you. I know it's complex but you will get it eventually... good luck with your education!

  • @pierrelaurent8284
    @pierrelaurent8284 7 лет назад +2

    great video ! great resources to understand the Bias-Variance trade-off. You are a reference Kevin. Thanks a ton.

  • @tmeloliveira
    @tmeloliveira 7 лет назад +3

    Man, your videos are great! :)
    I'm taking the Machine Learning Nanodegree by Udacity and your videos are an awesome supportive material. Thanks for sharing with us!

    • @dataschool
      @dataschool  7 лет назад

      Thanks for your kind words! You are very welcome :)

  • @jordandixon5307
    @jordandixon5307 6 лет назад +1

    I really disliked machine learning after we got taught it at uni. you really have sparked my interest again thank you so much for this series.

    • @dataschool
      @dataschool  6 лет назад

      That's great to hear! You are very welcome.

  • @vishwass9491
    @vishwass9491 8 лет назад

    I believe the K value you have set for teaching is perfect for my learning. thanks

    • @dataschool
      @dataschool  8 лет назад

      +vishwas s HA! Love a good machine learning joke :)

  • @TheGrebull
    @TheGrebull 5 лет назад

    It used to be hard for me to learn Machine Learning, but now thanks to you it isn't anymore

    • @dataschool
      @dataschool  5 лет назад +1

      Thanks so much! That is awesome to hear 😄

  • @chuanliangjiang7390
    @chuanliangjiang7390 6 лет назад

    This lecture is fantastic and extremely helpful to learn machine learning from scratch, very appreciate to share this wonderful vedio

    • @dataschool
      @dataschool  6 лет назад

      Thanks for your kind comment!

  • @beansgoya
    @beansgoya 6 лет назад +1

    “Overfitting learns the noise of the data, rather than the signals”
    I finally understand what overfitting means.

  • @johnfreitas250
    @johnfreitas250 5 лет назад

    Can't thank you enough for your great tutorials, Kevin. Make everything so clear and understandable. You're awesome man.

  • @amitblizer4567
    @amitblizer4567 Год назад +1

    Thank you very much for this elaborated explanation. Very helpful

  • @gustavomello278
    @gustavomello278 6 лет назад

    Great videos kevin. I like your deliberately slow style. It is hard to improve, but if I may suggest something. As your videos are long, it would be useful if you have an index in the description with links to the times of the subtopics. That would help a lot on review and certainly would increase the number of re-visits.

    • @dataschool
      @dataschool  6 лет назад +1

      Thanks for the suggestion! I know the videos are super long, but ever since making this series, I have tried to make shorter videos.
      And, thanks for the time-coding suggestion! I'll consider it.

  • @zennicliffzennicliff
    @zennicliffzennicliff 5 лет назад

    Thank you very much again! I look forward to learning mode on the various libraries and models for machine -learning, with great examples as usual. Greetings!

    • @dataschool
      @dataschool  5 лет назад

      Thanks for your suggestions! This course might interest you: www.dataschool.io/learn/

  • @libardomm.trasimaco
    @libardomm.trasimaco 7 лет назад

    Wow. This video in particular is one of the most useful videos that I have found in the entire RUclips. Thanks you very much, your a great person and a great teacher!

    • @dataschool
      @dataschool  6 лет назад +1

      Wow, thank you so much! :)

  • @a.s.5961
    @a.s.5961 3 года назад +2

    I love you man, i have watched every single video of yours.

  • @karlavillarrealleal1254
    @karlavillarrealleal1254 Год назад +1

    Thank you for such clear and well done tutorials!

  • @aimene_tayebbey
    @aimene_tayebbey 7 лет назад

    if it means anything to you, i really like the way you put things and simplify them, thanx man

    • @dataschool
      @dataschool  7 лет назад

      Thanks for your kind comment!

  • @sbk1398
    @sbk1398 6 лет назад +1

    my goodness. what intriguing and useful videos. you have a true gift

  • @nikhilsingh5233
    @nikhilsingh5233 7 лет назад

    this is the best machine learning tutorial I have ever gone through.
    thank u so much.👍

  • @emov8326
    @emov8326 7 лет назад +1

    You brought back my motivation to study. I am so Grateful :) Thanks a lot!!!

  • @leowan5033
    @leowan5033 7 лет назад

    A great series for sklearn beginners, and your rate of speech really taken care of people like me, for a moment there I thougth I had a significant progress in hearing, HAHA. Thank you very much from China.

    • @dataschool
      @dataschool  7 лет назад

      You are very welcome! It's great to hear that my delivery works for you! Thanks for your kind comment.

  • @oi4252
    @oi4252 4 года назад +1

    Quarantine with Data School is lit!!

  • @TheReferrer72
    @TheReferrer72 7 лет назад

    Nicely paced set of tutorials. Thanks

  • @balipavankalyan5008
    @balipavankalyan5008 5 лет назад

    you're the right model for us , because u trained such a way

  • @AnuragVarmaP
    @AnuragVarmaP 4 года назад

    very good work man. u even reply to all the messages from the path 3 years. its very helpful.
    make more of these videos

  • @UnknownUserx-xe1tm
    @UnknownUserx-xe1tm 8 лет назад

    You are doing a really good work with concise explantions, thanks a lot !!

    • @dataschool
      @dataschool  8 лет назад +1

      +arab ilies You're very welcome!

  • @medgarfsantos
    @medgarfsantos 8 лет назад

    awesome video. i would also love to see a video regarding SVM kernels, major differences among them, when to choose them, and how the different parameters may affect the classification and the metrics.

    • @dataschool
      @dataschool  8 лет назад

      Glad you liked it, and thanks for the suggestion!

  • @BillTubbs
    @BillTubbs 8 лет назад

    Great video lecture series. Love the slow, clear delivery. I noticed a few deprecation warnings when running the code myself. Is there a forum for reporting technical issues/questions?

    • @dataschool
      @dataschool  8 лет назад

      Glad you like the series!
      Regarding technical issues, you are welcome to log them as issues in my GitHub repo: github.com/justmarkham/scikit-learn-videos/issues. I will eventually update the notebooks to reflect Python 3, and I know the API is changing slightly in the upcoming scikit-learn 0.18 release, so I will address that as well. But I'd love to know the specifics of any errors or warnings that you receive!
      Regarding questions, you can ask them on GitHub, or post RUclips comments, and I'll see them either way. Thanks!

  • @lucassantana7511
    @lucassantana7511 6 лет назад

    Dude, thank you very much for this set of videos. I'm from Brazil and I'm really enjoying youe course.

  • @turnoffthesystem4789
    @turnoffthesystem4789 4 года назад +1

    matee, awesome videos. You saved my ass for an ML deadline. Awesome, really

  • @Chromenia
    @Chromenia 4 года назад +1

    It is just awesome to understand the concept from you. Thanks a ton!

  • @lotushouse85
    @lotushouse85 6 лет назад

    Best tutorial I have ever seen. period.

  • @StanleySI
    @StanleySI 3 года назад

    Stunningly clear logic and structure. Thank you!

    • @dataschool
      @dataschool  3 года назад

      You're very welcome! 🙌

  • @albertovalesalonso
    @albertovalesalonso 7 лет назад

    Great explanation of noise and signal!

  • @uttamraj7653
    @uttamraj7653 6 лет назад

    Really your lectures are AWESOME :-)...........................The way you are explaining is really SUPER ( at every point you are giving reason why you are doing this things makes your lectures UNIQUE from all others resources).
    One more problem which currently i am facing is related to problem solving related to machine learning, so please is it possible to make 4-5 videos in which you explain five different types of problems with five different fields ( like business ,medical,education,banking etc).

    • @dataschool
      @dataschool  6 лет назад

      Thanks for your suggestion!

  • @pradyumna27
    @pradyumna27 8 лет назад +1

    Thanks for the great video series. Please keep uploading .

    • @dataschool
      @dataschool  8 лет назад

      You're very welcome! I have a video series on pandas (for data exploration, cleaning, etc) that might interest you: ruclips.net/p/PL5-da3qGB5ICCsgW1MxlZ0Hq8LL5U3u9y

    • @pradyumna27
      @pradyumna27 8 лет назад

      Certainly it would. Thanks allot.

  • @drumsking10
    @drumsking10 9 лет назад +1

    Hey Kevin, you are creating an excellent resource for those interested in getting started! I have two questions, first I was wondering if you knew of any dataset repositories that we could practice these techniques independently. I like to learn by doing, and having a repository of datasets would be useful. Secondly, do you have a Patreon account or some way I can give some donations to you for these videos? Your work done on these and your other videos is well deserving of at least some sort of contribution to your pocket!

    • @dataschool
      @dataschool  9 лет назад

      drumsking10 Wow, thanks for the kind comments!
      1. The UCI Machine Learning Repository is an excellent collection of datasets. You can filter on task, attribute type, etc., and many of the datasets are well-documented: archive.ics.uci.edu/ml/datasets.html
      2. That's very generous of you! It is a lot of work (10 to 20 hours per video for this series), but it has also been a lot of fun! I don't currently use Patreon, though if you visit my main channel page, there is a box on the right side that says "Support this channel": ruclips.net/user/dataschool

    • @dataschool
      @dataschool  6 лет назад

      I finally launched a Patreon campaign, and I'd love to have your support! www.patreon.com/dataschool/overview

  • @alifazal9062
    @alifazal9062 4 года назад

    you're doing a great job, I would just emphasize on giving more examples that are relatable and speaking like you're talking to another person in the room. I only give feedbacks because thats what I would've wanted from people tuning in.

    • @dataschool
      @dataschool  3 года назад

      Thanks for your suggestions!

  • @sunudaniel2521
    @sunudaniel2521 6 лет назад

    Thank you very much for your valuable time explaining machine learning. I will always think how they predict the future value. Now I know.

  • @VyshfulThinking
    @VyshfulThinking 6 лет назад +1

    i love everything about the series. Thank you very much

  • @preetammishra6468
    @preetammishra6468 5 лет назад

    Your videos are so cool and easy to understand, thanks for uploading,please keep on doing it!

  • @suprotikdey1910
    @suprotikdey1910 7 лет назад

    You did the job quite quick for me to jump start ML. All courses I saw had durations ranging from 3 months to a year.. thank you!!
    do you have any tutorial for neural nets and deep learning??

    • @dataschool
      @dataschool  7 лет назад

      Glad the videos were helpful to you! I don't currently have any tutorials on neural networks or deep learning, but please subscribe to my newsletter for updates on future tutorials: www.dataschool.io/subscribe/

  • @manjuappu89
    @manjuappu89 8 лет назад

    Your Explaination is simply superb. Love to watch some more videos

    • @dataschool
      @dataschool  8 лет назад

      Thanks so much for your kind comment! The entire video playlist for this series is here: ruclips.net/p/PL5-da3qGB5ICeMbQuqbbCOQWcS6OYBr5A

  • @albertteplitsky7629
    @albertteplitsky7629 6 лет назад

    Hey Kevin,
    I was playing around with an SVC model where the testing set was set to 0.25 and random_state =4 and I got a response of 0.99333.... Would I be able to say that SVC is the best way to model this data? I guess my question is at what threshold does the testing accuracy confirm the best model?
    You are an amazing teacher. Thank you for the videos and I cannot wait for the unsupervised learning vids.

    • @dataschool
      @dataschool  6 лет назад

      You will never be provided with a guarantee that you have chosen the best model. There is no threshold that will confirm this.
      The only exception would perhaps be if you can predict something with 100% accuracy all the time. Then, you know you have found the best model.
      Hope that helps!

  • @HP-vy6is
    @HP-vy6is 7 лет назад

    Thank you so much for educating us......The resources are really helpful....Structured lectures are interesting....Please continue videos with
    SciPy
    NumPy
    SciKit

    • @dataschool
      @dataschool  7 лет назад

      Thanks for your suggestions!

  • @jiwon5315
    @jiwon5315 5 лет назад

    I ADORE YOU 💕💕💕 you teach difficult concepts really well. Thank you and i hope you keep posting.

  • @sungdeukpark9299
    @sungdeukpark9299 9 лет назад

    very excited with your tutorial and it's helpful for novice like me.
    I got 2 questions, 'can I know your overall agenda for this tutorial?' and 'are you gonna make a lecture every week?'

    • @dataschool
      @dataschool  9 лет назад

      SungDeuk Park Great to hear! I will generally post a new video every other week. A list of the topics I will be covering during the series is here: blog.kaggle.com/2015/04/08/new-video-series-introduction-to-machine-learning-with-scikit-learn/

  • @odeshsingh2330
    @odeshsingh2330 4 года назад +1

    Very well explained and great teaching style!! I am doing my first pass through your videos. I will go back and enter the python code and run these on my next pass. I was hoping I could find a set of graded exercises at the end of each video. Any thoughts on this ?

    • @dataschool
      @dataschool  4 года назад

      Glad you like the videos! The only course for which I offer exercises is my paid online course, Machine Learning with Text: www.dataschool.io/learn/

  • @galanixstudio673
    @galanixstudio673 6 лет назад

    Hi, Kevin. Let me do some remark. On 3:00 you`ve mentioned that train dataset must take ALL the samples (so including the "test" as well)? The thing if we do so test samples willn`t demostrate any error at all. So that must be said that BEFORE trainig model the whole data should be split into two groups (for training model and for test). So what`s your opinion about that? Yep. And hopefully on 11:00 min that you talked about. Iwas worried that I can be lost. Still that`t totally great as you covered that information. Thanks.

    • @dataschool
      @dataschool  6 лет назад

      In the video, I outline why you should not use evaluation procedure #1. I included it for explanatory purposes.
      Hope that helps!

  • @xiangxinzhang1770
    @xiangxinzhang1770 Год назад +1

    When it comes to using the model for future predictions on real-life data, you can directly use the trained model without retraining it with the whole training data, including the test data. The idea is that the model has learned patterns and relationships from the training data that generalize well to unseen data, including real-life data.
    Retraining the model with the entire dataset, including the test data, is generally not recommended as it may lead to overfitting. Overfitting occurs when the model becomes too specific to the training data, capturing noise and irrelevant patterns, which can reduce its performance on new data.

    • @dataschool
      @dataschool  9 месяцев назад

      Thanks for sharing, but if I'm understanding you correctly, I respectfully disagree.

  • @mariuspy
    @mariuspy 8 лет назад

    Thanks. Clear and concise, straight to the point

  • @brendensong8000
    @brendensong8000 3 года назад +1

    Thank you for the great class! I learned so much from your video!!!

  • @terryliu3635
    @terryliu3635 6 лет назад +1

    Great series of courses on Pandas and Scikit Learn! I’ve been enjoying every video I watched on this channel. Thanks so much!
    On machine learning using Scikit Learn, I’m wondering if you could share a lesson on Random Forest and related concepts. Thanks again.
    Terry

    • @dataschool
      @dataschool  6 лет назад

      Thanks for your kind words! Thanks for your lesson suggestion. I don't have a video about that topic, but check out this page and search for "random forests": www.dataschool.io/start/

    • @terryliu3635
      @terryliu3635 5 лет назад

      Thanks for the reply! Much appreciated!

  • @matthewturnerphd
    @matthewturnerphd 9 лет назад

    I like the videos. Great work! Hope to see more.
    But I do worry a bit that you say you will still be using a lot of train/test split in the future. The problems that this method introduces are well established in the literature, and, given the ease of implementing either a cross-validation or bootstrap in Python/Scikit, it is a good habit for students/beginners to get into. Looking at this from the other side -- working with students who have picked up the train/test split habit from prior classes and online learning -- it is usually very hard to get them to use more valid procedures in their work. And except for exotic circumstances, it is usually not possible to justify using train/test for either real-world or more basic research. (See Hastie, Tibshirani, and Friedman's books for justifications.)
    Still, with that caveat, I do recommend your videos to students! Thank you for your work.

    • @dataschool
      @dataschool  9 лет назад

      +Matthew Turner I do appreciate the point. However, there are goals for model evaluation beyond just producing the most reliable estimates of out-of-sample error. For example, error diagnosis often requires looking at the confusion matrix, and while train_test_split makes this easy, cross_val_score does not.
      As well, cross_val_score needs to be used as part of a pipeline if you have any preprocessing, such as feature extraction or feature standardization. Teaching pipeline adds complexity that I find most students struggle with early on. That is another reason I use train_test_split, because you can do proper model evaluation (that includes preprocessing) without pipeline.
      All in all, it comes down to one's priorities, as well as the Python and scikit-learn fluency of one's students. I do appreciate your perspective, and am aware of the tradeoffs, but I've made a purposeful choice in this area based on my educational priorities and the backgrounds of the students I hope to reach.
      Thanks for sharing my videos with others! I appreciate it.

  • @yrmh1
    @yrmh1 9 лет назад

    Thanks for your informative video. I'm looking forward to your video on k-fold CV.

  • @nishagorasia2416
    @nishagorasia2416 6 лет назад

    Wow Videos...Great for reference. Really appreciate your efforts in making such videos which proves very beneficial to beginners like me. Please keep sharing your knowlegde to us through such excellent videos.

  • @ariamcmann6389
    @ariamcmann6389 8 лет назад +1

    This man speaks so mechanically I feel like I'm in an English as a Second Language class.

    • @dataschool
      @dataschool  8 лет назад +2

      Actually, English is a second language for around 50% of my viewers, and many of them have commented that they appreciate that I speak in a clear manner. For native English speakers that find the videos move too slowly, it can be helpful to use the RUclips controls to speed up the videos.

  • @andreyzinchenko6208
    @andreyzinchenko6208 8 лет назад

    Man, thanks for what you're doing. It's a really great help for people

    • @dataschool
      @dataschool  8 лет назад

      You're very welcome! Thanks for your comment.

  • @yujerry9992
    @yujerry9992 8 лет назад

    your kind words sound beautiful!

  • @hrishikeshtele
    @hrishikeshtele 6 лет назад

    Really great videos. Very informative videos for newbies like me. Thanks!!!!

    • @dataschool
      @dataschool  6 лет назад

      You're welcome! Thanks for your kind comment!