Seaborn boxplot | Box plot explanation, box plot demo, and how to make a box plot in Python seaborn

Поделиться
HTML-код
  • Опубликовано: 23 дек 2024

Комментарии •

  • @kamranibrahimov4790
    @kamranibrahimov4790 4 года назад +32

    The best explanation of boxplots!! Keep doing videos, please)

    • @KimberlyFessel
      @KimberlyFessel  4 года назад +4

      Thank you -- glad it was helpful! Will do!

    • @baozixiao6733
      @baozixiao6733 2 года назад

      @@KimberlyFessel same feeling, best explanation! You are a life saver!

  • @ouse4
    @ouse4 Год назад +3

    Definitely the best explanation of box plot so far.
    Thanks a million

  • @tocherss
    @tocherss Год назад +1

    Great explanation. Just a typo in 1:54 where it should be 'Interquartile Range' instead of 'Inner quartile Range'. Very useful video!

  • @syedjamal9526
    @syedjamal9526 Год назад +3

    I really loved the way she explained every point. it's amazing. I will share this channel with my friends who need help regarding seaborn.

  • @marcuchaperu
    @marcuchaperu Год назад

    Not only your video is great but also the the files at github.... thank you very much!!!!

  • @jenex6918
    @jenex6918 Год назад

    I have been trying to figure out this problem for my capstone for the past 4 hours and you made everything so simple! I cannot thank you enough

  • @AE-pv9vc
    @AE-pv9vc 3 года назад

    I've been hunting youtube, internet, books, and all of them were fairly crappy at explaining why/how to do this (without proper background before diving into it). This was a very 'teaching you to fish' type of video- thanks so much.

    • @KimberlyFessel
      @KimberlyFessel  3 года назад

      Thank you very much for the compliment! Glad to hear this helped 😀

  • @raveeshmalhotra7347
    @raveeshmalhotra7347 3 года назад +5

    that was a really great explanation. loved the content organization and planning. thank you so much

    • @KimberlyFessel
      @KimberlyFessel  3 года назад +1

      So glad you enjoyed the explanation! I create an outline for the structure of each video - so good to know that is effective so far!

  • @sauravsingh8967
    @sauravsingh8967 3 года назад

    The numbers (views and likes) don't do justice to the quality of the information provided in the video.
    Keep up the good work!

  • @geogeo14000
    @geogeo14000 3 года назад +1

    The best explanation of this seemingly tricky stuff I've ever seen, thank you so much ! Clarity, every step is slowly explained, and the illustration are great. Awsome, thank you very much !

  • @danielflorea3001
    @danielflorea3001 2 года назад +1

    Concise and solid explanations. Extremely useful. Thanks

  • @t.t.cooperphd5389
    @t.t.cooperphd5389 4 года назад

    You blew my mind with the order feature. You don't know how many data frames I have rearranged!

  • @acharjyaarijit
    @acharjyaarijit 2 года назад

    Thankyou Kimberly Fessel for your wonderful video. The way of your presentation and contents are excellent.

  • @Rahul-qe9gc
    @Rahul-qe9gc 4 года назад +1

    The best video on boxplot ,I have ever seen on RUclips❤️❤️ keep doing

    • @KimberlyFessel
      @KimberlyFessel  4 года назад

      So glad you enjoyed the video -- will do!

  • @Couplestravelling
    @Couplestravelling 3 года назад

    This was the best video explanation of box plot. Thanks.

    • @KimberlyFessel
      @KimberlyFessel  3 года назад +1

      Thank you - very glad my explanation was helpful!

  • @ajaychouhan3354
    @ajaychouhan3354 2 года назад

    This is the best explanation of concept and the code too. Keep it up. You deserve more followers! Keep it up!

  • @TaxMentors
    @TaxMentors Год назад

    Your video is more informative. Please make video regularly. Thanks

  • @ashwinikumar6461
    @ashwinikumar6461 Год назад

    Also would like to know regarding the customization of color options....

  • @firstlast-wz9jv
    @firstlast-wz9jv Год назад

    outstanding explanation ! ... Thank you !

  • @googlegoogle1610
    @googlegoogle1610 2 года назад

    wow, very nice explanation..........you are the best

  • @ruairiomalley8616
    @ruairiomalley8616 Год назад +1

    Why doesn't sns.boxplot(x=cars.origin, y=cars.mpg); give the following error?
    TypeError: Neither the `x` nor `y` variable appears to be numeric.

  • @umarpathologist5863
    @umarpathologist5863 Год назад

    Thanks for nice video..
    I got some additional information but not the answer of my search.
    I am a new learner and I wanted to change the outline colors of my box plot and the median line color

  • @vikash3589
    @vikash3589 3 года назад

    your videos and teaching are as perfect as you.
    Thanks you very much :)

    • @KimberlyFessel
      @KimberlyFessel  3 года назад

      Thank you so much - glad to hear you are enjoying the videos!

  • @ashwinikumar6461
    @ashwinikumar6461 Год назад

    Awesome presentation , short, crisp and clear. Thanks a lot and appreciated from my heart.. Why don't tryout for scikit learn and Pandas libraries

  • @ekaterinavolkova4348
    @ekaterinavolkova4348 2 года назад

    Hi, Kimberly! In video you says that whiskers low limit equals Q1-1,5*IQR, whiskers upper limit equals Q3+1,5*IQR.
    08:18 we see
    Q1 = 17
    Q2 = 23
    Q3 = 29
    IQR = 29 - 17 = 12
    whisker low limit = Q1-1,5*IQR = 17 - 1,5 * 12 = -1
    whisker upper limit = Q3-1,5*IQR = 29 + 1,5 * 12 = 47
    05:12 we see whisker's limits are not equal -1 and 47.
    Why is it?

    • @atharvapuranik5172
      @atharvapuranik5172 2 года назад +1

      This is because the minimum value in the dataset is 9 and the maximum value is 46. So, there's no need to stretch the whisker from -1 to 47.

  • @daniel_asg
    @daniel_asg 4 года назад

    I have been learnt a lot with your videos. Thank you very much.

  • @rhumbertgz
    @rhumbertgz 4 года назад +1

    Thanks for this instructive video 👏🏻

    • @rhumbertgz
      @rhumbertgz 4 года назад

      I was wondering if you can give some feedback on the following situation: Sometimes the mean value is greater than and 50% percentile representation of my boxplot. For example, mean=4.142857 and 50% = 3.000000. Is this right? Don't they have to have the same or approximate the same value?

    • @KimberlyFessel
      @KimberlyFessel  4 года назад +2

      Good question and this is definitely possible. The 50% percentile represents the median of your data, which is calculated differently than the mean. You likely have some very large values that are making the mean higher. For example, the median (50% percentile) of [1, 3, 8] is the middle value 3, but the mean is (1 + 3 + 8)/3 = 4. Usually the mean and median will be close together, but this isn't guaranteed since outliers will influence the mean more than the median.

    • @rhumbertgz
      @rhumbertgz 4 года назад +1

      @@KimberlyFessel thanks for your time and explanation

  • @muhammadowaiskhan6831
    @muhammadowaiskhan6831 2 года назад

    very well explained, thank you very much. i highly appreciate it.

  • @kamilaknapova8557
    @kamilaknapova8557 Год назад +1

    perfect tips !

  • @dr_flunks
    @dr_flunks 2 года назад

    perfect explantion, nicely done!

  • @eatbreathedatascience9593
    @eatbreathedatascience9593 3 года назад

    Thanks very much. Your video is excellent.

  • @jitendratrivedi7889
    @jitendratrivedi7889 2 года назад

    Nice explanation .

  • @nukewares
    @nukewares 3 года назад

    Excellent work!

    • @nukewares
      @nukewares 3 года назад

      Do you know of a good way to add a marker that shows how a certain value sits on the box plot? For example the most recent value in a time series vs where it fits against the distribution of historical values?

    • @KimberlyFessel
      @KimberlyFessel  3 года назад

      Oh cool - that seems super useful! I guess you can always plot seaborn figures on top of each other. So I might make a box plot and then put something like a scatter plot on top of that. Then for the scatter plot you could just plot a subset of the data if you want. For example, if df is the famous seaborn tips dataset:
      sns.boxplot(x='time', y='tip', data=df)
      sns.scatterplot(x='time', y='tip', data=df.iloc[-1:], color='black');

    • @nukewares
      @nukewares 3 года назад

      @@KimberlyFessel thanks, that scatterplot code works quite well. I find this is a great alternative way to show how a data point compares to historicals in a time series, as opposed to the normal way of showing a line chart. It's much easier on the eyes to show a box plot in my view (especially if there are hundreds of data points to compare against).

  • @fatemesadatalikhani7326
    @fatemesadatalikhani7326 3 года назад

    thank you for the good explanation

  • @aniruddhmaitra1728
    @aniruddhmaitra1728 2 года назад +1

    Great tutorial!! I had a doubt. Is there a way we can visualise the data points themselves, on the boxplot?

  • @danielpereira8022
    @danielpereira8022 3 года назад

    I just loved it!

    • @KimberlyFessel
      @KimberlyFessel  3 года назад

      Hooray -- thank you! Glad you enjoyed the video. 😀

  • @shreenaj7664
    @shreenaj7664 3 года назад

    Extremely helpful! Thanks a lot! ❤❤

    • @KimberlyFessel
      @KimberlyFessel  3 года назад +1

      Most welcome - glad to hear it was helpful! 😄

  • @youcefyahiaoui1465
    @youcefyahiaoui1465 2 года назад

    Hi Kimberly, I have to admit that your tutorials are probably some of the top notch teachings I have ever seen. No matter how I try, I would not be able to thank you enough. I became a true seaborn fan and I absolute love for its efficiency, versatility and its ease of use.
    I have a question though. The majority of what you showed in boxplots or other seaborn plotting methods (FacetGrid, lineplot, hist, kde), you hue/split the data by category or assume there is a viable that's a category. What I am dealing with are datalog files where the only category are the tested part serial numbers that are in one-to-one relationship to the corresponding numbers in the parameter columns. I have different test name in different columns. For example, There is a column that has test results for those serial numbers at one voltage condition, then another column for the same test but @ different voltage conditions etc. It's all columns. Those voltage conditions are not in rows. Can you have an example illustrating how I could use a box plot and plot data from different columns on a single figure?
    Thank you so much,
    Keep up the good work.
    Best regards,
    Youcef

    • @youcefyahiaoui1465
      @youcefyahiaoui1465 2 года назад +1

      I think I was able to answer my own question. But if there is a more efficient way of doing this, it would be great to read your reply. I extracted my columns of interest with columns I needed to keep constant data id's. I used pd.melt to convert renamed columns (from wide to long). I merged the dataframes corresponding to different column variable names, then used your seaborn boxplot method. Thanks.

  • @sarbanchakraborty2053
    @sarbanchakraborty2053 3 года назад

    Really great & helpful!!!

    • @KimberlyFessel
      @KimberlyFessel  3 года назад

      Thank you - very glad to hear it helped 😄

  • @Ihsan_almohsin
    @Ihsan_almohsin 3 года назад

    well done big thanks

  • @sreenathsj4064
    @sreenathsj4064 3 года назад

    Loved it mam😊

  • @susanavelasquezfernandez5089
    @susanavelasquezfernandez5089 4 года назад

    Very gratefull of your job!

  • @1Aluisio
    @1Aluisio 3 года назад

    Thank u! It helps me a lot

    • @KimberlyFessel
      @KimberlyFessel  3 года назад

      Excellent -- glad to hear that it helped!

  • @av5431
    @av5431 2 года назад

    Thanks a lot @Kimberly Fessel.

  • @jas.sin83
    @jas.sin83 3 года назад

    Is there a way to code change plot window size?

    • @KimberlyFessel
      @KimberlyFessel  3 года назад +1

      Yes you can! I often use matplotlib's pyplot module to change the size of my seaborn figures. Adding a line before your seaborn plot like plt.figure(figsize=(6, 3)) will update the figure size to 6 inches wide and 3 inches tall. My video about the matplotlib figure size might also be helpful: ruclips.net/video/UUy6_ElQXBY/видео.html

  • @espua72
    @espua72 4 года назад

    excellent job

  • @Gebev
    @Gebev Год назад

    Amazing!
    (It is "interquartile" not "inner quartile", American accent might make you confuse the two)

  • @pakaponwiwat2405
    @pakaponwiwat2405 Год назад

    Thank you

  • @mostafatarek5088
    @mostafatarek5088 Месяц назад

    02:00 nice explaination

  • @मयंकपठानियाँ

    amazing

  • @TaxMentors
    @TaxMentors Год назад

    Please start making videos again

  • @zouhir2010
    @zouhir2010 3 года назад

    great

  • @MarC-p4u1b
    @MarC-p4u1b 5 месяцев назад

    I find your videos so interesting, I wish you could translate them into Spanish.

    • @KimberlyFessel
      @KimberlyFessel  5 месяцев назад

      Oh how I wish I knew Spanish! Glad to hear you like the videos though!

    • @MarC-p4u1b
      @MarC-p4u1b 5 месяцев назад +1

      @@KimberlyFessel Thank you. Use an AI translator to translate your videos from English to Spanish.

  • @ahmedifhaam7266
    @ahmedifhaam7266 2 года назад

    the data has proves, Japan cars are best in the world.

  • @danielrubio6220
    @danielrubio6220 2 года назад

    Thanks, excellent explanation, great video!