R demo | Deep Exploratory Data Analysis (EDA) | explore your data and start to test hypotheses

Поделиться
HTML-код
  • Опубликовано: 14 июл 2024
  • In this video, I'll provide the simplest and the most effective ways to explore data in R, which will significantly speed up your work. Moreover, we'll go one step beyond EDA by starting to test our hypotheses with simple statistical tests... that's what I call "deep" here.
    If you only want the code (or want to support me), consider join the channel (join button below any of the videos), because I provide the code upon members requests.
    You of coarse don't need to see the whole thing:
    Timetable:
    00:50 - automated reports as a separate video ( • R demo | Automated Exp... )
    02:44 - exploring categorical variables
    05:00 - descriptive statistics with some basic tests, like Fishers and Chi-Square, Mann-Whitney and Kruskal-Wallis tests
    09:15 - explore distribution with skewness and kurtosis tests
    13:03 - explore normality with Quantile-Quantile plots and Shapiro-Wilk normality test
    17:20 - compare groups with box-plots and non-parametric tests, like Mann-Whitney and Kruskal-Wallis
    19:12 - explore and visualize correlations and get correlation coefficients, confidence intervals and p-values
    24:03 - explore linearity of data with non-linear models and ggplot2 package
    24:30 - explore and impute NAs as a separate video ( • R demo | How to impute... ) and outliers (part of this video)
    Here is the code and article about or instead of this video: yuzar-blog.netlify.app/posts/...
    By the way, one of the songs playing in the background is called ”Hypothesis” by the artist Vincent Rubinetti 😉
    Music by Vincent Rubinetti
    Download the music on Bandcamp:
    vincerubinetti.bandcamp.com/a...
    Stream the music on Spotify:
    open.spotify.com/album/1dVyjw...
    Enjoy! 🥳

Комментарии • 60

  • @jeromebradenbaugh2637
    @jeromebradenbaugh2637 Год назад +5

    why you don't have millions of subs is beyond me. this and your other videos are exceptional. i've used R for several years and not been exposed to some of these packages. your style is clear and concise. thank you for quality content.

    • @yuzaR-Data-Science
      @yuzaR-Data-Science  Год назад +1

      Thanks for such a nice feedback, Jerome! That means the world to me! Well, I think, Stats is not as sexy of a topic for RUclips algorithm, R is less popular as python, and my channel is still young and has not enough videos. So, if you could spread the message and share videos, I would highly appreciate this. But the best support from you is - watching my videos (there are more to come)! Kind regards!

  • @famahsnazer3494
    @famahsnazer3494 5 месяцев назад +2

    Nice and fantastic; Exceptional! Clear and easy and funny way to explain thinks. I found this tutorial like a gold! Waou

    • @yuzaR-Data-Science
      @yuzaR-Data-Science  5 месяцев назад

      Glad you enjoyed it! It’s one of my oldest videos though. The newest might also be useful. Thanks for watching!

  • @harryporter9965
    @harryporter9965 2 года назад

    This is an amazing EDA video I have ever watched!

  • @juanfederos1969
    @juanfederos1969 2 года назад

    Wow, so many valuable information that I keep on pausing the video every two minutes to jot stuff down. Thank you for this great piece of instruction, and thank you for your other great videos, too! It really is well appreciated!

    • @yuzaR-Data-Science
      @yuzaR-Data-Science  2 года назад

      Thanks for such a nice feedback, Juan! There is a link to a blog article below the video, where you can more conveniently copy-paste the R code. But I also wouldn't mind if you rewatch the video from time to time ;) that helps the channel grow. Cheers

  • @dmitrykozhevnikov6196
    @dmitrykozhevnikov6196 2 года назад

    Юрий, спасибо за видео!

    • @yuzaR-Data-Science
      @yuzaR-Data-Science  2 года назад

      На здоровье! Рад что оказалось полезным 😊 Спасибо за фидбэк!

  • @angezoclanclounon1751
    @angezoclanclounon1751 3 года назад

    Great tutorial! Thanks indeed, Dr. Yury.

  • @oluremiabayomi5924
    @oluremiabayomi5924 2 года назад

    Great job. From one professor to another.

  • @t.n.strijker5114
    @t.n.strijker5114 2 года назад

    This is gold, great job! It will help me make beautiful graphs as I study the correlation between anti-oxidants levels of plants in response to abiotic stress. Thank you Mr. Zablotski.

  • @saygindiler3928
    @saygindiler3928 2 года назад

    Thank you. Perfect video. Great explanation.

  • @chacmool2581
    @chacmool2581 Год назад

    Excellent!

  • @eyadha1
    @eyadha1 2 года назад

    So helpful. great video. Many thanks !!

    • @yuzaR-Data-Science
      @yuzaR-Data-Science  2 года назад

      Glad it was helpful! You might have a look at the blog article, for your convenience in copy-pasting R Code. Thanks for the feedback! yuzar-blog.netlify.app/posts/2021-01-09-exploratory-data-analysis-and-beyond-in-r-in-progress/

  • @sn9751
    @sn9751 Год назад +1

    You are gifted and very good tutor, I really enjoyed watching this video , your way to present the information is clear, short and hit the point, besides, you provide some useful links to further info about any topics that needs more time to be explained.
    Thank you

    • @yuzaR-Data-Science
      @yuzaR-Data-Science  Год назад +1

      Thanks a ton! It was one of my oldest videos, and I think the newest are better, especially in terms of editing. So, feel free to check them out and thank you for watching

  • @muhammedhadedy4570
    @muhammedhadedy4570 Год назад

    What an amazing tutorial. Your RUclips channel deserves to have millions of subscribers. The way you present the topic is simple and focused. No much talking and directed to the point. I really love the way you explain your tutorials.
    I wish you all the best in your life, my dear professor.

    • @yuzaR-Data-Science
      @yuzaR-Data-Science  Год назад

      Thanks a lot, Muhammed! This is the best feedback I have ever got! :) And thank you for watching - this is the best support. And if you can spread the message by sharing my videos with people who might find them useful, that would be amazing! Cheers

  • @arseneadou9602
    @arseneadou9602 2 года назад

    Great Job 🎉🎉🎉

  • @rihanamsadek480
    @rihanamsadek480 3 года назад +1

    Thank you!

  • @yeah_reee
    @yeah_reee 9 месяцев назад

    i hope the creator of this videos reads this because i THANK YOU SO SO MUCH!!!
    You did more than a spectacular job describing everything in just 28 minutes! you've earned my sub just from the introduction.

    • @yuzaR-Data-Science
      @yuzaR-Data-Science  9 месяцев назад

      Wow, thank you for such a nice feedback! I am very glad my content is useful! Thanks for watching!

  • @rashawnhoward564
    @rashawnhoward564 3 года назад

    Thank you for this great video! You have a new subscriber!

  • @kapilrana1153
    @kapilrana1153 2 года назад

    Thankyou so much

  • @johnkaruitha2527
    @johnkaruitha2527 Год назад

    This is a serious video

    • @yuzaR-Data-Science
      @yuzaR-Data-Science  Год назад

      Thanks a lot John. If you like this one, you might also like the newest on my channel. Cheers

  • @siriyakcr
    @siriyakcr 3 года назад

    Super 🥰🥰🥰

  • @dinohadjiyannis3225
    @dinohadjiyannis3225 Месяц назад

    brilliant stuff. Very easy to follow. Can you create a dedicated playlist regarding machine learning models ?

    • @yuzaR-Data-Science
      @yuzaR-Data-Science  Месяц назад +1

      Great suggestion! I'am on it, just made some videos on linear regression in R. Logistic will follow and then the rest of ML one day. Thanks for nice feedback and for watching!

    • @dinohadjiyannis3225
      @dinohadjiyannis3225 Месяц назад

      @@yuzaR-Data-Science you have no idea how easy these are to follow. thanks again. keep it up.

    • @yuzaR-Data-Science
      @yuzaR-Data-Science  Месяц назад

      thanks! do my best to upload more, but this summer is pretty busy.

  • @bjrh1052
    @bjrh1052 Год назад

    Great video as always.
    I have a question. How do you visualize several graphs in the code editor? Wich addin or package do you use??
    Thanks in advance

    • @yuzaR-Data-Science
      @yuzaR-Data-Science  Год назад

      Thanks 🙏 I am not sure, what you mean with the code editor, but ggplot2 package with facet_wrap or sjPlot package with grid … something can do that.

  • @dmitry503
    @dmitry503 2 года назад

    Юрий, спасибо большое за такой контент! Приходится конспектировать )) Я понимаю, что вы организовали видео по логике анализа, но все-таки хотелось бы как-то организовать часть про пакеты тоже. Их много, они полезные, но как-то путаются с функциями по ходу повествования. Прошу прощения за критику, но только потому что очень интересно )

    • @yuzaR-Data-Science
      @yuzaR-Data-Science  2 года назад +1

      Спасибо, Дмитрий! Согласен. Надо было делать несколько малых видео, вместо одного большого. Про отдельные полезные пакеты видео тоже есть, например performance, janitor, dlook, data-explorer etc. . Так же отдельные видео про автоматические анализы и imputating missing values. Если вы знаете очень полезные пакеты, пишите, попытаюсь сделать ревью.

    • @yuzaR-Data-Science
      @yuzaR-Data-Science  2 года назад +1

      кстати, забыл сказать, в описании к видео почти всегда есть линк к блогу, где всё то же самое что и в видео, только в можно сразу брать и использовать код

    • @dmitry503
      @dmitry503 2 года назад

      @@yuzaR-Data-Science Спасибо за ответ, Юрий! Буду смотреть и напишу, если что обнаружу. Уже впечатлился smartEDA после вашего обзора.

  • @chacmool2581
    @chacmool2581 Год назад

    8:11 Those plots much better with 'browser', 'viewer' or 'render' option.

    • @yuzaR-Data-Science
      @yuzaR-Data-Science  Год назад

      hey Chac, thanks, but I could not use those options with dfSummary() command. Or did you meant some other command?

    • @chacmool2581
      @chacmool2581 Год назад

      Sorry, you are right. To get dfSummary() to look good, you need to wrap it around the print() function with the print() function's method parameter set to 'render' (for RMarkdown), 'viewer' (for RStudio) or 'browser' to export it to a web browser. Like so:
      print(dfSummary(StudentData,
      ...),
      method = "render")

    • @yuzaR-Data-Science
      @yuzaR-Data-Science  Год назад

      All right, now I see what you mean. Thanks! Good to know, since I also often produce HTML with all the results. However, last time I caught myself using only describe() from dlookr package and tbl_summary() from gtsummary. They are amazing!

  • @Dominus_Ryder
    @Dominus_Ryder 16 дней назад

    The code link in the description does not seem to be working anymore, do you have an updated one by any chance?

    • @yuzaR-Data-Science
      @yuzaR-Data-Science  13 дней назад

      Hi Dominus, unfortunately it was blocked for too much traffic. I’ll try to reopen it ASAP with free alternative, but in the meanwhile please just rewatch the videos, because my blog is literally the script for them, so you won’t miss anything. I know that it might be cumbersome to write down the code from the script, so, I am sorry for that! But if you wanna get the written whole code now, consider to join my channel to become a member, because I post the code to members upon request. And members can see other code-posts I've posted on other videos already. Cheers

  • @titlov123
    @titlov123 3 года назад

    выкладывайте пожалуйста , код по видео ...спс

    • @yuzaR-Data-Science
      @yuzaR-Data-Science  3 года назад

      самого кода слишком много, поэтому в описание под видео линк к блог посту со всем кодом. Спасибо за комментарий! 😊

  • @dominiccomtois7850
    @dominiccomtois7850 Год назад +1

    try view(dfSummary(data)) to get an html version of the table; its graphs are much prettier. Documentation provides instruction on how to integrate this in Rmd documents, too.