Tidy Data and Why We Need It!

Поделиться
HTML-код
  • Опубликовано: 21 авг 2024

Комментарии • 20

  • @adarshkannan1441
    @adarshkannan1441 10 месяцев назад +2

    Great explanation

  •  Год назад

    11:19 It's absolutely worth the time it takes to convert messy data into tidy data. Great video, thanks!

  • @nicolaieg9639
    @nicolaieg9639 5 месяцев назад

    Thank you very much - very useful!!

  • @dmitry503
    @dmitry503 2 года назад +1

    I think a lot of statistics students would get disappointed after watching this video ))) No comments?? Thank you for the video!

    • @yuzaR-Data-Science
      @yuzaR-Data-Science  2 года назад +2

      Thanks for the feedback! :)

    • @fishfish20
      @fishfish20 2 года назад +2

      #Dimtry, I beg to disagree respectfully, it depends on what you're looking for (personal preference). Bear in mind, no knowledge is useless.
      To sum it up, it's just your own hypothesis.
      Cheers 🥂

    • @dmitry503
      @dmitry503 2 года назад +1

      @@fishfish20 Thanks Jonathan! That was just a reply to the author's joke in the video )) not criticizing its content )

    • @fishfish20
      @fishfish20 2 года назад +3

      @@dmitry503.. 😂 😂 😂 😂 😂Noted but the first part of the statement takes the joke out of context. 😂 Thanks for the clarification

  • @yaoliao3517
    @yaoliao3517 2 года назад

    There are 5 representive examples of messy data :
    Column headers are values, not variable names.
    Multiple variables are stored in one column.
    Variables are stored in both rows and columns.
    Multiple types of observational units are stored in the same table.
    A single observational unit is stored in multiple tables.

    • @yuzaR-Data-Science
      @yuzaR-Data-Science  2 года назад

      Yes, the 5 major ones, but I have seen over 300 "creative" tables :) if I used all of the possible mistakes, the video would be one hour :)

    • @yaoliao3517
      @yaoliao3517 2 года назад

      @@yuzaR-Data-Science Yeah, thanks for your tutorial.

    • @yuzaR-Data-Science
      @yuzaR-Data-Science  2 года назад

      Welcome!

  • @yaoliao3517
    @yaoliao3517 2 года назад

    Oh, this is really great!

  • @fishfish20
    @fishfish20 2 года назад +1

    Greetings sir. I have a question, do we have a package in r that can compute two way anova and give post hoc at the sane time like what ggbetweenstats function does?
    Looking forward to your reply.
    Thank you

    • @yuzaR-Data-Science
      @yuzaR-Data-Science  2 года назад +1

      unfortunately not a single package, but you can easily do that:
      - first you create a (correct) model (lm, glm, glmer etc.) with two predictors (with or without interaction)
      - then you visualize it with sjPlot packge using plot_model() command
      - finally you use emmeans package and emmeans function from it, to get pairwise comparisons
      I have an article with lots of examples and code here: yuzar-blog.netlify.app/posts/2021-01-01-how-to-visualize-models-their-assumptions-and-post-hocs/
      Hope that helps and thank you for watching the videos - it's the best support! :) cheers

    • @fishfish20
      @fishfish20 2 года назад

      @@yuzaR-Data-Science Thank you so much Sir for the clarification and continuous efforts.
      Regards
      Jonathan