How to Clean Up Messy Excel files in R | 7 Easy Strategies

  • Published: Aug 23, 2024

Comments • 25

  • @rappa753
    @rappa753  2 months ago

    If you enjoyed this video, I am certain that you're going to love my Data Cleaning Master Class. It shows you even more advanced tricks for Excel file and text cleaning. You can check it out at data-cleaning.albert-rapp.de/

  • @rcepka
    @rcepka 1 year ago +5

    Great video, many thanks. If I may offer a suggestion: for us less advanced R users :), I would like to see the individual steps you perform explained in more detail, for example the regex.

    • @rappa753
      @rappa753  1 year ago +1

      Hi Robert, thanks for reaching out. It is indeed hard to find a balance between what to explain and what to leave to the user 🙈 As for your question, the call you mentioned is constructing a regular expression using the glue function. A great guide on regular expressions is r4ds.hadley.nz/regexps.html
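
As a rough sketch of what "constructing a regular expression using the glue function" can look like: the `months` vector, the pattern, and the sample strings below are made up for illustration and are not taken from the video.

```r
library(glue)

# Hypothetical building blocks for the pattern (not from the video)
months <- c("Jan", "Feb", "Mar")

# glue() evaluates R code inside single braces, so paste0() can build the
# alternation "Jan|Feb|Mar". Literal regex braces must be doubled: {{4}} -> {4}.
pattern <- glue("^({paste0(months, collapse = '|')})_\\d{{4}}$")

# as.character() drops the glue class before the pattern goes to grepl()
matches <- grepl(as.character(pattern), c("Jan_2023", "Apr_2023", "Feb_23"))
```

Only the first sample string matches, since "Apr" is not among the listed months and "Feb_23" lacks a four-digit year. `glue::glue_collapse(months, sep = "|")` would be a glue-native alternative to `paste0()` here.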

  •  1 year ago +6

    Date columns in Excel are nightmare fuel, luckily we have janitor::excel_numeric_to_date(). Great video!

    • @rappa753
      @rappa753  1 year ago

      It is indeed 😀 Janitor's convenience functions are super great for fixing these types of problems.
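
For readers who have not used it, a minimal sketch of `janitor::excel_numeric_to_date()` on a date column that was read in as numbers. The serial numbers are made-up examples, and the default `date_system = "modern"` is assumed to fit the file.

```r
library(janitor)

# Excel stores dates as day counts; these serial numbers are made-up examples
serials <- c(44927, 44928)

# The default date_system = "modern" assumes Excel's 1900-based date system
dates <- excel_numeric_to_date(serials)
```

Here the serial number 44927 corresponds to 2023-01-01. Files that use the older Mac date system would need `date_system = "mac pre-2011"` instead.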

  • @muhammedhadedy4570
    @muhammedhadedy4570 6 months ago

    Excellent tutorial. I literally watch it every day so that I can learn it very well. Would you please make more of these cleaning data tutorials?

    • @rappa753
      @rappa753  6 months ago +1

      Happy to have a fan 😊 I'm planning on making a data cleaning course but that's still a bit in the future 😅

    • @muhammedhadedy4570
      @muhammedhadedy4570 6 months ago

      @rappa753
      Excellent. Can't wait to join.

  • @dasrotrad
    @dasrotrad 1 year ago +2

    Very nice. You sure packed a lot into this 7:44 min:secs. Thank you.

    • @rappa753
      @rappa753  1 year ago

      Glad that you like it, Robert 😊

  • @haraldurkarlsson1147
    @haraldurkarlsson1147 5 months ago

    Nice. I was a bit surprised to see paste0 inside a glue function. I thought glue was used instead of paste and was simpler. It is also worth noting, as you probably know, that there are three packages specifically aimed at dealing with Excel files (even messier than the ones you handled, for instance with subheaders at different levels throughout the document). These are tidyxl, unheadr, and unpivotr. The tidyxl package actually works on the "cellular" level, that is, it allows you to deal with the contents of the Excel cells and manipulate them. Tidyxl is powerful but takes experience to work with. It would be nice to see a video on tidyxl. Thanks.

  • @Sensemakingdata
    @Sensemakingdata 1 year ago +1

    Great video, well done!

  • @MKhan-zo8xo
    @MKhan-zo8xo 6 months ago

    YES! WOW! I didn't know I needed this video

    • @rappa753
      @rappa753  6 months ago +1

      Nice, looks like you got a lot out of this video. Thanks for sharing that with me 🤗

  • @ahmed007Jaber
    @ahmed007Jaber 11 months ago +1

    Excellent one. thank you Albert

    • @rappa753
      @rappa753  11 months ago

      Glad that you like it 😊

  • @hassanhijazi4757
    @hassanhijazi4757 11 months ago +1

    Thank you very much. Very informative.

    • @rappa753
      @rappa753  11 months ago +1

      Thank you, Hassan. Glad that you enjoy my video!

  • @Aaqib..
    @Aaqib.. 1 year ago +1

    So much information packed in, you're the best

    • @rappa753
      @rappa753  1 year ago

      Thank you, that's very kind of you 😊

  • @andrea-mj9ce
    @andrea-mj9ce 1 year ago +2

    A proper way is to use the `tidyxl` package

    • @rappa753
      @rappa753  1 year ago +1

      Uhh {tidyxl} sounds promising. It sounds a bit like an add-on to {readxl}. Thanks for the hint 🙂

  • @johnkaruitha2527
    @johnkaruitha2527 1 year ago +1

    Great job Albert. But next time you use setwd() I will look for you and burn your computer (Jenny Bryan) (projects are better).

    • @rappa753
      @rappa753  1 year ago

      😀 I am on board with not using setwd() for static file paths. I think that's what Jenny meant 🤔 I do love projects but still sometimes navigating inside of them is unavoidable. That's why I use here() from the {here} package to make file paths relative.
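
A small sketch of the `here()` approach described in this reply, assuming the script runs inside a project. The folder and file names are hypothetical.

```r
library(here)

# here() builds a path from the project root, wherever the script itself lives.
# "data" and "messy_file.xlsx" are hypothetical names used for illustration.
excel_path <- here("data", "messy_file.xlsx")

# The path can then be passed to a reader, e.g. readxl::read_excel(excel_path)
```

Because the path is resolved from the project root, the script keeps working when the project folder is moved or shared, which is the usual argument against a hard-coded `setwd()`.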