Using R to Analyze COVID-19 | R Programming Project

  • Published: Oct 10, 2024
  • In this video, we see how to use the R programming language to perform statistical analysis on a COVID-19 (coronavirus) dataset. This is the second video in the R introduction for beginners series. It is meant to be an R programming tutorial in which I show you the basics of importing a dataset and performing data analysis. You will find many useful links below to help you follow along and improve on this R project.
    If you enjoyed this video, please make sure to subscribe to the channel for similar videos!
    Installing RStudio and R Basics: • Video
    Kaggle Page Link: www.kaggle.com...
    Data Link: www.kaggle.com...
    Code: github.com/tec...
    Accompanying Article: towardsdatasci...

Comments • 111

  • @nishantrana6 4 years ago +63

    normally i wouldn't get any of this and my appreciation for this type of content would have been zero, but now that i need help with my college internship i should say that you are doing god's work. thanks a ton for the help.

  • @andrewdaywalt967 3 years ago +35

    I REALLY appreciate you uploading this example. I recently started to learn R, but I have a hard time understanding the basics without seeing the larger application. This is tremendously helpful!

    • @TechTribeCommunity 3 years ago +4

      Glad it was helpful! I am not a pro by any means, but happy to share what I know

    • @mugomuiruri2313 11 months ago

      How is your R so far?

  • @staynjohnson4221 3 years ago +28

    2:40
    for those who are new like me and did not understand line number 7, according to documentation: "as.integer attempts to coerce its argument to be of integer type". Meaning whenever the condition inside the argument is FALSE it will be coerced to integer 0 and if it's TRUE then 1. Thus a column of 0 and 1 values is made.

    • @TechTribeCommunity 3 years ago +1

      Very true! Thank you for helping others!

    • @divinepiggie7307 2 months ago

      But wasn't that regarding a date in that column? A date probably means someone died on that date and if the command detects the cell as false, wouldn't that inaccurately place a 0?

    • @julianjarjour 1 month ago

      @divinepiggie7307 inside of the as.integer() function is data$death != 0, which means "if the number is NOT zero [which is TRUE for dates], then set it to 1"
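
The coercion discussed in this thread can be sketched on a toy column. The values below are made up for illustration and assume that the `death` column holds "0" for survivors and either "1" or a date string for deaths; only `as.integer()` and the `data$death != 0` comparison come from the comments above.

```r
# Toy stand-in for the dataset's `death` column (an assumption for illustration):
# "0" means the patient survived; anything else ("1" or a date string) marks a death.
data <- data.frame(
  death = c("0", "1", "0", "2/21/2020", "0"),
  stringsAsFactors = FALSE
)

# The comparison yields a logical vector: TRUE wherever the entry is not "0".
is_death <- data$death != 0

# as.integer() coerces TRUE to 1 and FALSE to 0, giving the dummy column.
data$death_dummy <- as.integer(is_death)

print(data)
```

A date string is "not zero", so it is counted as a death (1) rather than being dropped or set to 0.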

  • @shananarocks 2 years ago +1

    This is my 2nd day learning R as a new language and indeed learn so much in just an hour from your easy-to-follow tutorial. Thanks so much!!!

  • @techwithmuchiri5921 4 years ago +7

    This is great, please do more of these. I'm fairly new to R and I'm trying out simple projects such as the one you've just done, and they are pretty helpful 💯.
    Next time consider doing some plots, that would really help.
    Thank you.

  • @LesterMoran-y6u 3 months ago

    Thank you! I have been learning about t.test but they haven't showed me how it can be used in scripting. This clarified it so simple.

  • @SwastikNaik318 4 months ago

    Completed! Completed! Completed!!! 🎉❤
    Completed my first ever project on R, great explanation, thanks for this great video! Would love to watch more of these videos ❤

  • @DarrylBodkin 2 years ago +2

    Dude I love how you made this video concise and loaded. 👊🏾👊🏾👊🏾👊🏾

  • @Priscilla-qu5pg 1 year ago +1

    The way u explained is very good sir,it is helpful for me sir.Thank you very much sir for explaining. Do more videos about/using R sir.

  • @BiscuitZombies 3 years ago +2

    Thanks for this. I want to start learning R and it’s good to see a use case in action of it

  • @rabiabs 2 years ago +1

    What a wonderful example 💐 Thank you a lot. I think we need more videos about R practice with your explanations

  • @cototapiamagdiel3653 4 years ago +12

    could u teach us how to do a logistic model with that data base, thx great video
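
One possible starting point for a logistic model, sketched with base R's `glm()`. The data frame `toy`, its columns `age`, `gender`, and `death_dummy`, and the coefficients used to simulate the outcome are all assumptions for illustration; the real dataset's columns may need cleaning first.

```r
set.seed(1)
n <- 200

# Simulated stand-in for the real data (hypothetical column names).
toy <- data.frame(
  age = round(runif(n, 20, 90)),
  gender = factor(sample(c("female", "male"), n, replace = TRUE))
)

# Simulated outcome: the death probability rises with age (made-up coefficients).
p <- plogis(-6 + 0.07 * toy$age + 0.5 * (toy$gender == "male"))
toy$death_dummy <- rbinom(n, size = 1, prob = p)

# Logistic regression: binomial family with the default logit link.
fit <- glm(death_dummy ~ age + gender, data = toy, family = binomial)
print(summary(fit))

# Predicted probability of death for a hypothetical 70-year-old man.
new_case <- data.frame(
  age = 70,
  gender = factor("male", levels = levels(toy$gender))
)
print(predict(fit, newdata = new_case, type = "response"))
```

The coefficients are on the log-odds scale; `exp(coef(fit))` converts them to odds ratios.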

  • @askme6855 3 years ago

    Happily subscribing your channel. Expecting more Videos on analysis using R-Programming

  • @shivamparashar...9536 9 months ago

    Amazing explanation, i understand totally.
    Thank you

  • @mike21599 4 years ago

    I'm glad I took business statistics 1 and 2 in school so I can understand what you are doing. I guess my only question is: what types of data correspond to which statistical test? You used a t-test, but when do I run an ANOVA table or a regression line?

  • @cobiam123 4 years ago +2

    Cordial greetings, I hope you and your family are in good health. Thanks for your video, but I have a question: I would like to know how, using the R language and some libraries, I could predict or obtain a probability of contagion for the towns neighboring my city. I mean that if my city is marked as infected, how do I obtain, graphically or probabilistically, the probability that the neighboring towns become infected, taking into account that today they are not? I hope you can shed some light on this. Thank you.

  • @valgames8309 1 year ago

    Thank you, GREAT TUTORIAL!

  • @codehow_ 2 years ago

    You are really good explainer man.

  • @ianleboo 3 years ago +1

    Really precise explanation. Good work

  • @sebastianquirarte6705 1 year ago

    Great tutorial, thanks for sharing!

  • @dmadhav4867 1 year ago

    bro love the u made can u tell how to start learning from basics in order to be a good scientist and learning R language plzz

  • @prakharpatil739 3 years ago

    plz upload more videos like this... there is very less content of ml practice with R.

  • @sumitmaske3032 3 years ago +1

    Thanks ,I learned a lot from this

  • @coder418 6 months ago

    Good stuff, Thanks

  • @MuhammadHabib-ho1ll 1 year ago

    amazing!

  • @poojamahesh8594 3 years ago

    Great explanation!
    I have a dataset containing a time period, and I want to categorise it into early morning, morning, mid-day, afternoon, evening...
    Kindly suggest a line of code for it.
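
One way to bin times of day is base R's `cut()` on the hour. This is a sketch only: the timestamps are made up, and the break points for each period are assumptions that should be adjusted to your own definitions.

```r
# Hypothetical timestamps; in practice these would come from the dataset's time column.
times <- as.POSIXct(
  c("2020-03-01 04:30:00", "2020-03-01 09:15:00", "2020-03-01 12:00:00",
    "2020-03-01 15:45:00", "2020-03-01 20:10:00"),
  tz = "UTC"
)

# Extract the hour of day as a number from 0 to 23.
hour <- as.integer(format(times, "%H"))

# Break points are an assumption; change them to match your own periods.
period <- cut(
  hour,
  breaks = c(0, 6, 11, 14, 17, 24),
  labels = c("early morning", "morning", "mid-day", "afternoon", "evening"),
  right = FALSE   # intervals are [0,6), [6,11), [11,14), [14,17), [17,24)
)

print(data.frame(times, hour, period))
```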

  • @srijeevbhowal 3 years ago +1

    Thank you so much, can you make a video for data visualization using same csv on r studio

  • @3s843a 4 years ago +3

    Neat

  • @gtensol 3 years ago +2

    Thanks for the video! In a "real world" analysis, how would you handle confounding variables? For example: if men are older than women, then a gender analysis is actually measuring the age effect. Would you use propensity scores?

    • @EmBennet99 3 years ago +1

      There are ways to do this but the most common is to perform a least-squares / linear regression where you control for each of the characteristics so that you understand the effect of gender controlling for age, or occupation, or income level, etc.

    • @TechTribeCommunity 3 years ago +2

      Exactly - if we have additional data for confounding variables, we can just include them in the regression. If we do not have that data, we might get omitted variable bias
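
The idea of "controlling for" a confounder in a regression can be sketched with `lm()`. Everything below is simulated under stated assumptions: men are made older on average, and death depends only on age, so any raw gender gap is driven by age. The names `toy`, `naive`, and `adjusted` are hypothetical.

```r
set.seed(42)
n <- 500

gender <- sample(c("female", "male"), n, replace = TRUE)
# Assumption for the demo: men in this fake sample are older on average.
age <- round(ifelse(gender == "male", rnorm(n, 60, 10), rnorm(n, 45, 10)))
# Death depends on age only, not on gender (made-up coefficients).
death_dummy <- rbinom(n, 1, plogis(-8 + 0.1 * age))
toy <- data.frame(gender, age, death_dummy)

# Naive model: the gender coefficient also absorbs the age difference.
naive <- lm(death_dummy ~ gender, data = toy)

# Adjusted model: the gender coefficient is estimated holding age fixed.
adjusted <- lm(death_dummy ~ gender + age, data = toy)

print(coef(naive)["gendermale"])
print(coef(adjusted)["gendermale"])
```

In a setup like this the adjusted gender coefficient would typically shrink toward zero. If age were not recorded at all, the naive estimate would suffer from the omitted variable bias mentioned above.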

  • @rahuldavar7209 4 years ago

    Great Explained.

  • @ianclark6730 3 years ago +1

    This was SO awesome

  • @gouthamkarakavalasa4267 3 years ago +1

    How did you clean the data? I mean, you took a dummy variable where death is not equal to zero, so ideally you're eliminating all the values equal to zero and considering the dates?

    • @jhontreyesalbarracin9479 3 years ago +2

      By using data$death != 0, R gives you only TRUE and FALSE values. These values are the result of testing whether each value is different from 0 (TRUE if it is different and FALSE otherwise). So, when you add as.integer you are basically converting those TRUE values into 1 and those FALSE values into 0. I hope this helps.

    • @gouthamkarakavalasa4267 3 years ago

      @jhontreyesalbarracin9479 thanks for the reply, jhon. Yes, it helped.

  • @sedenions 2 years ago +1

    When I run the script ending in 'describe(data)' nothing happens. What's wrong? I have the latest version of RTools installed on my C drive.

  • @danielalexanand 3 years ago +1

    Thank you! But why am I unable to find the link to the article?

    • @TechTribeCommunity 3 years ago

      Updated the description - thanks for pointing this out! towardsdatascience.com/r-tutorial-analyzing-covid-19-data-12670cd664d6

  • @shiromiherath4990 2 years ago

    Hi, the Kaggle link above doesn't work. It also goes straight to a download.

  • @bariscanbilgin 3 years ago +1

    👏🏻👏🏻

  • @omkar_salkar6729 3 years ago

    Can we apply a data science algorithm on this? And if yes then particularly which algorithm?

  • @shubhangiagrawal336 4 years ago

    When I run the mean command exactly as you showed, I get the message "argument is not numeric or logical: returning NA". Please help me solve this problem. Thanks
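
That warning usually appears when `mean()` is handed a column that is not numeric, for example one that was read in as text. A small sketch under that assumption, using a made-up `age` column in a hypothetical data frame `toy`:

```r
# Hypothetical column that was read in as text rather than numbers.
toy <- data.frame(age = c("34", "51", "unknown", "67"), stringsAsFactors = FALSE)

# The class shows why mean() cannot average the column directly.
print(class(toy$age))
# mean(toy$age) would warn: argument is not numeric or logical: returning NA

# Convert to numeric; entries that are not numbers become NA.
# suppressWarnings() hides the coercion warning for the "unknown" entry.
age_num <- suppressWarnings(as.numeric(toy$age))

# na.rm = TRUE drops the NA values before averaging.
print(mean(age_num, na.rm = TRUE))
```

Checking `class()` or `str()` on the column first shows whether it is the column type, rather than the `mean()` call, that needs fixing.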

  • @poetry_4u559 1 year ago

    Could you make the video of fake news detection using r language

  • @myyoutubecontent2024 4 years ago

    Nice video, but I didn't understand why did you do sum(death_dummy)/nrows for calculating death rate

    • @pranavwarrier6693 4 years ago +1

      that signifies total deaths{sum(death_dummy)} divided by total population{nrows} to give a rate

  • @amv4256 2 years ago

    Can you help me ???...How can we apply functional autoregressive (FAR) Model on this data???

  • @CreativeCommons277 3 days ago

    Error in data$death : object of type 'closure' is not subsettable
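
A likely cause of this error: `data` is also the name of a built-in R function (`utils::data`), so if the `read.csv()` assignment did not run, `data$death` tries to index that function. The sketch below reproduces the situation safely; the file name in the comment is a placeholder, not the actual file used in the video.

```r
# With no data frame called `data` defined, the name resolves to the built-in
# function utils::data(), and `$` cannot be applied to a function.
msg <- tryCatch(utils::data$death, error = function(e) conditionMessage(e))
print(msg)

# Assigning the data frame first avoids the error.
# data <- read.csv("your_file.csv")            # placeholder file name
data <- data.frame(death = c("0", "1", "0"))   # stand-in for read.csv() here
print(data$death)
```

Running the script from the top, so that the line creating `data` executes before any line that uses it, is usually enough.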

  • @sharifajmir 2 years ago

    awesome

  • @umutcantulu 3 years ago +2

    Thank you for that, but the video should be slower to be understood clearly

  • @mugomuiruri2313 11 months ago

    good

  • @sumitmaske3032 3 years ago +1

    What happens when I click on "DON'T CLICK THIS" link ? I'm very curious to know, so please tell me

    • @TechTribeCommunity 3 years ago +2

      It opens a prompt that asks you to subscribe 😂 doesn’t work if you’re subscribed already :)

    • @sumitmaske3032 3 years ago +1

      @TechTribeCommunity 😂😂😂

  • @nitinchoudhary3549 1 year ago

    is it example of handling improper dataset?

  • @jahnavivardhanambati1515 4 years ago

    Can we apply any regression model to this dataset? If possible, How?

  • @praveenparmar7728 4 years ago

    In t.test the degree of freedom 72.234 which is incorrect, df should be 841
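
A fractional value is expected here: `t.test()` defaults to `var.equal = FALSE`, which runs Welch's t-test and uses the Welch-Satterthwaite approximation for the degrees of freedom. The pooled value of n1 + n2 - 2 only appears with `var.equal = TRUE`. The sketch below uses made-up ages and hypothetical group sizes chosen so that the pooled df comes out to 841.

```r
set.seed(7)

# Made-up ages for two groups of very different size and spread.
alive <- rnorm(780, mean = 48, sd = 15)
dead  <- rnorm(63,  mean = 68, sd = 12)

# Default: Welch's t-test (var.equal = FALSE), with Welch-Satterthwaite df.
welch <- t.test(alive, dead)

# Classic Student's t-test: pooled variance, df = n1 + n2 - 2.
pooled <- t.test(alive, dead, var.equal = TRUE)

print(welch$parameter)    # non-integer df, smaller than the pooled df
print(pooled$parameter)   # 780 + 63 - 2 = 841
```

Welch's version is generally the safer default when the two groups differ in size or variance.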

  • @Lavistus 1 year ago

    bro are you doing a speedrun holy shit

  • @anveshpandey9817 2 years ago

    It is giving me error
    Error in `$

  • @pagetwentyone 4 years ago

    could you make a graph to represent it?

  • @kinyapop 4 years ago +2

    Can you tell me how to find the link to the code?

  • @WhisperingShutters 3 years ago

    How do I plot this in a graph? I want to visualize the death rate in men and women.

  • @kvs123100 3 years ago

    wowwww

  • @ankitdongare9262 3 years ago

    Hi, can you help me on High value customers identification for an E-Commerce company

  • @ankitkholiya4305 2 years ago

    thanks bro

  • @ashishpaithankar9632 2 years ago

    can you explain what is death_dummy?

  • @darshan1113 3 years ago

    Does this code run on RGui too?

  • @brandongriggs6934 3 years ago

    How do I learn this? It seems cool

  • @dattebayo_usuratonkachi 4 years ago

    How to plot them to a graph??
    like ggplot()?,could you help :/

    • @dattebayo_usuratonkachi 4 years ago

      dude please reply @Tech_Tribe,or anyone if you could help

    • @dattebayo_usuratonkachi 3 years ago

      @Shridnyan Haval hey thanks, but I actually saw some other tutorials and was able to learn. Anyways, thanks a lot :)
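
For a quick plot of the death rate by gender, base R's `barplot()` is enough. It is used here instead of ggplot2 so the sketch needs no extra packages. The data frame `toy` and its values are made up, with column names assumed to match those in the video.

```r
# Made-up stand-in for the data frame built in the video.
toy <- data.frame(
  gender = c("male", "male", "male", "male",
             "female", "female", "female", "female"),
  death_dummy = c(1, 0, 1, 0, 0, 0, 1, 0)
)

# The mean of the 0/1 dummy within each gender is that gender's death rate.
rates <- tapply(toy$death_dummy, toy$gender, mean)
print(rates)

# One bar per gender, with the y-axis fixed to the 0-1 range of a rate.
barplot(rates, ylab = "Death rate", main = "Death rate by gender", ylim = c(0, 1))
```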

  • @prathameshbansode8065 2 years ago

    I have same project can u help me

  • @sanskarahire1199 3 years ago

    Sir plz give csv file link

    • @TechTribeCommunity 3 years ago

      Thanks for watching! Here is the link (also in the video description): www.kaggle.com/sudalairajkumar/novel-corona-virus-2019-dataset/version/25

  • @superfreiheit1 1 year ago

    Why does he speak so fast? Very hard to understand

  • @umeshpatil8707 4 years ago

    slow your speed.

    • @subhash5122 4 years ago +4

      MAN USE PLAYBACK SPEED IN UR UTUBE.

  • @jess_ed 3 years ago +1

    How did you know that 0.08461538 was the death rate for men when you typed out the following code: mean(men$death_dummy, na.rm = TRUE) ? I did not know that death_dummy gave you the death rate. I thought death_dummy just dummy coded the death variable and that the following code: sum(data$death_dummy)/ nrow(data) gave you death rate.

    • @TechTribeCommunity 3 years ago +1

      Good question! Both of those are equivalent. Suppose we have death_dummy as [0 0 1 0 1] for 5 men. This means that 2 died, and 3 lived, for a death rate of 0.4 (40%). Your formula would calculate 2/5=0.4. Now ask yourself, what is the average death_dummy value? Well, it is 0.4. In other words, your method is just a slightly more verbose way of calculating the mean. Same thing - identical equations.

    • @jess_ed 3 years ago +1

      @TechTribeCommunity Thank you!! :)
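
The five-person example from the reply above can be checked directly in R. The data frame `men` here is a toy stand-in holding exactly the values [0 0 1 0 1] from that reply.

```r
# Toy stand-in: death_dummy for five men, two of whom died.
men <- data.frame(death_dummy = c(0, 0, 1, 0, 1))

# Deaths divided by the number of people.
rate_by_sum <- sum(men$death_dummy) / nrow(men)

# The mean of a 0/1 column is the same quantity.
rate_by_mean <- mean(men$death_dummy, na.rm = TRUE)

print(rate_by_sum)
print(rate_by_mean)
```

The two only diverge when the column contains `NA`: `mean(..., na.rm = TRUE)` drops those rows, while `sum()` without `na.rm = TRUE` returns `NA` and `nrow()` still counts them.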

  • @urmom.com629 2 years ago

    At 2:06, when I execute the function describe(data), the console says: Error in describe(data) : could not find function "describe"

    • @aifazm3163 2 years ago

      same , did u get it fixed ?

    • @gabrielmedina2480 8 months ago

      I'm having the same error

    • @adamwirth4100 8 months ago

      @gabrielmedina2480 yeah me too, i think this video is just outdated
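
"could not find function" generally means the package that defines the function has not been installed and loaded. `describe()` is not part of base R; packages such as Hmisc (and psych) provide a function of that name. That Hmisc is the one intended here is an assumption. The sketch below checks for it and falls back to base R's `summary()` otherwise, using a made-up data frame `toy`.

```r
# Made-up stand-in for the dataset.
toy <- data.frame(age = c(34, 51, 67), gender = c("male", "female", "male"))

if (requireNamespace("Hmisc", quietly = TRUE)) {
  # Hmisc is installed: call describe() with an explicit namespace prefix.
  overview <- Hmisc::describe(toy)
} else {
  # Hmisc is missing: install it once with install.packages("Hmisc"),
  # or fall back to base R's summary() for a quick overview.
  overview <- summary(toy)
}

print(overview)
```

After installing the package, running `library(Hmisc)` at the top of the script makes a plain `describe(data)` call work.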

  • @hugoromo7081 3 years ago

    Could you explain a little bit how to obtain the death rate for men with this expression? mean(men$death_dummy, na.rm = TRUE) #8.5%!