Ideally the more the better, but time is finite and there are other important skills. It depends on the sort of data scientist you want to be. Good luck!
It is hard to find a dataset of the appropriate values to showcase linear regression. I have looked through a bunch of the toy dataset supplied with R and some of the addons (Modeldata, CarData etc). You need data big enough to split. Any suggestions other than airquality?
Most packages come with good data sets. {ggplot2} and {modeldata} are good starting points. You can see all available data sets in loaded packages with `data()`. Good luck!
@@EquitableEquations I am aware of all the usual places. I was looking more for big datasets thar are off the beaten track. I have combed through Kaggle at no awail. I rarely find data in may own domain in the earth sciences (geochemical and/or isotopic data).
Hospital data of patients died over time is an example of linear regression model where simply a kaplan plot is sufficient. It is not a good idea to use linear regression modelling. non-linear regression models are good for simulation studies.
Please advise since am getting the following issue, I have already installed the tidymodels package "library(tidymodels) Error: package or namespace load failed for ‘tidymodels’ in loadNamespace(i, c(lib.loc, .libPaths()), versionCheck = vI[[i]]): namespace ‘rlang’ 1.1.1 is already loaded, but >= 1.1.3 is required In addition: Warning message: package ‘tidymodels’ was built under R version 4.3.3 "
You can find materials supporting this vid (and others) at github.com/equitable-equations/youtube.
Initial split in content: linear regression building and evaluation AND training and validation sample sets 👏🏼👏🏼👏🏼
Awesome ! clearly explained in a simplified way. Thanks, sir
Very well explained, excellent teacher👌
You are doing a fantastic job in this video! I refer my students to many of your videos Sir! Cheers from Sweden!
Cheers from Chicago!!
Thank you i am vey aspired to be data scientist i took datacamp course what you recommend how much i should deep dive into statistic
Ideally the more the better, but time is finite and there are other important skills. It depends on the sort of data scientist you want to be. Good luck!
can you tell me what that bootcamp is
It is hard to find a dataset of the appropriate values to showcase linear regression. I have looked through a bunch of the toy dataset supplied with R and some of the addons (Modeldata, CarData etc). You need data big enough to split. Any suggestions other than airquality?
Most packages come with good data sets. {ggplot2} and {modeldata} are good starting points. You can see all available data sets in loaded packages with `data()`. Good luck!
@@EquitableEquations I am aware of all the usual places. I was looking more for big datasets thar are off the beaten track. I have combed through Kaggle at no awail. I rarely find data in may own domain in the earth sciences (geochemical and/or isotopic data).
Hospital data of patients died over time is an example of linear regression model where simply a kaplan plot is sufficient. It is not a good idea to use linear regression modelling. non-linear regression models are good for simulation studies.
I'm getting error in library(tidymodels)=not found and also in "initial_split" not found
Please advise since am getting the following issue, I have already installed the tidymodels package "library(tidymodels)
Error: package or namespace load failed for ‘tidymodels’ in loadNamespace(i, c(lib.loc, .libPaths()), versionCheck = vI[[i]]):
namespace ‘rlang’ 1.1.1 is already loaded, but >= 1.1.3 is required
In addition: Warning message:
package ‘tidymodels’ was built under R version 4.3.3 "
How do you download air-quality files?
Please provide the csv/xls file or link
@@yourivyleaguegirl it is actually one of the inbuilt data in R. There is No need for an external file.
Not-so-simple linear regression with R