Tutorial 23-Univariate, Bivariate and Multivariate Analysis- Part2 (EDA)-Data Science

Krish Naik

Просмотров 147 тыс.

Добавить в
- Мой плейлист
- Посмотреть позже
Поделиться

HTML-код

Размер видео:

Показать панель управления

Автовоспроизведение

Автоповтор

Опубликовано: 14 дек 2024

Комментарии • 82

@himanshumangoli6708 3 года назад ⁺⁷
ur teaching skills are damn good man keep it up man lots of respect
@0Fallen0 2 года назад
Another easy way to do the bivaruate plot at 11:20 is sns.scatterplot(df['sepal_length'],df['sepal_width'],hue=df['species'])
@sengnawawnghkyeng9179 Год назад
The best explanation about these variates ...
@SwavimanKumar 4 года назад ⁺³⁰
One small correction. That Hue is pronounced "Hiu" instead of "Hui". You are making absolutely great content. Love them all. Keep growing. (Y)
@EcExplorer 2 года назад ⁺⁷
But I like how he pronounced 'HUII' :D
@nagurramesh1685 Год назад ⁺²
slave mindset ?
@sumantwankhede 3 года назад ⁺³
Just one tiny correction for Univariate x label should be Sepal Length ...all other good ..Thanks Krish
@krishnasahoo4598 2 года назад
thank you so much for this..I dont know why I was unable to understand this concept. Thanks for this
@pbanerjee4008 4 года назад ⁺¹
Great job. Your sincerity shows. Wonderful effort.
@wasimshaikh9147 3 года назад ⁺⁷
X lab should have been 'Sepal length' instead of 'Petal Length'
@adis6867 3 года назад ⁺²
I came in comment box to check same
@AadityaAgarwal-qo1km 6 месяцев назад
I love when krish calls Hue as Huiii
@nabiltech1366 3 года назад
So here are objective u can obtained by using this statistical method,
1)Which features have good impact for ur model
2)Which type of algorithms u should choos
@SaurabhSaurabh-uh6eq 3 года назад
Wow what a nice explaination! 👌 👋
@sunitapatil381 4 года назад
you are grate sir .i am really grateful to your vedios thank you thank you so much sir.
@piushsingh6066 4 года назад ⁺⁶
univariate, bivariate and multivariate analysis should be done before data prep-processing or after......Please Reply...
@sindhuseelam2045 3 года назад
after
@The_Hive_Mind8878 3 года назад
Really helpful. Thanks
@Uma7473 5 лет назад ⁺¹
Thank you
@venkateshbb2926 4 года назад
Thanks for tutorial.Please arrange tutorials in proper sequential of related tutorials.
@ashita1130 4 года назад ⁺¹
Pretty badass :) Thanks!
@gautamagarwal3394 3 года назад
@Quincy Sebastian please provide me an account :/
@kalyanipadaraju5715 4 года назад
Thank you so much sir . Great explanation
@ratheesh_tabla 3 года назад ⁺³
May be I am wrong, should that be "sepal length" instead of "petal length" in xlabel? based on your plot variables or feature used for univariate analysis
@pulkitarora6605 3 года назад ⁺¹
ya its sepal length may be there is some mistake
@adarshpatodi5300 4 года назад
You need to have x label as sepal length in univariate analysis.
@krishnakanthbandaru9308 5 лет назад ⁺³
Hi I have a doubt these plots are ok for small datasets and interesting while learning but is these graphs helps when handling real time data or while working with real data science projects.
@birinaboro8391 4 года назад ⁺²
Hello Sir, could you please help me out with multivariate correlation through SPSS??
@rohithmn3378 5 лет назад ⁺¹
Thanks for the excellent tutorial..!
But this works well for classification problems. How shall we perform the similar analysis for Regression problem..!?
@makanjudavid992 Год назад
Question: it is possible to use categorical features to make predictions for a numerical targer variable ??
@ashukol Год назад
Line 17th code needs modification as follows:
sns.FacetGrid(df,hue="species").map(plt.scatter,"petal_length","sepal_width").add_legend();
plt.show()
@manishshukla125 5 лет назад
Thanks Sir!
@tanujsharma5492 2 года назад ⁺¹
sir i think there is 'sepal length' instead of 'petal length' in xlabel. am i wrong or right??
@sohamsarkar5255 3 года назад
Isn't multivariant analysis a consolidated representation of bivariant analysis, where all possible combinations of bivariant analysis are represented together?
@MageDigest 3 года назад
so from multivariate if we some graphs with overlapping variables like sepal length and sepal width, we can ignore one of them while doing any further analysis ? Please help here
@kamran_desu 4 года назад
Interesting method to plot univariate, I generally create scatterplots to make similar deductions in terms of what kind of classifier will make sense.
Here's some sample code:
import matplotlib.pyplot as plt
from sklearn import datasets
iris = datasets.load_iris()
X = iris.data
y = iris.target
F = iris.feature_names
fig, ax = plt.subplots(1, len(F), figsize=(15,2))
for i,f in enumerate(F):
ax[i].scatter(X[:,i],y, c=y)
ax[i].set(xlabel=f)
ax[i].get_yaxis().set_visible(False)
@mitrabhanuroutkali 5 лет назад ⁺¹
Use DataExplorer package in r
@alokranjanthakur5746 4 года назад
Sir can you make. Video on EDA only using python. Means what are necessary steps in EDA
@gokulansundaraj8149 2 года назад
Wow...
@d39-nischithhegde65 9 месяцев назад
can you also include link to dataset used
@nijalparmar5951 4 года назад
Sir can u plz make one video with use of spss and univariate, bivariates and multivariate analysis
@aination7302 4 года назад
Just use the graph node and plot your histograms and scatter plots for all the variables you require.
@erumalibhatti1218 2 года назад
Hi krosh what will be the codes for R for same analysis??
@marioluoni3899 4 года назад
In the uni-variate analysis, why do you put all data points on the same level? By putting them onto different levels, e.g. by setting np.zeros_like()+0, np.zeros_like()+1 and np.zeros_like()+2, it will be very clear that these 3 data sets overlap very heavily as opposed to what you say @9:00 (unless I have misunderstood what you said there). Otherwise great lectures, thanks a lot!
@Moiz_tennis 2 года назад
great suggestion!
@ramthiagu2330 3 года назад
if we have more than 10 or 20 features, how can we do multivariate analysis. will it be visible clearly in pairplot
@yugen3968 3 года назад
Why not just plot histograms for every feature for univariate analyis?
@Pankaj.6658 11 месяцев назад
sir, what is web address you are using and is it free or paid please give some details about that also.
@hepengye4239 4 года назад
Are those 4 plots along with the diagonal density plots?
@niraliborad7475 Год назад
After executing the same code for univariate analysis my output is not color distributed as shown in video. can anyone help
@pritamchowdhury3152 3 года назад
sir can you provide some practice dataset
@siddharthjain4361 2 года назад
what if we have dimension in order of 100s...??
@souravsaha7751 2 года назад
sir a virginica or versicolor kaya ha
@toppoashish7 4 года назад
How to do eda when we have many features, say 20+ and all are non correlated.
@simanchalpatnaik2566 4 года назад ⁺³
Hi Krish, Why you are keeping the Y-axis as 0. In the previous lecture also it's not explained. In graph you just kept it as 0.
Please reply.
@HimanshuYadav-re7cp 4 года назад ⁺¹
hey , he's just trying to visualize the dependency of output feature on that particular feature i.e. "petal_width" .so there is no need for y axis if u want u can put x =0 , and plot it on y axis and we endup with a vertical stack :)
@hyhyb 4 года назад
How orange , green colours came into picture, coz we didn't mention any color parameters like palette, colour?
@urvishmahajan 4 года назад
Colors are automatically assigned if you don't mention them in the parameters
@rahuldas6777 2 года назад
why put semicolons after your lines of code?
@ankita684 4 года назад
Hi Krish...when I am executing this code 'plt.plot(df_setosa['Sepal.Length'],np.zeros(df_setosa['Sepal.Length']),'o') it is returning a value error that reads as 'sequence too large; cannot be greater than 32'. How did you execute without getting this error. How to resolve?
@toyazpandey8669 4 года назад
U haven't written like after np. Zeros_like
@vatsalshingala3225 Год назад
❤❤❤❤❤❤❤❤❤❤
@mlwithstats1703 3 года назад
Sir how we can the data ???
@zainabzafari2336 11 месяцев назад
Thank you very much for your great videos.
However, this is the first video of your playlist that I could not understand. The dataset was not clear and you did not explained much.
@tejassutar4198 4 года назад
Hello sir how to know categories of given data in python? For eg. Here We want to know species categories?
@viveksingh881 4 года назад
if u r talking about getting the unique values in species then following code will help:-
for unique numbers of species - iris_data['Species'].nunique()
for names of those unique species - iris_data['Species'].unique()
@Gamer_hai_hum 3 года назад
Hello sir huge fan following ur ML playlist and I'm getting error in stringIO sir I also saw youtube video but I'm not able to slove the error it say No module something can u please guide me I'm stuck in your 7th playlist pls let me know sir it will be helpful
@anandacharya9919 5 лет назад
When I import iris in python , no commands is working I am getting error as "AttributeError: info" , and also "AttributeError: describe" , please solve this, why I am getting this error
@SATISHKUMAR-bj2kl 3 года назад
sir evertime whenever i am running code then also error messege comes with "name df is not defined" can you please help me
@Ajayraj-dx6fb 2 года назад
try to load the data once again
@ashishkumarsingh2910 4 года назад
how you are calling a url or internet file to read in pandas..... its like impossible for me to do... plztellme how?
@adarshtiwari6742 4 года назад ⁺¹
Switch on internet would make it work
@adarshtiwari6742 4 года назад
Sir how much is necessary to know to get job in data science (is there any bounds)
@ClickyKitsune 4 года назад
My personal recommendation would be to start with python , basics of SQL and couple of ML algorithms i.e regression. It all comes to how many projects you have actually created..good luck 👍
@dharmatejaadepu8597 4 года назад ⁺¹
In univariate analysis, you have taken sepal length and labelled it as petal length , can you explain me about that.
@vinodmorya5413 4 года назад
its by mistake
@vishalrai2859 3 года назад
coaching institutes just looted me
taught nothing like this
@re-cordinglyf7176 2 года назад
I can't believe you pronounced it as hueee....😂😂
@shaminmohammed672 4 года назад
Thank you

Следующие

Автовоспроизведение

Tutorial 24- Histogram in EDA- Data Science