the outlier 73
  • Videos 483
  • Views 444,716
Step by Step guide to Principal Component analysis (PCA) in SPSS
Welcome to our comprehensive guide on Principal Component Analysis (PCA) using SPSS. In this tutorial, we'll walk you through the entire process, from loading your wholesale price index data to interpreting the final results. You'll learn how to perform PCA, create and analyze scree plots, and build path diagrams. Whether you're a beginner or looking to refine your skills, this step-by-step guide will help you understand and apply PCA effectively. Don't forget to like, share, and subscribe for more in-depth statistical tutorials!
Step-by-Step Guide to Principal Component Analysis (PCA) in SPSS
1. Introduction to PCA
- Overview of PCA and its importance in data analysis.
2. Loading Data into...
Views: 6

Videos

1992 Presidential Election Data Analysis: Chi-Square & CHAID Decision Trees
Views 724 hours ago
Welcome to our deep dive into the fascinating world of election data analysis! 📊 In this video, we explore the 1992 Presidential Election dataset using advanced statistical techniques and decision tree analysis. Join us as we: 1. Perform an Exploratory Data Analysis (EDA) to uncover hidden patterns and trends. 2. Use the Chi-Square test to analyze the relationship between categorical variables....
Outliers- One dimensions vs Two Dimensional
Views 44 · 14 days ago
outlier analysis
SPSS Graphs for Beginners: Pie Chart, Bar chart and Histogram with Normal Curve
Views 49 · 1 month ago
Welcome to our comprehensive guide for beginners on creating graphs in SPSS! In this video, we'll walk you through the steps to create three essential types of graphs: pie charts, bar charts, and histograms with a normal curve overlay. Whether you're a student, researcher, or data enthusiast, this tutorial will help you visualize your data effectively and understand the basics of SPSS graphing....
Reliability Analysis: Comparing Cronbach's Alpha and Guttman's Split-Half Coefficient
Views 94 · 1 month ago
In this video, we delve into the world of reliability analysis, focusing on two key coefficients: Cronbach's Alpha and Guttman's Split-Half Coefficient. We'll explain what each coefficient measures, how they are calculated, and their significance in ensuring the reliability of tests and questionnaires. Additio...
SPSS Graphs Made Easy: Pie Charts, Clustered Boxplots, Scatterplots, SPLOM, and Heatmaps
Views 87 · 1 month ago
Welcome to "SPSS Graphs Made Easy: Pie Charts, Clustered Boxplots, Scatterplots, SPLOM, and Heatmaps"! This beginner-friendly tutorial is designed to help you master the art of creating and interpreting various types of graphs in SPSS. In this video, we will cover: - How to create pie charts to visualize proportions - Building clustered boxplots for comparing distributions across categories - U...
SPSS Graphs Made Easy: Bar Charts, Stacked Bar Charts, Histograms, Dot Plots, Boxplot(For Beginners)
Views 52 · 1 month ago
Welcome to "SPSS Graphs Made Easy: Bar Charts, Stacked Bar Charts, Histograms, Dot Plots, Boxplots"! This beginner-friendly tutorial will guide you through creating and interpreting various types of graphs in SPSS. Learn how to: - Create clear and informative bar charts and stacked bar charts - Generate detailed panel histograms - Visualize data using dot plots - Understand and use boxplots for...
Predicting Loan Defaults with QUEST: A Step-by-Step Guide
Views 56 · 1 month ago
Welcome to our in-depth tutorial on using the QUEST decision tree algorithm in SPSS to predict loan defaults! In this video, we'll cover everything you need to know, from the advantages and disadvantages of the QUEST algorithm to a step-by-step demonstration of its implementation in SPSS. 📚 What You'll Learn: 1. Introduction to QUEST Algorithm: An overview of the QUEST (Quick, Unbiased, Efficie...
Karnataka Disability Data with SPSS: Hierarchical cluster analysis, Dendrogram, Tree Custom Tables
Views 45 · 1 month ago
Welcome to our detailed analysis of Karnataka's Disability Data using SPSS! In this video, we will guide you through the process of analyzing disability data for Karnataka, with a focus on Hierarchical Cluster Analysis (HCA) and generating Custom Tables. What You'll Learn: 🔍 Hierarchical Cluster Analysis (HCA): - Understand the fundamentals of HCA. - Learn how to create and interpret dendrogram...
Maternal Mortality Rate analysis
Views 59 · 1 month ago
Welcome to our in-depth exploration of the Maternal Mortality Ratio (MMR) across Karnataka's 30 districts and 240 taluks! The MMR refers to the number of maternal deaths per 100,000 live births, a crucial indicator of maternal health. In this video, we delve into the most recent and specific MMR data for each district in Karnataka, drawing from a variety of authoritative sources. Whether you're...
Robust Statistical Modeling with M-Estimators in SPSS (For Advanced SPSS users)
Views 54 · 1 month ago
Welcome to our in-depth tutorial on Robust Statistical Modeling with M-Estimators in SPSS! In this video, we explore the powerful world of M-estimators and demonstrate how they can enhance the robustness of your statistical models in SPSS. Whether you're dealing with outliers, skewed data, or other irregularities, M-estimators provide a resi...
Missing Value Analysis using MCMC (Markov Chain Monte Carlo) Simulation and Bayesian Inference.
Views 80 · 1 month ago
Learn how to use MCMC (Markov Chain Monte Carlo) to effectively handle missing data in your datasets. This method allows you to simulate multiple possible values based on observed patterns, ensuring robust imputation even for complex data. We'll break down the process into five straightforward steps: defining your model, setting initial values, iteratively sampling data, evaluating convergence,...
Bank Loan Default Prediction: CART Model in SPSS with 5-Fold Cross Validation
Views 55 · 1 month ago
Unlock the power of predictive analytics with our step-by-step guide to predicting bank loan defaults using the CART model in SPSS. In this video, we'll walk you through the process of setting up and running a 5-Fold Cross Validation, ensuring your model is robust and reliable. Whether you're a data science enthusiast or a finance professional, this tutorial will equip you with the skills to en...
Day 6: Time Series Analysis
Views 31 · 1 month ago
Day 5: CHAID vs CART vs QUEST, Two Step Cluster Analysis & Missing value imputation using MCMC
Views 46 · 1 month ago
PCA, K-Means Clustering, and Neural Networks in SPSS
Views 149 · 1 month ago
Day 3: Understanding Logistic Regression: A Beginner's Guide
Views 29 · 1 month ago
Day 2: FDP on SPSS | T Test, Linear Regression
Views 48 · 1 month ago
Day1: Enhance Your Research Skills: FDP on SPSS for Data Analysis
Views 54 · 1 month ago
Big or Small, Protect Them All: Help Us Understand Breast Cancer
Views 22 · 1 month ago
How to Create a Population Pyramid Chart in SPSS | Step-by-Step Tutorial
Views 118 · 1 month ago
Ch1: How to import a Excel File in SPSS for Beginners
Views 38 · 1 month ago
SPSS Transform Menu Compute variable| SPSS for Beginners: Creating New Variables from Existing Data
Views 56 · 1 month ago
Forward Selection Explained: Linear Regression with Boston Housing Dataset
Views 69 · 1 month ago
Your Gateway to Machine Learning: A Tour of the UCI ML Repository
Views 104 · 3 months ago
SPSS Decision Tree: Classification and Decision Tree
Views 345 · 3 months ago
Launch of Post Graduation Diploma in Business Analytics (PGDBA)
Views 198 · 9 months ago
The Ultimate Showdown: ARIMA, SARIMA & SARIMAX Which Will Take the Crown for Time Series Forecasting
Views 782 · 1 year ago
Cracking the Code: Demystifying Stationarity in Time Series Analysis!
Views 64 · 1 year ago
The BERT Model: Unlocking the Secrets of NLP's Most Exciting Advancement Yet!
Views 43 · 1 year ago

Comments

  • @thou_yangba (11 hours ago)

    Very helpful sir 🔥

  • @ankitashrivastava5245 (2 days ago)

    👍

  • @gwonchanjasonyoon8087 (2 days ago)

    Did Perot pull votes from Clinton or Bush?

  • @sandhyaranisahoo933 (4 days ago)

    Nice video Sir...very very useful 👍

  • @priyagupta558 (9 days ago)

    Sir, can we use this factor as a variable for regression? Also, do we need to consider them as negative values or absolute values?

    • @theoutlier7395 (9 days ago)

      You can use the factor scores as variables.

    • @theoutlier7395 (9 days ago)

      Also, when interpreting factor scores from factor analysis, it is important to use the scores as they are, including both negative and positive values. These scores indicate the relative positioning of observations along the factor dimensions. Converting them to absolute values would distort the interpretation and relationships identified by the factor analysis. Negative values are meaningful and represent observations that are below the mean of the factor, while positive values are above the mean.
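
To illustrate the reply above, here is a minimal Python sketch rather than the SPSS procedure itself. It assumes the scores are standardized around zero, which is how saved factor scores typically behave; the `raw` values and the `standardize` helper are hypothetical and only serve the illustration.

```python
from statistics import mean, pstdev

# Hypothetical raw component values for five respondents (illustrative only).
raw = [2.0, 4.0, 6.0, 8.0, 10.0]


def standardize(values):
    """Return z-scores: (value - mean) / population standard deviation."""
    m = mean(values)
    s = pstdev(values)
    return [(v - m) / s for v in values]


scores = standardize(raw)

# Signed scores keep the ordering of respondents along the factor:
# a negative score marks a respondent below the factor mean.
below_mean = [r for r, z in zip(raw, scores) if z < 0]

# Absolute values make the lowest and highest respondents indistinguishable.
abs_scores = [abs(z) for z in scores]
```

With absolute values, the respondent at 2.0 and the respondent at 10.0 receive the same number, so a regression on `abs_scores` could no longer tell the low end of the factor from the high end.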

  • @intellectMind2024 (16 days ago)

    Nice explanation 😊

  • @chandusatish9697 (16 days ago)

    Great👍👏 explanation sir

  • @sandhyaranisahoo933 (18 days ago)

    Very important and helpful video..

  • @EstherDavid-x5i (19 days ago)

    Educative

    • @theoutlier7395 (19 days ago)

      Thanks for your kind words. Appreciate it.

  • @PO-bk1wv (20 days ago)

    At 19:40, when you explain support at 18% of the total samples, you say that out of 1000 customers, 18% purchased Freshmeat, CannedVeg, SoftDrink and Dairy. Is it true that they purchased CannedVeg despite it being False in the row? Thanks.

    • @theoutlier7395 (20 days ago)

      Good observation. Those customers did not purchase CannedVeg.
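
The distinction raised in this thread can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the `baskets` table is made up (it is not the data from the video), and `support` is a hypothetical helper that counts rows matching a pattern of True/False values.

```python
# Hypothetical basket table: one dict per customer, True if the item was bought.
baskets = [
    {"Freshmeat": True,  "CannedVeg": False, "SoftDrink": True,  "Dairy": True},
    {"Freshmeat": True,  "CannedVeg": True,  "SoftDrink": True,  "Dairy": True},
    {"Freshmeat": False, "CannedVeg": False, "SoftDrink": True,  "Dairy": False},
    {"Freshmeat": True,  "CannedVeg": False, "SoftDrink": True,  "Dairy": True},
    {"Freshmeat": True,  "CannedVeg": False, "SoftDrink": False, "Dairy": True},
]


def support(rows, pattern):
    """Share of rows whose values match every (item, bought) pair in pattern."""
    hits = sum(
        all(row[item] == bought for item, bought in pattern.items())
        for row in rows
    )
    return hits / len(rows)


# Pattern as it appears in the row: CannedVeg is False, the other three are True.
without_cannedveg = support(
    baskets,
    {"Freshmeat": True, "CannedVeg": False, "SoftDrink": True, "Dairy": True},
)

# Pattern as it was described aloud: all four items bought.
all_four_bought = support(
    baskets,
    {"Freshmeat": True, "CannedVeg": True, "SoftDrink": True, "Dairy": True},
)
```

The two patterns give different support values on the same table, which is why a False in the row has to be read as "did not buy" when the rule is interpreted.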

  • @PO-bk1wv (22 days ago)

    Do we need to eliminate seasonality as we prepare data for Market Basket Analysis? For example, people may buy bread and butter in summer, whereas the same people may buy bread and jam in winter. How would this affect the analysis? Or should I do a separate analysis for summer vs winter if I expect a strong seasonality signal? Thanks for your views.

  • @user-wo2se3rf1w (25 days ago)

    Nnknooon8nnnijknykknooom

  • @user-wo2se3rf1w (25 days ago)

    Mh

  • @890lli (25 days ago)

    Thanks!

  • @moyofoluwaogunyemi2315 (28 days ago)

    Thank you for this. My result is not displaying the KMO and Bartlett result. Could there be any reason for this?

    • @theoutlier7395 (27 days ago)

      If you are using SPSS, please go to the Analyze menu, then Dimension Reduction, then Factor. This opens the Factor Analysis dialog box. Once you are inside the Factor Analysis dialog box, you have the Descriptive Statistics tab on the right-hand side. Please click on it; this will open a new dialog box. The last option there is KMO and Bartlett's test of sphericity. Please make sure you select it, then click OK. You should get the KMO result.

    • @theoutlier7395 (27 days ago)

      If you are having issues, please let me know.

  • @greatmind3842 (29 days ago)

    Excellent! Very clear and audible! Took your time to explain! Thank you so much!

  • @user-gr6gh2cl3p (1 month ago)

    I did a survey and had 4 study groups. In that case, is it possible to run PCA in SPSS?

    • @theoutlier7395 (1 month ago)

      Yes, you can go ahead with PCA.

    • @user-gr6gh2cl3p (1 month ago)

      @theoutlier7395 Could you help me with running PCA in SPSS? I'm analyzing a dataset with 4 study groups and 400 samples to find where risk factors cluster the most. Your guidance would be valuable!

  • @awoths5320 (1 month ago)

    The title is about homoscedasticity while the presentation is about heteroscedasticity 😊

  • @jeronimocliff2972 (1 month ago)

    Hello my friend. I checked and ran the factor analysis just like you, but within Excel there is always just one sheet. Can you help me so that I can see the heat map / eigenvalues? I checked the descriptive statistics option in PCA but could not find it directly.

    • @theoutlier7395 (1 month ago)

      Hi, I created the factor loadings and the other output by running the analysis in SPSS. It is not part of the raw data provided.

    • @jeronimocliff2972 (1 month ago)

      @theoutlier7395 Thank you very much for the clarification.

    • @theoutlier7395 (1 month ago)

      All the best..

  • @sandhyaranisahoo933 (1 month ago)

    Nice video ...

  • @DrBenVincent (1 month ago)

    Very clear

  • @jonrrobinson (1 month ago)

    Thank you for this video - very informative! I’ve only recently found Orange - it looks very interesting. Can you recommend anything for how to report these results as part of a research paper?

    • @theoutlier7395 (1 month ago)

      When it comes to research paper publication, SPSS is preferred over Orange. Orange's output export capability is weak and sometimes a struggle.

  • @sandhyaranisahoo933 (1 month ago)

    Wonderful video.

  • @supriyachaudhary5112 (1 month ago)

    Ma'am, could you please explain how to do absolute principal component score - multiple linear regression (APCS-MLR) modelling in SPSS?

  • @parinayamjal5768 (1 month ago)

    Wonderful explanation, very clear and precise. Thank you.

  • @Winstonsmith1984ingsoc (1 month ago)

    great channel

  • @sureshmondal8702 (1 month ago)

    What do the variable names rc1, rc2, rc3 mean?

    • @theoutlier7395 (1 month ago)

      Rotated component one, rotated component two.

    • @sureshmondal8702 (1 month ago)

      @theoutlier7395 Then, sir, do I have to remove the first three variables from my analysis and add these rc variables instead? Also, what should I write in my paper as the factor/variable name, since these names differ from the other variable names?

    • @theoutlier7395 (1 month ago)

      @sureshmondal8702 You don't need to remove the original variables. Please look at the loading table to name the factors.

    • @theoutlier7395 (1 month ago)

      You have to look at the highest loading for each component.
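
One way to read a loading table along the lines suggested above is sketched here in Python. Everything in it is hypothetical: the `loadings` values, the variable names, and the `top_variables` helper are made up for illustration, and using the absolute loading is an assumption on my part rather than something stated in the thread.

```python
# Hypothetical rotated loading table: variable -> loadings on (RC1, RC2).
loadings = {
    "income":      (0.85, 0.10),
    "savings":     (0.78, 0.22),
    "exercise":    (0.05, 0.91),
    "sleep_hours": (-0.12, 0.74),
}
components = ("RC1", "RC2")


def top_variables(table, names):
    """Group each variable under the component where its absolute loading is largest."""
    groups = {name: [] for name in names}
    for variable, row in table.items():
        strongest = max(range(len(row)), key=lambda i: abs(row[i]))
        groups[names[strongest]].append(variable)
    return groups


groups = top_variables(loadings, components)
```

In this made-up table, RC1 collects `income` and `savings` and RC2 collects `exercise` and `sleep_hours`, so a paper might label them something like "financial" and "lifestyle" factors. The label itself is a judgment call based on which variables load together.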

  • @nikpapou (1 month ago)

    Thanks for helping us find the sample files!

    • @theoutlier7395 (1 month ago)

      All the best. Appreciate your comments...

  • @k.mohimasingha2891 (1 month ago)

    Sir, if we do not use a bar chart or box plot, how can we tell from the mean difference in the t-test alone whether males or females received the higher wage? Sir, please explain this.

    • @theoutlier7395 (1 month ago)

      Thanks for the question. Please look at the first table, the Group Statistics table, under the t-test section. You can clearly see that the average female salary is 26031 and for males it is 41441. The difference between these two numbers is 15409, which is the mean difference.

    • @theoutlier7395 (1 month ago)

      Since male employees are getting 41441, which is greater than the 26031 for female employees, the male employees' average salary is higher. Hope that answers your question.
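
The same reasoning can be sketched in Python: compute each group's mean, subtract, and read the direction from the sign. The salaries below are hypothetical values chosen for illustration, not the data set used in the video.

```python
from statistics import mean

# Hypothetical salaries (illustrative only; not the data set used in the video).
salaries = {
    "female": [24000, 26000, 28000],
    "male":   [39000, 41000, 43000],
}

# One mean per group, as in a group statistics table.
group_means = {group: mean(values) for group, values in salaries.items()}

# Male minus female: a positive result means the male average is higher.
mean_difference = group_means["male"] - group_means["female"]

# The group with the larger mean, found without drawing any chart.
higher_group = max(group_means, key=group_means.get)
```

The sign of the difference depends on the order of subtraction, so it helps to check the group means themselves before deciding which group earns more.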

  • @Hutch-Ee (1 month ago)

    I liked the way you explained the content systematically, with the summary at the end. Great! Thanks for the explanation.

  • @sandhyaranisahoo933 (1 month ago)

    Excellent clarity...

  • @JustAnAverageGirlShreya (1 month ago)

    super clear, this was so helpful! thank you!

  • @intellectMind2024 (1 month ago)

    Great 💯

  • @sogunagan8191 (2 months ago)

    Can you send me your data and file? Thank you, man.

  • @thecopy9975 (2 months ago)

    This channel is awesome.. thanks for the great video..

    • @theoutlier7395 (2 months ago)

      Thank you for your support. All the best...

  • @kashifzahid3872 (2 months ago)

    Superb video

  • @TeeFat (2 months ago)

    Thank you. This is very helpful. Thank you for sharing the data.

  • @makav3li665 (2 months ago)

    Excellent tutorial. Please, how did you create your training and test data?

    • @theoutlier7395 (2 months ago)

      ruclips.net/video/VatSJGnUPzQ/видео.html

    • @makav3li665 (2 months ago)

      @theoutlier7395 really appreciate 🙏

  • @pangpuielhtliakaelhtliaka9534 (2 months ago)

    Can PCA tell us which original variables to retain for certain statistical analyses?

    • @theoutlier7395 (2 months ago)

      If the loading is greater than .7, retain it.
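
The rule of thumb in the reply can be sketched as a simple filter. This is a hedged illustration only: the `loadings` table and the `retained` helper are hypothetical, and comparing the absolute value of the loading against the cutoff is my assumption (a strong negative loading is treated like a strong positive one).

```python
# Hypothetical loading table: variable -> loadings on two components.
loadings = {
    "price":    (0.82, 0.15),
    "quality":  (0.35, 0.66),
    "service":  (-0.76, 0.20),
    "location": (0.10, 0.71),
}


def retained(table, cutoff=0.7):
    """Keep variables whose largest absolute loading exceeds the cutoff."""
    return [
        variable
        for variable, row in table.items()
        if max(abs(value) for value in row) > cutoff
    ]


kept = retained(loadings)
```

Here `quality` is dropped because its strongest loading (0.66) falls short of 0.7, while `service` is kept on the strength of its -0.76 loading.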

  • @PriyaKumari-gx9gt (2 months ago)

    How do I determine the sum of squares (eigenvalues) and percentage of error for given factor scores?

    • @theoutlier7395 (2 months ago)

      SPSS displays eigenvalues by default.

  • @eylmaz6696 (2 months ago)

    Hi sir, what should I look at when commenting on the clustering? The silhouette score, or how the points collect around the center?

    • @theoutlier7395 (2 months ago)

      Slightly hard to do this in Orange.

    • @eylmaz6696 (2 months ago)

      @theoutlier7395 Can I do it just by looking at the label with the silhouette score? For instance, if the silhouette score is large, is the clustering more trustworthy?
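
On the silhouette question, a small Python sketch may help. It is not how Orange computes the score; it is a plain implementation of the usual silhouette definition on one-dimensional points, with absolute difference as the distance. The `points`, the two labelings, and the `silhouette` helper are all hypothetical.

```python
from statistics import mean


def silhouette(points, labels):
    """Mean silhouette over all points, using absolute difference as distance."""
    scores = []
    for i, (p, lab) in enumerate(zip(points, labels)):
        # a: mean distance to the other points in the same cluster.
        same = [
            abs(p - q)
            for j, (q, other) in enumerate(zip(points, labels))
            if other == lab and j != i
        ]
        if not same:
            scores.append(0.0)  # common convention for single-point clusters
            continue
        a = mean(same)
        # b: smallest mean distance to the points of any other cluster.
        b = min(
            mean(abs(p - q) for q, other in zip(points, labels) if other == cluster)
            for cluster in set(labels)
            if cluster != lab
        )
        scores.append((b - a) / max(a, b))
    return mean(scores)


points = [1.0, 2.0, 3.0, 10.0, 11.0, 12.0]
well_separated = silhouette(points, [0, 0, 0, 1, 1, 1])
mixed_up = silhouette(points, [0, 1, 0, 1, 0, 1])
```

The score always lies between -1 and 1. A value near 1 means points sit much closer to their own cluster than to the nearest other cluster, so a higher average score does point to a cleaner clustering, although it says nothing about whether the clusters are meaningful for the research question.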

  • @eylmaz6696 (2 months ago)

    Sir, I have some questions for you. I ran a survey whose options are similar to this: 1. I know / I don't know 2. I speak / I don't speak. So the PDF and CDF need to be calculated. How can I do it? I also need the median and mode of the words. How can I do that?

    • @theoutlier7395 (2 months ago)

      Not sure why you need to calculate the PDF and CDF. SPSS Compute Variable has what you need.

    • @theoutlier7395 (2 months ago)

      Median: use SPSS Explore.

    • @theoutlier7395 (2 months ago)

      Mode is not easy in SPSS.

    • @eylmaz6696 (2 months ago)

      @theoutlier7395 Can the standard deviation and mean be obtained via the Distribution widget on an Orange data table? Is what I get in the Distribution widget about the normal distribution? In short, can I get normal distribution results in Orange?

    • @eylmaz6696 (2 months ago)

      @theoutlier7395 Thanks sir. How can I compute correlation in Orange? I don't have numeric values.

  • @anshul-katyal (3 months ago)

    When we have a dependent variable, do we have to include it in the PCA as well? And once we get the factors, how do we interpret them?

    • @theoutlier7395 (3 months ago)

      Use only independent variables for factor analysis.

  • @eylmaz6696 (3 months ago)

    How can I get the output in txt or xls format?

    • @theoutlier7395 (3 months ago)

      After getting the output of the market basket analysis, at the bottom left, next to the question mark, there is an option to get the output in text format.

    • @eylmaz6696 (3 months ago)

      @theoutlier7395 Can I do it for all association rules?

    • @eylmaz6696 (3 months ago)

      @theoutlier7395 I want to use the data in my thesis.

    • @eylmaz6696 (2 months ago)

      @theoutlier7395 I could not do it. Can you help me? Can I get the output by creating association rules automatically?

    • @eylmaz6696 (2 months ago)

      @theoutlier7395 Sir, I have some questions for you. I ran a survey whose options are similar to this: 1. I know / I don't know 2. I speak / I don't speak. So the PDF and CDF need to be calculated. How can I do it? I also need the median and mode of the words. How can I do that?

  • @nimmyprakash3193 (3 months ago)

    I appreciate the video, thank you so much. It is very informative and useful. Can you please tell me what value is observed in each internal node in the tree? Also, I have run a split sample validation procedure where I obtained the risk estimate for training as 150.14 and for test as 148.00. What can we infer from that? Thank you in advance.

    • @theoutlier7395 (3 months ago)

      Assuming that the dependent variable is a scale variable, each node shows the mean value of the dependent variable for the cases satisfying the rule.

    • @theoutlier7395 (3 months ago)

      On average, the error between actual and predicted values is 150.14 for the training data and 148.00 for the test data. The closer this value is to zero, the better.

    • @nimmyprakash3193 (3 months ago)

      @theoutlier7395 Thank you for the reply. So if the risk estimate is high, does that mean the model is wrong?

    • @nimmyprakash3193 (3 months ago)

      @theoutlier7395 Hi, my dependent variable is a scale variable and it ranges from 0 to 100. Is it okay if I get a risk estimate value above 100? Thanks in advance.

    • @theoutlier7395 (3 months ago)

      @nimmyprakash3193 Try to reduce the risk estimate to a lower score.
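
A short Python sketch may help with the question of a risk estimate above 100 on a 0 to 100 scale. As far as I understand, for a scale dependent variable SPSS reports the risk estimate as the within-node variance, meaning the average squared distance between each actual value and the mean of its node; the sketch works under that assumption. The `nodes` data are hypothetical.

```python
from math import sqrt
from statistics import mean

# Hypothetical terminal nodes: each list holds the actual values of a scale
# dependent variable (0 to 100) for the cases that fall in that node.
nodes = {
    "node_1": [20.0, 30.0, 40.0],
    "node_2": [70.0, 80.0, 90.0],
}

# The prediction for every case in a node is that node's mean.
node_means = {name: mean(values) for name, values in nodes.items()}

# Risk estimate, sketched as the mean squared error around the node means.
squared_errors = [
    (value - node_means[name]) ** 2
    for name, values in nodes.items()
    for value in values
]
risk = mean(squared_errors)

# The square root puts the error back on the 0 to 100 scale of the variable.
typical_error = sqrt(risk)
```

Under this reading the risk is in squared units, so a value above 100 is possible even when the variable tops out at 100. Taking the square root of 150.14 gives roughly 12.25, which would mean predictions are typically off by about 12 points. The training and test figures being close (150.14 vs 148.00) suggests the tree is not overfitting badly.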

  • @fealgu100 (3 months ago)

    Very nice!

  • @maxdidieratta9093 (3 months ago)

    It means a lot to me. I tried to do CART but my tree has no leaves. Help me please.

    • @theoutlier7395 (3 months ago)

      Hi, please check if you have a sufficient sample size.

    • @theoutlier7395 (3 months ago)

      Looks like your data has a low sample size.

    • @theoutlier7395 (3 months ago)

      Once you are in the SPSS decision tree dialog, on the right-hand side there is a button called "Criteria". Click on it, then reduce the parent node sample to a smaller number. Also reduce the child node to a smaller number. Rerun the analysis. It should work.

    • @theoutlier7395 (3 months ago)

      In case you still have issues, we can have a Zoom call. Please feel free to ping me.

    • @maxdidieratta9093 (3 months ago)

      @theoutlier7395 Hello, you were right. I reduced the number of nodes and it worked. You can't imagine how happy I am this morning. Thank you so much. God bless you.
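
Why lowering the parent and child node sizes helps can be sketched with a small check. The `can_split` helper is hypothetical and simplifies what a tree algorithm does; the limits of 100 cases per parent node and 50 per child node are an assumption on my part about typical defaults, so check the Criteria dialog for the values actually in use.

```python
def can_split(n_cases, min_parent, min_child):
    """A node is eligible to split only if it is large enough to be a parent
    and can yield two children that each meet the minimum child size."""
    return n_cases >= min_parent and n_cases >= 2 * min_child


# Hypothetical data set with 80 cases in the root node.
small_sample = 80

# Assumed growth limits: 100 cases per parent node and 50 per child node.
blocked = can_split(small_sample, min_parent=100, min_child=50)

# Lowered limits, as suggested in the thread.
allowed = can_split(small_sample, min_parent=20, min_child=10)
```

With the larger limits the root node never qualifies as a parent, so the tree stays a single node with no leaves below it. Lowering both limits lets the first split go ahead, at the cost of smaller and less stable nodes.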