How to Perform Factor Analysis in Python (Step by Step)

  • Published: 11 Sep 2024

Comments • 53

  • @loogsnyakwardyoh4854 4 years ago +4

    the correct column name is "Unnamed: 0". df.drop(['Unnamed: 0'], axis = 1, inplace = True) works...

  • @inusun 5 years ago +4

    Great video, helped out a lot in processing my data using factor analysis for my data science class.

  • @user-gx9hk8gt3k 2 years ago +1

    df.drop(['Unnamed: 0'], axis=1, inplace=True) This one works.

  • @mariamedina5970 2 years ago +1

    This is by far the best video on the internet to explain factor analysis and all the steps necessary! Great work!

  • @austin_adetunji 2 years ago

    Wow, this is great. I wish I could like this video multiple times.

  • @kennedybwire9435 2 years ago

    Hi, that is great work.

  • @quiosaevaristo7746 1 year ago

    Hi, that is an amazing video.

  • @tallyxh6477 3 years ago +2

    This video literally saved me and my Master's dissertation. Great explanations, easy to understand and follow. Thank you very much!!

    • @samuelceciliochucuyamamani561 3 years ago

      Hello! Previously I was using Minitab to obtain the factor scoring coefficients and then graph them.
      I don't know how to get that with Python; I hope you can help me, please.
      Regards

  • @spef7396 2 years ago

    Thank you very much. I'm in my final year of computer science but have very little experience in statistics etc., so this was a massive help.

  • @diepnguyen8928 3 years ago +1

    Could you explain for me why I have different results while implementing Factor Analysis in Python and STATA? There are differences in both scree plot and the value of loading factors :(( Don't know which is the correct one now :( Thank you for your support!

  • @luisady8990 4 years ago +1

    If you have trouble w/factor analyzer:
    1. Open cmd and paste: pip install factor_analyzer
    2. Once it is successfully installed, paste the command again and get the file path it is downloaded in (first line after the command).
    3. Follow the rest of the steps in the video.
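
A rough sketch of the same check from inside Python, assuming only the standard library: it reports which interpreter is running and where (if anywhere) that interpreter can find the package. The helper name `find_package_location` is made up for this illustration.

```python
import importlib.util
import sys


def find_package_location(package_name):
    """Return the file path the current interpreter would import the
    package from, or None if the package cannot be found."""
    spec = importlib.util.find_spec(package_name)
    if spec is None:
        return None
    return spec.origin


# Show which Python is running; pip must install into this same interpreter.
print("Interpreter:", sys.executable)

location = find_package_location("factor_analyzer")
if location is None:
    print("factor_analyzer not found; try: python -m pip install factor_analyzer")
else:
    print("factor_analyzer found at:", location)
```

If the package shows as not found even though pip reported success, pip most likely installed it for a different Python installation than the one running the notebook.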

  • @uniongrob8194 3 years ago +1

    Helpful video, thanks!

  • @javiercmh 4 years ago +1

    very useful as it is!!! thanks a ton!

  • @nnn-bb5tf 3 years ago +1

    Thanks for this excellent video

  • @veronicam2102 3 years ago

    Thanks!! this video was very helpful to me :D

  • @phanthilanhuong1783 3 years ago

    Thank you for sharing and for the fantastic guidance.

  • @brettgregory3622 4 years ago

    Any ideas on how one could go about getting goodness of fit indices? All my FAs are done in R purely because of these fit indices but I really want to completely move over to Python.

  • @williamkanaan7554 4 years ago

    When I type fa.fit(df), I get the error "ValueError: Found array with 0 sample(s) (shape=(0, 38)) while a minimum of 1 is required by FactorAnalyzer." But when I check my document, there is no line with 0 values. What can I do, please?

  • @elhamahmadi8673 2 years ago

    Hi, I have one question. If we want to use FA or PCA for feature selection on all types of data (without dropping string-type data), how can we do that? Is there an alternative we can apply?

  • @josephkarianjahi1467 4 years ago +1

    Good work!

  • @xiangyu9445 2 years ago +1

    Where can I find your Jupyter notebook?

    • @KindsonTheTechPro 2 years ago

      Would you like to join a 6-week intensive Data Science course that begins this week? Let me know at mail@kindsonthegenius.com.

  • @rajeshghosh4344 3 years ago

    Even after appending the site package directory the factor_analyzer is not importing. Can you help?

  • @henny628 4 years ago

    Hi, I am trying to analyse data with multiple factors over a series of time. Is there a way I can consult you off youtube?

  • @tarunsiga3610 3 years ago +1

    Could you explain on what basis the factors were selected in FactorAnalyzer?

    • @KindsonTheTechPro 3 years ago

      Nice question, but the explanation is not trivial. We consider the factor loading of each variable, which tells us the extent to which the variable relates to the factor. We then select the factors with the highest loadings. I recommend reading up on Principal Components Analysis (PCA), as it helps you understand FA.
      www.kindsonthegenius.com/pca-tutorial-1-how-to-perform-principal-components-analysis-pca/
      www.kindsonthegenius.com/principal-components-analysispca-in-python-step-by-step/
      www.kindsonthegenius.com/basics-of-factor-analysis-for-data-scientists/
      Question, What is Factor Analysis ruclips.net/video/s2ffkELXsHc/видео.html
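
To make the idea of a loading concrete, here is a minimal sketch, assuming a principal-component style extraction (not necessarily what FactorAnalyzer does internally) and a made-up 3x3 correlation matrix. Under that assumption, the loading of a variable on the first factor is its entry in the leading eigenvector scaled by the square root of the leading eigenvalue. The function names are hypothetical.

```python
import math


def leading_eigenpair(matrix, iterations=500):
    """Power iteration: approximate the largest eigenvalue and its unit
    eigenvector for a symmetric matrix with non-negative eigenvalues."""
    n = len(matrix)
    vector = [1.0 / math.sqrt(n)] * n
    eigenvalue = 0.0
    for _ in range(iterations):
        product = [sum(matrix[i][j] * vector[j] for j in range(n)) for i in range(n)]
        norm = math.sqrt(sum(x * x for x in product))
        vector = [x / norm for x in product]
        eigenvalue = norm
    return eigenvalue, vector


def first_factor_loadings(corr):
    """Loadings on the first component: eigenvector * sqrt(eigenvalue)."""
    eigenvalue, vector = leading_eigenpair(corr)
    return eigenvalue, [v * math.sqrt(eigenvalue) for v in vector]


# Hypothetical correlation matrix: variables 0 and 1 are strongly related,
# variable 2 is only weakly related to either of them.
CORR = [
    [1.0, 0.8, 0.1],
    [0.8, 1.0, 0.1],
    [0.1, 0.1, 1.0],
]

eigenvalue, loadings = first_factor_loadings(CORR)
print("Leading eigenvalue:", round(eigenvalue, 4))
print("Loadings on first factor:", [round(x, 4) for x in loadings])
```

In this toy matrix the first two variables load heavily on the first factor while the third barely does, which is the kind of pattern one reads off a loadings table.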

  • @stepheneaster3166 4 years ago

    Hi, just finding this great video on FA, thanks for posting it. When I use the varimax orthogonal rotation and then use transform() to 'score' the dataframe with the factor model, the resulting factors do not have 0 correlation as expected from an orthogonal transformation. I'd like to take those factors as inputs into a clustering exercise but want non correlated factors (similar to PCA). Thanks in advance for any thoughts on where I might be going awry.

  • @aysenzumrutsonmez3699 2 years ago

    I got a ModuleNotFoundError for factor_analyzer in the first step. How do I solve this error? Please help me.

  • @matangaelisha2116 3 years ago

    I can't see the link to the dataset, please.

  • @inspiritlashi9994 3 years ago

    This is such a great video.. Thank you so much

  • @sharp389 4 years ago +1

    Do you know of algorithms that use other criteria to select factors? Like the cng package in R, Horn's PA, etc.?

  • @Lernal 3 years ago

    good factor analysis for python

  • @chavianddavid 3 years ago

    Is there something called "Factor Scores" as well?

  • @c.e.g.o4797 4 years ago

    Thank you, bro.

  • @Sudo007 4 years ago +1

    Please, tell me what the values in the matrix signify.

    • @TieMaxx 4 years ago

      Answers to a questionnaire about personality (Big Five traits). A1 through A5 represent questions about agreeableness, N is neuroticism, C is conscientiousness, E is extraversion and O is openness to experience. 1 represents an answer of -2 and 5 is actually +2. It does not really matter though, because they are all standardized using the mean and the standard deviation. For example, one question might be "I am interested in talking about abstract topics". -2 (in the matrix: 1.0) would mean that the statement does not describe you at all, while +2 (in the matrix: 5.0) means it describes you perfectly, and so on.
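
The claim that the coding does not matter after standardization can be checked with a small sketch. The answers below are invented for illustration, and the population standard deviation is an assumption; the sample version gives the same conclusion.

```python
from statistics import mean, pstdev


def standardize(values):
    """Convert values to z-scores: subtract the mean, divide by the
    (population) standard deviation."""
    center = mean(values)
    spread = pstdev(values)
    return [(v - center) / spread for v in values]


# Hypothetical answers to one questionnaire item, coded 1..5 as in the matrix.
raw_answers = [1, 2, 2, 4, 5, 3]

# The same answers recoded to the -2..+2 scale described above.
recoded_answers = [a - 3 for a in raw_answers]

z_raw = standardize(raw_answers)
z_recoded = standardize(recoded_answers)

# Shifting every answer by a constant changes the mean but not the spread,
# so both codings produce identical z-scores.
print([round(z, 4) for z in z_raw])
print([round(z, 4) for z in z_recoded])
```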

  • @lucasmigge3159 4 years ago

    thank you very much

  • @vineetsodhani5398 4 years ago

    How does one load the factor analyzer package? Where does one find the package?

    • @Sudo007 4 years ago +1

      pip install factor_analyzer

  • @charleshowe1297 4 years ago

    In 15:51-16:07 you said ev=eigenvectors and v=eigenvalues.
    But during the scree plot, you said ev = eigenvalues.
    Which is correct?

  • @sindimazibuko1399 4 years ago

    Can someone please help me figure out the name of the test I need to perform to see if my data is suitable for factor analysis? I can not seem to hear the name. Thank you in advance.

    • @qudsiyajan6485 3 years ago +1

      KMO and Bartlett's test of sphericity can help you in knowing if data is suitable for factor analysis or not.....
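
For readers wondering what Bartlett's test actually computes, here is a minimal standard-library sketch of the test statistic, chi2 = -(n - 1 - (2p + 5)/6) * ln|R| with p(p - 1)/2 degrees of freedom, where R is the correlation matrix, n the number of observations and p the number of variables. The correlation matrix and sample size below are made up, and the p-value (which needs a chi-square distribution) and the KMO measure are left out. The factor_analyzer package appears to provide `calculate_bartlett_sphericity` and `calculate_kmo` helpers for real use; this sketch does not rely on them.

```python
import math


def determinant(matrix):
    """Determinant via Gaussian elimination with partial pivoting."""
    a = [row[:] for row in matrix]  # work on a copy
    n = len(a)
    det = 1.0
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(a[r][col]))
        if a[pivot][col] == 0:
            return 0.0
        if pivot != col:
            a[col], a[pivot] = a[pivot], a[col]
            det = -det  # a row swap flips the sign
        det *= a[col][col]
        for row in range(col + 1, n):
            factor = a[row][col] / a[col][col]
            for k in range(col, n):
                a[row][k] -= factor * a[col][k]
    return det


def bartlett_sphericity(corr, n_samples):
    """Return (chi-square statistic, degrees of freedom) for Bartlett's test."""
    p = len(corr)
    statistic = -(n_samples - 1 - (2 * p + 5) / 6) * math.log(determinant(corr))
    dof = p * (p - 1) // 2
    return statistic, dof


# Hypothetical example: two variables correlated at 0.5, 100 observations.
CORR = [[1.0, 0.5], [0.5, 1.0]]
statistic, dof = bartlett_sphericity(CORR, n_samples=100)
print("chi-square:", round(statistic, 3), "degrees of freedom:", dof)
```

A large statistic relative to its degrees of freedom suggests the correlation matrix is far from the identity matrix, which is what makes factor analysis worth attempting.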

  • @al702893 3 years ago +1

    You just copied DataCamp's tutorial, including the comments.