Interaction Terms in Stata

  • Published: Oct 4, 2024
  • This video explains how to use Stata's inline syntax for interaction and polynomial terms, and gives a quick refresher on interpreting interaction terms.

Comments • 88

  • @timocallaghan4408 3 years ago +40

    finally I have a clue at what I am looking at in this assignment. You have a statistically significant positive effect on many of our grades!

  • @BB-se9bl 6 years ago +34

    Dude you’re saving my thesis. Thanks!

  • @AN-yr7nm 4 years ago +3

    Dear Nick, big thanks, you just helped one more thesis! I really wish you that you get to enjoy the help you've been giving us!

  • @abdullahrabaya9235 4 years ago +1

    Great video, for me this is first time I saw analysis of moderating variable like this simple way. Thank you so much for time n efforts.

    • @abdullahrabaya9235 4 years ago

      I have some questions
      1) what are the main differences between (##)(#) and which case I can interpret the results. For me, (#) seems to me like you split sample.
      2) when you calculate the total effect of moderator (b3 &b4), the P-value were not significant. So how you can do that?
      3) is there any academic paper published use this method? Pls If you have could you please share with me abdullah.rabaya@siswa.ukm.edu.my. Thank you in advance

  • @cheerioz83 3 years ago

    Wow. Just what I was looking for. Thank you!

  • @bapaknyaayadanfathan4686 2 years ago

    thank you.. you really save my dissertation.

  • @Garrettthethief 6 years ago

    A really valuable video. THANK YOU NICK!

  • @jekamito 2 years ago

    thank you, crystal clear!!!

  • @takesuretozooneyi4836 7 months ago

    video of the century

  • @cate-pillar 5 years ago +1

    Thank you Nick, this really helped!

  • @kselev1708 3 years ago

    Super simple and interesting explained. Thank you very much!!!
    Dear Nick, could you maybe create a set of videos about using Stata for econometric analysis E2E? That would be soooo great and helpful for multiple master/PhD students!

    • @NickHuntingtonKlein 3 years ago

      Glad you like the material! Unfortunately I'm not planning to create any new Stata material in the near future. Plenty of E2E material in R though!

  • @dataman6744 2 years ago

    Nicely done thanks

  • @amaranwar1249 4 years ago +1

    It's the mean difference between the base category and the reported category.

  • @nikitapestov7602 2 years ago

    The video sound is pretty good, beyond my imagination

  • @amyheadley6429 4 years ago +3

    Hi Nick, thanks for the video. Just wondering how to interpret odds ratios following a regression using the logistic command? Is the odds ratio for the interaction term still an additional effect or is it interpreted differently?
    Many thanks

    • @bmaclin 1 year ago

      I have the same question!

  • @takesuretozooneyi4836 7 months ago

    Will the interpretation of B4 remain the same if family income was a dummy?

  • @LenaGlitzert 6 years ago

    This is just awesome. Thanks a lot!

  • @elizabethdonkor3274 4 years ago +1

    Hi Nick, thanks for the video. Quick question pls, do we always have to include the IV's in the model if we only interested in the interaction results? So say just having the control variables, interaction terms and DV only?

    • @NickHuntingtonKlein 4 years ago +1

      It's pretty rare that you'd want to leave out the independent variables, as doing so makes the interaction terms very difficult to interpret and sometimes meaningless. Because they have no base independent variable coefficient to contrast to, the interactions lose the "how the effect of one variable changes with values of the other" interpretation

    • @elizabethdonkor3274 4 years ago +1

      Thank you very much Nick. I understand it now.

  • @Garrettthethief 6 years ago

    Nick, two questions:
    • If you use a fixed-effects regression instead: Is the interpretation still the same?
    • Would you present interaction after interaction or all interactions in one fixed effects regression for a presentation about these interaction effects?

    • @NickHuntingtonKlein 6 years ago +1

      Keep'nCalmProductions Yes, the interpretation remains the same, with the caveat that, like everything else with a fixed effects regression, you're looking only at Within variation. And if you think there are multiple interactions, then you should be estimating the correct model with all the interactions included (unless your sample size is small enough that doing so runs you into power issues). Unfortunately that does also make the results harder to present.

  • @TheFruitInvasion 4 years ago

    You really helped me thanks!!

  • @WeyNurSame 5 years ago

    Thank you so much my man

  • @Non-disjunction 3 years ago

    Learned some tricks with the i. and c. commands, but overall this video went quite fast and I had to pause a lot.

  • @TinaTina-xn9on 1 year ago

    Sir, what do with the high multicollinearity between the interaction term and one of its components variable?

    • @NickHuntingtonKlein 1 year ago

      It's not a concern. In fact it will be correlated with both of its components by construction.
      If the correlation is extremely high, that may indicate that the other component doesn't have much variation, at worst.
      In general multicollinearity is way overblown as a concern.

  • @JaredForsyth 3 years ago

    So in this example, the interaction term doesn't have a significant p-value -- doesn't that mean that we don't have evidence of an interaction? And would we still add the coefficient (b4) in that case?
    Thanks!

    • @NickHuntingtonKlein 3 years ago

      In general you never want to pick which variables to include or exclude on the basis of significance. The interaction should be included or excluded on the basis of theory or which hypothesis you're trying to test (often, the interaction itself is what you're curious about so you'd need to keep it in). If you really want to determine whether to keep or remove it on a predictive basis, something like LASSO would be more principled than a significance test.

  • @shanschannel2682 6 years ago

    this was really helpful thank you!

  • @thommievg6008 5 years ago +1

    legend

  • @DPREY1234 1 year ago

    great!!!!!

  • @marinmes1722 4 years ago

    Hey Nick!
    I have a question: is there a command in Stata that makes 1=highest rank (like you have for classrank)? I'm doing an analysis which includes ranking in a Dutch football league and am curious to know how you did it. I only figured it out by making an additional variable which is constructed by doing: 19-rank (there are 18 teams in the league I am researching).
    Let me know!

    • @NickHuntingtonKlein 4 years ago

      If you have a rank variable to start with, doing 19-rank sounds like a pretty good way to do it. If you're constructing ranks from a raw variable you might look at the rank function in egen.
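
      A minimal sketch of the egen route mentioned above, using Stata's bundled auto dataset as a stand-in (the variable names rank_field and rank_neg are made up for illustration). The field option of rank() assigns rank 1 to the highest value; ranking the negated variable is an alternative that averages ties instead.

      ```stata
      * Stand-in data shipped with Stata; mpg plays the role of the raw variable
      sysuse auto, clear

      * field: the highest mpg gets rank 1, ties share a rank (sports-style)
      egen rank_field = rank(mpg), field

      * Alternative: rank the negated value, so the highest mpg is again ranked first
      * (default tie handling assigns tied observations their average rank)
      egen rank_neg = rank(-mpg)

      * Inspect the top of the ranking
      sort rank_field
      list make mpg rank_field rank_neg in 1/5
      ```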

  • @anikaholmnielsen 6 years ago

    Very helpful!!!!

  • @JAY-ft1dw 4 years ago

    Thank you for your video Nick, but do you also happened to know how to "outreg2" of multiple regression output at the same time? (I basically want to outreg regression output of each set of groups (300+groups) and hoping that I don't have to copy and paste that many times.....)

    • @NickHuntingtonKlein 4 years ago

      Do them one after the other. Do a regression, then do outreg2 with the "replace" option. Then do another regression and do outreg2 with the "append" option, and then do as many more append versions as you like
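
      For many groups, the replace-then-append pattern above can be put in a loop rather than typed out by hand. This is only a sketch: it uses the bundled auto dataset with foreign as a stand-in grouping variable, and the output file name results.doc and the local macro opt are assumptions. outreg2 is a user-written command and must be installed first.

      ```stata
      * outreg2 is user-written; install once with: ssc install outreg2
      sysuse auto, clear

      * First pass overwrites the file, every later pass appends a column
      local opt replace

      * Collect the distinct values of the grouping variable
      levelsof foreign, local(groups)

      foreach g of local groups {
          * One regression per group
          regress price mpg weight if foreign == `g'
          * Write this model as a new column, labelled with the group value
          outreg2 using results.doc, `opt' ctitle(foreign = `g')
          local opt append
      }
      ```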

  • @antonio23merino1 6 years ago

    Nick one question:
    If you use two categorical variables (sex and education) interaction, how will be the interpretation? 1=female for the first variable and for the second one the categories are: primary, secondary, and university education. Thank you so much!

    • @NickHuntingtonKlein 6 years ago

      Same basic idea, but everything relative to the omitted category. So if primary is omitted, then the coefficient on secondary is "difference between secondary and primary for men" and the coefficient on secondary plus the coefficient on female*secondary is "difference between secondary and primary for women".
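
      Written out as an equation, this interpretation corresponds to a model along the following lines. The outcome $y$ and the coefficient labels are assumptions for illustration, with men and primary education as the omitted categories.

      ```latex
      \begin{aligned}
      y ={}& \beta_0 + \beta_1\,\text{female} + \beta_2\,\text{secondary} + \beta_3\,\text{university} \\
           & + \beta_4\,(\text{female}\times\text{secondary}) + \beta_5\,(\text{female}\times\text{university}) + \varepsilon \\[4pt]
      \text{secondary vs. primary, men:}\quad   & \beta_2 \\
      \text{secondary vs. primary, women:}\quad & \beta_2 + \beta_4 \\
      \text{university vs. primary, men:}\quad  & \beta_3 \\
      \text{university vs. primary, women:}\quad & \beta_3 + \beta_5
      \end{aligned}
      ```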

    • @antonio23merino1 6 years ago

      thank you so much!

  • @blessingsam7198 3 years ago

    Very helpful, Please can I generate a variable instead though, like gen whitefeminc= 1.white*feminc, would this count as an interaction term too, I want my table to look good, without the # in the variable name.

    • @NickHuntingtonKlein 3 years ago

      That would work (without the 1.). It will stop functions like margins from working properly afterwards though. Generally if you want the table to look nice it's better to do it properly with the interaction in the regression, and then use a table making function like outreg2 to change the variable names
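
      A sketch of the difference, using the bundled auto dataset as a stand-in (foreign and mpg take the place of white and feminc; foreign_mpg is a made-up name). Both regressions should give the same coefficients, but only the factor-variable version tells margins that the product term depends on its two components.

      ```stata
      sysuse auto, clear

      * Hand-built product: Stata treats it as just another regressor
      gen foreign_mpg = foreign * mpg
      regress price foreign mpg foreign_mpg

      * Factor-variable version of the same model
      regress price i.foreign##c.mpg

      * margins now knows about the interaction:
      * slope of mpg separately for domestic and foreign cars
      margins foreign, dydx(mpg)
      ```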

    • @blessingsam7198 3 years ago

      @NickHuntingtonKlein thank you.

  • @abeypp 2 years ago

    Hi Nick, may i ask if we can use the interaction term as a control variable in the same model? For example, age or ethnicity.

  • @eddiele644 4 years ago

    So when do we actually interact our variables? Is there a way to see if it is necessary or do we just do it and then see if the coefficient on the interaction term is statistically significant?

    • @NickHuntingtonKlein 4 years ago

      Ideally, build your model using theory. Should the effect of one variable be moderated by another? But outside of that, yes, adding the interaction and checking for significance is commonly done.

    • @eddiele644 4 years ago

      @NickHuntingtonKlein Thanks Nick

  • @dr.wanmaslizawanmohammad349 5 years ago

    tq sir really helpful

  • @camillepouchucq1133 4 years ago

    Hi Nick, thank you very much. Is it possible to use interaction terms with stepwise selection ? Because when I try (like this for example : xi : stepwise, pr(.2) lockterm1 : logistic success (i.classage i.sex) i.surgery##i.sex) it answers = "interactions not allowed"
    Thank you for your help.

    • @NickHuntingtonKlein 4 years ago

      Sounds like it's not compatible with interactions (some commands aren't). You can usually use interactions anyway by creating them as their own variables, but in the case of stepwise regression I'm guessing that would make it work improperly since it wouldn't know what to drop. Maybe do a LASSO instead? That's usually better than stepwise anyway.

  • @dehiole6463 5 months ago +1

    is it still true if i put white = 1; not white = 2??? 6:50

    • @NickHuntingtonKlein 5 months ago

      Yep it will work for any value of white (although in this case white is binary so it can only be 0 or 1 anyway, but in a case with a variable with a wider range, yes)

  • @AK-md1mr 4 years ago

    Hey Nick,
    First of all thanks a lot for your video!!
    Can I use interaction terms with longitudinal data as well? For example I want to look how the effect of "female" and "r_white" on "classrank" change in the last lets say 4 years ("year"). Would xtreg classrank female##year r_white##year give me any information about that?

  • @ramachandrant2719 5 years ago

    Dear Nick, Thank you for the video. How is the interpretation if the interaction term is a factor variable with three levels. For the third level, is it the difference from the omitted level or the second level?

  • @andersengbergpetersen4437 5 years ago

    Really great video! I'm looking at social origin effect on occupational attainment in three different countries and want to run a regression with an interaction between the social origin dummies and country. However, in the dataset (ESS) the country variable is a string variable (no numbers assigned). Is it possible to do an interaction when one of the variables is a string variable?

  • @obeydzomonda9821 3 years ago

    Hi Nick. Thank you for the video. I wanted to ask if its possible to test moderation effect on 5 independent variables and one dependent variable using panel data in stata

    • @NickHuntingtonKlein 3 years ago +1

      I don't see why that wouldn't be possible. Anything where you're testing that many independent variables for their effects at once is going to be a bit jumbled and likely difficult to identify though

    • @obeydzomonda9821 3 years ago

      Thank you for your response. Well appreciated.
      Regards
      Obey

  • @annedenhartog9389 6 years ago +1

    Can you also use control variables?

    • @NickHuntingtonKlein 6 years ago +1

      Yep, just add 'em in there.

    • @annedenhartog9389 6 years ago

      Thankyou! I want to interpret if income differs between Men and women within different education levels (middle and upper). So I get interaction term 0.2258 for middle if someone is a male and 0.7203 for upper is someone is male. How do I interpret that?

    • @NickHuntingtonKlein 6 years ago

      I'd recommend writing out your regression equation and plugging in values like in the video. I couldn't really say the interpretation without knowing your model.

  • @CoryBradley 5 years ago

    This is tremendous. Does anyone know if a video like this exists for logistic regressions?

  • @emilienneyameogo3525 4 years ago

    Hi Nick, thank you very much. Is it possible to use interaction terms with the PMG model? Because when I try, it is telling me (note: lnKOF dropped because of collinearity//initial vector: extra parameter lnKOF found//specify skip option if necessary). How can I use all my variables in the same model with the interaction term without collinearity issues?

    • @NickHuntingtonKlein 4 years ago +1

      Yes it is. That error generally means that you have perfect multicollinearity, ie you can perfectly predict lnKOF with a linear mix of your other variables. Try running the same model as a regression and you should get the range result. Your model is misspecified

    • @emilienneyameogo3525 4 years ago

      @NickHuntingtonKlein I will try that. Thank you.

  • @FemkeHuisman 3 years ago

    Wait, but when should you use # and when ##?

    • @NickHuntingtonKlein 3 years ago +1

      # is for interaction only, ## is for interaction plus each variable by itself
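
      A short sketch of the two operators side by side, using Stata's bundled auto dataset as a stand-in for the video's data (foreign is a 0/1 indicator, mpg is continuous). The i. and c. prefixes mark a variable as categorical or continuous.

      ```stata
      sysuse auto, clear

      * ## : both main effects plus their interaction
      regress price i.foreign##c.mpg

      * The same model spelled out with # and explicit main effects
      regress price i.foreign c.mpg i.foreign#c.mpg

      * # alone: interaction terms only, no main effects.
      * With no mpg main effect, Stata typically reports an mpg slope for each
      * level of foreign, which is why it can look like a split sample
      regress price i.foreign#c.mpg
      ```

      The first two commands should produce identical estimates; the third fits a different, more restrictive model with a single shared intercept.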

  • @jawadtariq88 5 years ago

    I am so confused right now, can any one tell me that is moderation and interaction are same ??

    • @NickHuntingtonKlein 5 years ago

      Roughly, if z mediates the relationship between x and y, then the coefficient on x will shrink when you include z as a control. If z moderates the relationship between the two, then the coefficient on the interaction between x and z will be nonzero.

  • @selcenzorlu 4 years ago

    Where can we download the data you are using?

    • @NickHuntingtonKlein 4 years ago

      I don't think I have this data on hand any more, but it comes from the NLS Investigator

  • @deeg1385 5 years ago

    Thank u

  • @ellenkileyfriends 1 year ago

    Helpful but moves too fast for me.

  • @tomsrol 3 years ago

    thanks nerd

  • @matto4187 3 years ago

    dear nick, i kiss you on the mouth

  • @deeg1385 5 years ago

    Thank u