Bayesian Regression in R

Поделиться
HTML-код
  • Опубликовано: 21 авг 2024
  • НаукаНаука

Комментарии • 32

  • @SpencerPaoHere
    @SpencerPaoHere  3 года назад +3

    This was pretty dense with information. Note that my prior here was a Normal distribution, and I used arbitrary constants of a = 0.1, b = 0.1, g = 10.. Once you get to the coding demonstration, I highly recommend that you pull up the code + data to follow along.
    The code can be retrieved here: github.com/SpencerPao/Data_Science/tree/main/Bayesian%20Regression

  • @avocadoarmadillo7031
    @avocadoarmadillo7031 Год назад

    New sub, enjoying your channel so far!

  • @cat-et6dx
    @cat-et6dx 6 месяцев назад

    Very well explained. Thanks!

  • @user-wz2cx7hv1c
    @user-wz2cx7hv1c Год назад

    omg❤ this really saved my life!!! Thank you so much!!! can't believe I could use the model right now❤❤❤

  • @Stat_Guy
    @Stat_Guy 2 года назад

    Thank you Spencer, for sharing this great video! It will help a lot to understand bayesian regression

  • @lebzgold7475
    @lebzgold7475 3 года назад

    Just finding your channel now, you're of great help! Thank you so much sir! :)

  • @ayeletesema590
    @ayeletesema590 6 месяцев назад

    Thank you Spencer for this great piece of work. I would really appreciate it if you could share reference for the theoretical background. I need mathematical expressions to include it in my current work.

  • @CenasDeWinner
    @CenasDeWinner 2 года назад

    Since you post your like/dislike count, here's a like from me. The video was great btw

  • @samsonamani8103
    @samsonamani8103 5 месяцев назад

    Thanks sir, is there a written document explaining this?

  • @simple-jp1cs
    @simple-jp1cs 2 года назад

    Great video, your explanation is very clear. I am confused in how you got the values in the logit_loan_default column? Does it need to have binary values in this field?

    • @SpencerPaoHere
      @SpencerPaoHere  2 года назад

      In the example I used, you can definitely just have categorical variables (0,1). You'd just have to change your distribution to that of Bernouli (2 outputs)
      The values I have are essentially centroids for each given cluster. (Check out the kmeans_data.r script I have on github). So, in essence, I am doing a regression to fit the normal.
      Some additional information can be found here: bayesball.github.io/BOOK/bayesian-multiple-regression-and-logistic-models.html#bayesian-logistic-regression
      I hope that helps!

  • @user-wr4yl7tx3w
    @user-wr4yl7tx3w 5 месяцев назад

    Not sure how you clustered with K Means? Is it so that you can apply bayesian to different clusters separately?

  • @cansahin529
    @cansahin529 3 года назад

    Thank you for your video it so helped to me on my school project appreciate that🙏🏻🙏🏻👏🏻

  • @red3987
    @red3987 Год назад

    How long did it take you to learn to do all this stuff in R and really understand what you are doing? I want to really master this type of stuff using R but do not feel as comfortable with it all (the code and the theory) as I would like to. I am about to get my undergraduate degree in econ and math, then my masters in econ and have been practicing in R for about a year and practicing statistical theory for about 6 months.

    • @SpencerPaoHere
      @SpencerPaoHere  Год назад +1

      Its a journey! I learned all my basic stuff in college (4 years) and the more advanced stuff while working. I learned R in college and all the python stuff was in industry. 6 months is not that much but around the 1 year mark of continuously using the language: R (or any programming language for that matter) and you’ll be just fine!

    • @red3987
      @red3987 Год назад

      @@SpencerPaoHere thank you !:)

  • @agracian1
    @agracian1 3 года назад

    Hi Spencer, could your share the R code for how you k-mean'd the data to generate Kmeans_scaled_data100.csv?Did you run a logit regression 1st using glm to generate your logit_loan_default field?
    Thanks Alex

    • @SpencerPaoHere
      @SpencerPaoHere  3 года назад +1

      I uploaded the kmeans code to my github repository with some sample data. (The original dataset is too large). The binary outcomes already came as is, so I did not do any glm related functions.

  • @chukwuemekanworu-uo2td
    @chukwuemekanworu-uo2td Год назад

    The video is too blurred, i can't see it clearly.

    • @SpencerPaoHere
      @SpencerPaoHere  Год назад

      Weird. This video goes up to 1080p. You might have to wait for the buffer to go through. (And select the appropriate setting)

  • @juanete69
    @juanete69 2 года назад

    Do we need to use cross-validation in bayesian analysis to control overfitting?

    • @SpencerPaoHere
      @SpencerPaoHere  2 года назад

      Nope. They are fundamentally different. (Bayesian has MCMC that comes into play)

    • @juanete69
      @juanete69 2 года назад

      @@SpencerPaoHere Then bayesian models can't overfit if they have too many parameters or very few observations?

    • @SpencerPaoHere
      @SpencerPaoHere  2 года назад

      @@juanete69 it most certainly can! Though the sampling will address the overfitness of the model to the best of its ability via MCMC

    • @SpencerPaoHere
      @SpencerPaoHere  2 года назад

      @@juanete69 it most certainly can! Though the sampling will address the overfitness of the model to the best of its ability via MCMC

  • @taotaotan5671
    @taotaotan5671 3 года назад

    Nice meme 3:31 lol

  • @cliffme6310
    @cliffme6310 2 года назад

    can this be used to predict forex markets reversals?

    • @SpencerPaoHere
      @SpencerPaoHere  2 года назад

      You could try! Though, I've personally never tried this technique on forex, so I can't comment on how well it'd do.

  • @yiranding1257
    @yiranding1257 3 года назад

    I see one punch man pretty cool