An introduction to importance sampling

Поделиться
HTML-код
  • Опубликовано: 14 май 2018
  • This video explains what is meant by importance sampling, and how this method can be used to provide estimates of a distribution's characteristics, even if we are unable to sample from that distribution.
    This video is part of a lecture course which closely follows the material covered in the book, "A Student's Guide to Bayesian Statistics", published by Sage, which is available to order on Amazon here: www.amazon.co.uk/Students-Gui...
    For more information on all things Bayesian, have a look at: ben-lambert.com/bayesian/. The playlist for the lecture course is here: • A Student's Guide to B...

Комментарии • 46

  • @vladislavstankov1796
    @vladislavstankov1796 3 года назад

    you are amazing! Every time when i have troubles understanding some concept and need to search the web, if i see that you have a video on this topic, is already solved for me.

  • @Hecklit
    @Hecklit 5 лет назад

    I really like the way you explain things. Thanks a lot.

  • @jorgepereiradelgado7492
    @jorgepereiradelgado7492 5 лет назад

    Great video! Thanks for putting effort into this, it really helped! Greetings from Spain.

  • @alexanderl.2689
    @alexanderl.2689 5 лет назад

    Thank you so much, with the help of your video I finally understand the idea behind IS

  • @violinplayer7201
    @violinplayer7201 4 года назад

    Well and very clearly explained! Thank you!

  • @lexparsimoniae2107
    @lexparsimoniae2107 3 года назад

    Ben, you are a life saver.

  • @kleemc6267
    @kleemc6267 5 лет назад

    Well done ! I like the graphics that shows convergence.

  • @mystonefeel
    @mystonefeel 4 года назад

    The best explain on importance sampling... good animation

  • @maurocamaraescudero1062
    @maurocamaraescudero1062 5 лет назад +2

    Great series of videos, complements very well the book by Casella! Would you please be able to add a link to the Mathematica animation code for both importance and rejection sampling? They would be tremendously useful!

  • @YChen-ut1dw
    @YChen-ut1dw 4 года назад +1

    this is perfect, thank you

  • @sivad1025
    @sivad1025 3 года назад

    You saved my life. Thanks.

  • @AndriyDrozdyuk
    @AndriyDrozdyuk 5 лет назад

    Beautiful video.

  • @songlinyang9248
    @songlinyang9248 5 лет назад

    nice explained,thank you

  • @emmanuelameyaw6806
    @emmanuelameyaw6806 3 года назад +1

    Hi Lambert, it would be great to include sequential importance sampling in the series.

  • @inothernews
    @inothernews 6 лет назад +36

    Hi, thanks for the great video series. Question: for importance sampling, do we already assume we know what g is, such that given some x, we can compute g(x)? If we do, then couldn't we compute its mean analytically? If not, how do we resolve g(x) in the g(x)/f(x) term?

    • @SpartacanUsuals
      @SpartacanUsuals  6 лет назад +24

      Hi, thanks for your comment. Yes, you’re right that we need to know g(x) (at least up to a constant of proportionality). The beauty of importance sampling is that even if g(x) is unnormalised, however - and so we cannot compute the particular expectation - we can still use importance sampling to estimate it. Also, whilst I’ve illustrated how importance sampling works for a 1D example (where computing integrals is typically quite straightforward), importance sampling can be used in higher dimensional integration where it is usually not possible to compute integrals analytically. Hope that helps! Best, Ben

    • @inothernews
      @inothernews 6 лет назад +3

      Right, that cleared things up a lot. Thank you for your time!

    • @lesterknome
      @lesterknome 4 года назад +1

      @@SpartacanUsuals If we cannot compute the integral of g analytically, in general are we able to compute the integral of f?

    • @robinranabhat3125
      @robinranabhat3125 4 года назад +1

      @@SpartacanUsuals can you give an concrete example though . If you know ..

    • @robinranabhat3125
      @robinranabhat3125 4 года назад

      @@ianpan0102 hi genius. I meant concrete example for multivariate problem. I meant an actual case where he actually does a freaking sampling without knowing the other distribution. And not just talk on how it's useful there. Even an 8 year old would understand that first part. If you have nothing informative to say, don't comment as if you know it.

  • @carloszarzar
    @carloszarzar 2 года назад

    Professor Ben Lambert congratulations on the videos. Great didactics. My suggestion is to always keep the footer of videos clean (no written information). Because a lot of people watch their videos with subtitles, it often gets confused the subtitles and their writing and makes it difficult to learn. Again, I congratulate you on the initiative to democratize information here on RUclips.

  • @omid_tau
    @omid_tau 5 лет назад +1

    dude it would be really helpful if you could reference the page in your book that the video refers too .

  • @shoryaagarwal561
    @shoryaagarwal561 3 года назад

    This would really help for someone to understand why importance sampling is used for sampling rays in ray tracing. Thanks!!

  • @user-cc8kb
    @user-cc8kb 5 лет назад

    great video

  • @emmanuelameyaw6806
    @emmanuelameyaw6806 3 года назад

    what if we are interested in another statistic like the median (r(x)) or quantile (h(x)) of g(x)? we just change 'x' in the derivation to 'h(x)' or 'r(x)'?

  • @zeeshan008x52
    @zeeshan008x52 5 лет назад

    you are a genius

  • @aishikpyne
    @aishikpyne Год назад

    I'm still a little confused on why, if we know the ratio of g(x)/f(x) then we can compute E_g[x] directly? Can you give an concrete example/illustration of a multivariate continuous/discrete case?

  • @peaelle42
    @peaelle42 7 месяцев назад

    but what happens if i actually wanna sample from that difficult-to-sample distribution instead of computing things like expectation etc etc? like, i want the points themselves.

  • @huxixi4120
    @huxixi4120 4 года назад +5

    But how we get or calculate the ratio between f and g? the true distribution f is unknown right?

    • @alikassem812
      @alikassem812 3 года назад

      As he explained, we have f(x) but we don't have g(x), so we will approximate it by sampling from other distribution which is h(x)/z. Then we will compute W

    • @thidmg
      @thidmg 3 года назад +1

      @@alikassem812 i think we have g(x), but sometimes is very difficult to do the math of g(x), so we use importance sampling. In this case g(x) is simple, but in bayesian statistics in general is a very complex function

    • @alikassem812
      @alikassem812 3 года назад

      @@thidmg yea, you are right. Thanks

    • @alexmtbful
      @alexmtbful Год назад

      @@thidmg agree- seems too easy and useless at the first look but I guess it's more a numerical issue why importance sampling has its right to exist and then it gets super useful

  • @chenyang1896
    @chenyang1896 4 года назад

    is there anybody knows the software's name which is the uploader was using?

    • @jaylenjames322
      @jaylenjames322 3 года назад

      The part where he was showing the automatically plotting graphs is called Mathematica. It's made by Wolfram.

  • @lisaka2951
    @lisaka2951 4 года назад

    Hi, I really enjoyed that video. But why is the expectation value of 1 always equal to 1?

    • @thidmg
      @thidmg 4 года назад

      hi, the expectation of a constant is always that constant, "the constant is constant" it does not varies anytime, it's always that constant number, in this example is 1 but can be any constant number like 1, 2, 3, 4...

    • @lisaka2951
      @lisaka2951 4 года назад

      @@thidmg Ohh.. thank you very much. That explanation did not come to my mind. Thank you and stay healthy!

  • @allenkalalu6006
    @allenkalalu6006 5 лет назад

    Hi! I am trying to work out model selection trough information criterion, please can you help in explaining the intuition behind Akaike Information criterion (AIC) also Bayesian Information criterion(BIC) and if you could start from Kullback Leibler divergence and how to derive AIC(or BIC) from it , and how AIC/BIC pick the best model,it will be deeply appreciate. Thanks in advance!!

  • @distrologic2925
    @distrologic2925 2 года назад

    at 9:18 there is an f(x) missing in the sum, isn't there? Because it is the expectation of f..?

  • @hossein_haeri
    @hossein_haeri Год назад

    You actually lost many people at 4:00 when you said "we only have access to the fair dice and somehow we want to work out the mean of the biased dice". Well... If we don't have access to the dice and it's probability distribution how do we even know what kind of dice are we dealing with? If we have its distribution then why are we bothering using the other ones samples???