Causal Discovery | Inferring causality from observational data

  • Published: Nov 9, 2024

Comments • 48

  • @ShawhinTalebi 2 years ago

    More in this series 👇
    Intro to Causality: ruclips.net/video/WqASiuM4a-A/видео.html
    Causal Inference: ruclips.net/video/PFBI-ZfV5rs/видео.html
    Intro to Causal Effects: ruclips.net/video/BOPOX_mTS0g/видео.html
    Propensity Scores: ruclips.net/video/dm-BWjyYQpw/видео.html
    Do-operator: ruclips.net/video/dejZzJIZdow/видео.html
    DAGs: ruclips.net/video/ASU5HG5EqTM/видео.html
    Regression techniques: ruclips.net/video/O72uByJlnMw/видео.html
    Towards Data Science article: towardsdatascience.com/causal-discovery-6858f9af6dcb?sk=2134f5b56c1ce943afdfebbf9e1dcb45

  • @zahrasaremi7308 2 years ago +2

    Thank you very much for this playlist! I'm doing my bachelor thesis on Learning Causal Structures from observational data and your videos are a great introduction.

    • @ShawhinTalebi 2 years ago

      Wow that's so cool! Glad the videos were helpful. Good luck on your thesis 😁

    • @fabianbleile9467 1 year ago

      How did it work out? I recall Judea Pearl being very sceptical of learning causal structures from observational data.

  • @shaikfiaz 2 years ago +1

    Great series. If different algorithms give different causal models then how do we choose between them? Any suggestions?

    • @ShawhinTalebi 2 years ago +1

      Great question! At the end of the day, all we are trying to do with causal discovery is generate a model that explains our data. This is what scientists have been doing for hundreds if not thousands of years. So we can do what we always do in science: compare our predictions against experiments.
      In other words, we can use the causal model to make inferences and check those inferences against reality.

  • @kaldirYT 3 years ago +2

    Another great vid, well done!

  • @ifycadeau 3 years ago +1

    I’ve been waiting for this one 🙌🏾

  • @habibollalatifizadeh2842 2 years ago +1

    Thank you for the great video!
    I have a question: how do you learn a DAG when the variables in the data are continuous, not just discrete (Boolean or categorical)?

    • @ShawhinTalebi 2 years ago

      Thanks for the question! The techniques I discussed here should work for both continuous and discrete data. However, we can always discretize a continuous variable by binning it or breaking it down into quantiles.
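
A minimal sketch of the two discretization options mentioned in this reply, assuming pandas and NumPy are available. The column name `age`, the synthetic values, and the choice of four bins are made up for illustration and do not come from the video.

```python
import numpy as np
import pandas as pd

# Hypothetical continuous variable; the name "age" and the values are made up.
rng = np.random.default_rng(0)
df = pd.DataFrame({"age": rng.uniform(18, 80, size=1000)})

# Equal-width binning: 4 bins that each span the same range of values.
df["age_bin"] = pd.cut(df["age"], bins=4, labels=False)

# Quantile binning: 4 bins that each hold roughly the same number of rows.
df["age_quartile"] = pd.qcut(df["age"], q=4, labels=False)

bin_counts = df["age_bin"].value_counts().sort_index()
quartile_counts = df["age_quartile"].value_counts().sort_index()
print(bin_counts)
print(quartile_counts)
```

Equal-width bins keep the cut points easy to interpret, while quantile bins keep the categories balanced. Which one suits a given causal discovery run is a judgment call, and either choice discards some information from the original variable.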

  • @PeacefulMusic-l7o 6 months ago

    THAT is veryyy helpful. Thanks a lot!!!

  • @roopalilalwani9217 2 years ago +1

    Hello, I am trying to learn more about causality. What I understood is that, to solve a causal inference problem (which focuses on the relationship between two variables), we often need a causal graph, especially if I want to use the structural causal model approach rather than the potential outcomes framework. One way to get the causal graph is to have strong domain knowledge, and the causal discovery methods explained here seem to be another. Please let me know if my understanding is correct.

    • @roopalilalwani9217 2 years ago

      Another question I have: are there causal models that can be used when we only have incident data and no data on non-incidents is available? Can apriori or causal Bayesian network models be used in that scenario?

    • @ShawhinTalebi 2 years ago

      Yes that sounds right. The way I see it is causal discovery is a tool we can use to derive causal models. Domain knowledge can help choose among candidate models derived from causal discovery.

    • @roopalilalwani9217 2 years ago

      @@ShawhinTalebi Thanks Shawhin! I edited the reply under the comment with another question. Please let me know your thoughts on that too.

    • @ShawhinTalebi 2 years ago

      @@roopalilalwani9217 I haven't come across anything like that, but I'd be curious to learn more!

    • @roopalilalwani9217 2 years ago

      @@ShawhinTalebi Thanks Shawhin. I am always working with a senior applied scientist who has solved something similar before. I will share something as soon as I have a concrete understanding!

  • @TheTessatje123 1 year ago

    This is really great :-) I wonder though if it works for time series and one-time events. For example, can you infer the effect of an event (a tweet from Bidon) on a rise in the S&P index?

    • @ShawhinTalebi 1 year ago

      I can see it working for time series after a little data preparation. One-time events feel more challenging but I’m sure there is something out there for those too.

  • @user-wr4yl7tx3w 6 months ago

    How about rank-deficient causal discovery? Could you consider that for a future video?

    • @ShawhinTalebi 6 months ago

      Great suggestion! I haven't used that before so looking forward to learning about it.

  • @user-wr4yl7tx3w 6 months ago

    What do you think of latent causal variables? Don't they turn the problem into a black box?

    • @ShawhinTalebi 6 months ago

      I haven't looked into latent causal variables yet, but that would also be a good topic for a follow-up.

  • @anadianBaconator 1 year ago

    this is fantastic!

    • @ShawhinTalebi 1 year ago +1

      Thanks, glad you liked it :)

    • @anadianBaconator 1 year ago

      @@ShawhinTalebi If you don't mind, can you outline the steps you took to install the R packages? I am having a tough time reproducing your results.

    • @ShawhinTalebi 1 year ago

      I found the cdt docs helpful: fentechsolutions.github.io/CausalDiscoveryToolbox/html/index.html
      I believe I ended up installing the packages listed in this repo one by one: github.com/FenTechSolutions/CausalDiscoveryToolbox/blob/master/r_requirements.txt

  • @oiendrilabasak6262 2 years ago +1

    In the GES algorithm, how is the direction of the edges determined?

    • @ShawhinTalebi 2 years ago

      Hi Oiendrila, thanks for your question! GES starts with an empty graph (i.e. nodes with no edges), then directed edges are iteratively added based on some choice of score (e.g. BIC). So unlike the PC algorithm, there is no need to orient edges as an extra step.
      Hope that helps!
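
The idea in this reply (start from an empty graph, then keep adding the directed edge that most improves a score) can be sketched as below. This is a simplified greedy forward search over DAGs with a linear-Gaussian BIC-style score, not the full GES algorithm, which searches over equivalence classes of graphs and also has a backward edge-removal phase. The function names, the synthetic chain X -> Y -> Z, and the scoring details are assumptions made for illustration and do not come from the video or from any particular library.

```python
import itertools
import numpy as np

def node_score(data, j, parents):
    """Linear-Gaussian score of node j given its parents (higher is better).

    This equals -BIC/2 under the common convention BIC = k*ln(n) - 2*ln(L).
    """
    n = data.shape[0]
    X = np.column_stack([np.ones(n)] + [data[:, p] for p in parents])
    beta, *_ = np.linalg.lstsq(X, data[:, j], rcond=None)
    resid = data[:, j] - X @ beta
    sigma2 = resid @ resid / n
    loglik = -0.5 * n * (np.log(2 * np.pi * sigma2) + 1)
    k = len(parents) + 2  # regression coefficients + intercept + noise variance
    return loglik - 0.5 * k * np.log(n)

def creates_cycle(parents, i, j):
    """Adding i -> j creates a cycle exactly when j is already an ancestor of i."""
    stack, seen = [i], set()
    while stack:
        node = stack.pop()
        if node == j:
            return True
        if node not in seen:
            seen.add(node)
            stack.extend(parents[node])
    return False

def greedy_forward_search(data):
    """Start with no edges; repeatedly add the single edge with the largest score gain."""
    d = data.shape[1]
    parents = {j: [] for j in range(d)}
    scores = {j: node_score(data, j, []) for j in range(d)}
    while True:
        best_gain, best_edge = 0.0, None
        for i, j in itertools.permutations(range(d), 2):
            if i in parents[j] or creates_cycle(parents, i, j):
                continue
            gain = node_score(data, j, parents[j] + [i]) - scores[j]
            if gain > best_gain:
                best_gain, best_edge = gain, (i, j)
        if best_edge is None:  # no edge improves the score, so stop
            return parents
        i, j = best_edge
        parents[j].append(i)
        scores[j] += best_gain

# Synthetic data from a hypothetical chain X -> Y -> Z (columns 0, 1, 2).
rng = np.random.default_rng(0)
n = 2000
x = rng.normal(size=n)
y = 2 * x + rng.normal(size=n)
z = 2 * y + rng.normal(size=n)
data = np.column_stack([x, y, z])

parents = greedy_forward_search(data)
skeleton = {frozenset((i, j)) for j, ps in parents.items() for i in ps}
print(parents)
```

With this kind of score, X -> Y and Y -> X fit the data equally well, so the direction chosen for some edges comes down to tie-breaking. That is one reason score-based methods are usually described as recovering a graph only up to an equivalence class.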

    • @oiendrilabasak6262 2 years ago +1

      @@ShawhinTalebi Thank you so much for your answer. Yes it helped!

  • @karannchew2534 6 months ago

    Looks like different algorithms give different graphs. How do we validate them? How do we decide which is best?

    • @ShawhinTalebi 6 months ago +1

      The simplest way is via expert domain knowledge. A more objective (and time-consuming) way would be to generate predictions from the graphs and test them against experiments.

    • @karannchew2534 6 months ago

      @@ShawhinTalebi Thanks

  • @rohansinghwilkho8023 2 years ago +1

    Are there metrics to validate causal models built through causal discovery?

    • @ShawhinTalebi 2 years ago +1

      Hi Rohan, thanks for the question! The short answer is yes. This is what is used for Trick 2, where your validation metric serves as a score. One I have come across is the Bayesian Information Criterion (BIC), which is mentioned briefly in reference [1] on slide 6. However, I am sure there are others out there.
      Now, the nuanced answer depends on what you mean by validate. Generally we do not know the true DAG that describes our system of interest. In such cases, we have to be careful about how we evaluate the efficacy of the DAG, since we have no ground truth to compare against. For example, BIC balances the DAG's ability to generate the underlying data with a penalty for the number of parameters (i.e. nodes and edges).
      Hope that helps!
      Shaw
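
For reference, the trade-off described in this reply matches the common textbook definition of BIC, which can be written as follows. The symbols are notation introduced here, not taken from the video: $k$ is the number of free parameters in the model, $n$ is the number of samples, and $\hat{L}$ is the maximized likelihood of the data under the model.

```latex
\mathrm{BIC} = k \ln n - 2 \ln \hat{L}
```

Under this sign convention a lower BIC is preferred: the $-2 \ln \hat{L}$ term rewards fit to the data, while the $k \ln n$ term penalizes extra parameters. Some tools report the negated or halved quantity as a score to maximize, so it is worth checking which convention a given implementation uses.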

    • @rohansinghwilkho8023 2 years ago

      Exactly, my concern is regarding cases where we do not know the true DAG. How do we ascertain the correctness of the DAG we obtained in such cases?

    • @ShawhinTalebi 2 years ago

      That is a very good (and deep) question. Putting the philosophical implications aside, my take is we can do what we always do in science and test the DAG via experiment. If the DAG can generate predictions validated by experimental data, that may give us some confidence in its validity.

  • @ares106 2 years ago +1

    So based on the example in this video I think I can reasonably conclude that all 3 causal discovery "tricks" are more or less garbage at finding causal DAGs.

    • @ShawhinTalebi 2 years ago +1

      That’s fair, it’s still early for these types of techniques. In my experience, I’d say most approaches don’t give nice results for most problems out of the box. It typically takes a little TLC and insight to make things work well.

    • @ares106 2 years ago +3

      @@ShawhinTalebi can you imagine if you could accurately and automatically infer causal models and relationships from some reasonable amount of real world data full of confounders and messy chaos. You would almost become a God amongst men. Solve the stock market even.

    • @ShawhinTalebi 2 years ago +2

      I suppose that is the dream of this kind of approach

  • @soupizcool 2 years ago +2

    "If you give someone a graduate degree it's not going to have any effect on their age"
    My grey hairs beg to differ!

  • @KongQuestCo 3 years ago +2

    You ever get nervous before reading a name?