End-to-end Exactly-once Aggregation Over Ad Streams | Yelp

Поделиться
HTML-код
  • Опубликовано: 17 окт 2024

Комментарии • 10

  • @SreenathV
    @SreenathV 4 года назад +3

    Very good explanation of the problem statement and the solution aspects.

  • @sumitmlk
    @sumitmlk 5 лет назад +1

    Very nice self contained talk. Covers the problem and its solution, quite clearly. Thanks.

  • @primbo1212
    @primbo1212 9 месяцев назад

    how about if we need the data for 5 mins of a day. Does this approach still works?

  • @nishanksoni7120
    @nishanksoni7120 2 года назад +1

    awesome talk...covers all failure scenarios

  • @kamalsmusic
    @kamalsmusic Год назад

    Can we just have all the data for a given campaign_Id go to the same partition, so we don't have to keep track of the offset on a per partition basis?

    • @balasravandindukurthi4702
      @balasravandindukurthi4702 Год назад +1

      Multiple campaign_ids will still end up going to one partition right. So, we should always track offset aggregated per partition

    • @atabhatti6010
      @atabhatti6010 Год назад

      In one word: no. There are two issues: 1) one campaign may create more load than one node can handle and 2) there are many more campaigns than nodes or partitions. Because of #1, we will need to spread the campaign onto several partitions. Because of #2, we will map multiple campaigns into fewer partitions.

  • @HarshTandon-kn6md
    @HarshTandon-kn6md 9 месяцев назад

    Fantastic talk

  • @nosh3019
    @nosh3019 Год назад

    thanks! great talk :)