Data Warehousing With BigQuery: Best Practices (Cloud Next '19)

Поделиться
HTML-код
  • Опубликовано: 29 янв 2025

Комментарии • 21

  • @jennwng
    @jennwng Год назад +3

    At 13:36, why do we need to load into GCS first for batch loads? Can't we use Dafaflow/BigQuery BatchLoad / Data Fusion directly from GoldenGate?

  • @LuckyHongTJ
    @LuckyHongTJ Год назад +2

    At 13:35, what is the benefit of loading data from Oracle into GCS first and then BigQuery (i.e. won't directly loading into BigQuery from Oracle be faster)? I know GCS can serve as a staging area and we get another copy of the data for fault tolerance, but is there any other benefit? Thanks :)

  • @jennwng
    @jennwng Год назад +3

    At 11:58 why DataFlow / Data Fusion can speed up pipelines? Like, the essence is either to do the transformation in BigQuery or in DataFusion. Even if the query is complex, won't it still be faster to do the transformation directly in BigQuery, rather than connecting to DataFusion and transforming data there?

    • @vishnureddys4801
      @vishnureddys4801 10 месяцев назад

      It is because they can bill you in both Services, for using BigQuery and also dataflow.

  • @kunfang2457
    @kunfang2457 3 года назад +4

    the best 'best practice' sharing session i ve ever had

  • @3rdinnings326
    @3rdinnings326 Год назад +1

    Excellent session!

  • @anandakumarsanthinathan4740
    @anandakumarsanthinathan4740 2 года назад +1

    Beautifully explained. Many thanks. Now, why wouldn't anybody want to move to the sunny side of France and land a job at Teads !! Both of the presenters did an excellent job.

  • @TheMitali_
    @TheMitali_ 4 года назад +2

    How to perform sum of columns value in big query. And no of columns are notfixed , at runtime we need to decide no of columns need to consider for sum. Depends on user inputs

  • @krishnabg1350
    @krishnabg1350 4 года назад +5

    Can you share slides?

  • @ThisIsAli_Off
    @ThisIsAli_Off 2 года назад

    Very useful!

  • @Thiago280690
    @Thiago280690 Год назад +1

    6:30 do nada aparece uma nota de dez conto hahaha adorei

  • @Vinch157
    @Vinch157 5 лет назад +15

    Hello, will you also share the slides? Thank you

  • @mzamanmintu3694
    @mzamanmintu3694 Год назад

    Congratulations

  • @LouisChiaki
    @LouisChiaki 3 года назад +3

    How is this different from Spark?

    • @jean4j_
      @jean4j_ 3 года назад

      Well it's auto-managed and much simpler to work with.
      That's just SQL. You don't need to fine-tune the configuration like you would for a Spark job I think.

  • @sunilpipara
    @sunilpipara 2 года назад +2

    Kafka to S3 and S3 to Cassandra via Spark is wrong approach.

    • @anandakumarsanthinathan4740
      @anandakumarsanthinathan4740 2 года назад +1

      Yes, @Sunil Jain. I feel the same too. The source application would directly write it out to S3 for batch processing. Kafka would be required only for the streaming jobs.

  • @amieewright4417
    @amieewright4417 4 года назад

    H thanks u so much love you amiee Wright and Bella daws Xxxxx

  • @JOPINC
    @JOPINC 3 года назад

    Dedicado a JMB!!!! :)

  • @GreenPower4ever
    @GreenPower4ever 3 года назад

    P