Querying 100 Billion Rows using SQL, 7 TB in a single table

  • Published: 15 Nov 2024

Comments • 35

  • @TheElementFive
    @TheElementFive 1 year ago +4

    The first question you should always ask when working with a 100 billion row database: “Why do I have a 100 billion row database?”

    • @davidlean8674
      @davidlean8674 1 year ago +4

      And the answer would be "because I work with a multinational enterprise customer". If you have a large market share in China (1 billion people), India (1 billion people), Europe (0.75 billion) and the USA (350 million people), it doesn't take long to get to 100 billion transactions.
      If you want to do financial year-on-year comparisons, you need to keep at least 24 months of data, usually 36 months.
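The arithmetic behind that reply can be sketched roughly as follows. Only the population figures come from the comment; the 10% market share, the 10 transactions per customer per month, and the `estimate_rows` helper are illustrative assumptions, not numbers from the video.

```python
# Back-of-envelope row estimate.
# Population figures are the ones quoted in the comment above.
POPULATION = {
    "China": 1_000_000_000,
    "India": 1_000_000_000,
    "Europe": 750_000_000,
    "USA": 350_000_000,
}


def estimate_rows(market_share_pct, tx_per_customer_per_month, months_retained):
    """Estimate how many transaction rows are kept in the table.

    All three parameters are hypothetical inputs chosen for illustration.
    Integer arithmetic is used so the result is exact.
    """
    customers = sum(POPULATION.values()) * market_share_pct // 100
    return customers * tx_per_customer_per_month * months_retained


# Assumed scenario: 10% market share, 10 transactions per customer per month,
# 36 months of history retained.
rows = estimate_rows(market_share_pct=10, tx_per_customer_per_month=10, months_retained=36)
print(f"{rows:,} rows")
```

Under these assumed inputs the estimate passes 100 billion rows with 36 months of history; with 24 months it stays below that mark.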

  • @alok5253
    @alok5253 3 years ago +9

    Simple and concise, thank you!

  • @vaibhavis1
    @vaibhavis1 2 years ago +3

    Thanks for the explanation. I am curious: is it just scaling of the systems, or does BigQuery do query optimization to reduce the latency as well?

  • @Hrzzz1
    @Hrzzz1 1 year ago

    Can we download this database to do some tests?
    A nice idea for the next video would be to compare this same situation with a NoSQL database.

  • @mathteacher5670
    @mathteacher5670 1 year ago

    Excellent, sir, thank you so much. Highly motivational for a passionate person.

  • @JunaidKhan-gq8nw
    @JunaidKhan-gq8nw 1 month ago

    Great, Thanks a lot, sir.

  • @houssem25000
    @houssem25000 5 months ago

    So I don't have to care about performance when I make projects?!

  • @WanderWisdom731
    @WanderWisdom731 2 years ago

    Wow, this experiment was a really amazing way to benchmark BigQuery.

  • @Rpskmr
    @Rpskmr 8 months ago

    Nice video, but while narrating it would be better to expand the screen than to show side-by-side videos.

  • @ashitoshthakur9402
    @ashitoshthakur9402 3 years ago

    Wow, what a great video, sir ji. Please, sir, make a video on SQL with ML, and on SQL too.

  • @abhijayrajvansh
    @abhijayrajvansh 3 months ago +1

    it's always an Indian guy!

  • @vipulkumar7938
    @vipulkumar7938 3 years ago +1

    Well Explained, Thanks a lot

  • @merhaiakshay9625
    @merhaiakshay9625 3 years ago

    Please organize the videos and make playlists. Great video, very informative and helpful, which led me to subscribe. Thanks 😊

  • @AamirKhan-vu2om
    @AamirKhan-vu2om 2 years ago

    Hey, very informative. I came here while searching for big data processing in seconds. I have a question: I would like to build a system where I import terabytes of data into a single table with keys, and I want to perform all the DML operations so that they take as little execution time as shown here. Please help me out with how I can achieve this. I'm stuck.

  • @MDDM03
    @MDDM03 1 year ago +1

    A marketer for Google Cloud. Nothing here states what to improve.

  • @Mju98
    @Mju98 8 months ago

    Hello sir. I tried to import 400k rows of data into the BigQuery sandbox, but ended up with more errors. Is it possible to import that data? Please, can anyone help me? It's urgent (interview assignment).

  • @skill-learning
    @skill-learning 3 years ago

    I appreciate your effort. Could you share the link used for the Google Cloud project?

  • @PradeepMishra-qs2hz
    @PradeepMishra-qs2hz 2 years ago

    Awesome. Keep it up.

  • @nfacundot
    @nfacundot 1 year ago

    Hello, can I connect to it from PHP?

  • @toxiclife
    @toxiclife 1 year ago

    What should I do when I want to overwrite 100 million rows into a new table, in minutes?
    I am using df.write.mode("overwrite").saveAsTable("FINAL"). Could you please help with this?

  • @aminremiiii
    @aminremiiii 2 years ago

    Please, I have been looking for this for 50 days. I want to create 2000 users in MySQL and set the phone number as the username and password. Can you tell me how I can create many users with a default password? That's

  • @prathivenkatasaipavan9909
    @prathivenkatasaipavan9909 3 years ago

    Great explanation

  • @Helloimtheshiieet
    @Helloimtheshiieet 2 years ago

    I'm confused. Were these indexes?

    • @elastiqai
      @elastiqai 1 year ago

      BigQuery doesn't have indexes. It has partitions and clustering.
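As a rough illustration of what that reply refers to, here is a toy Python model of partition pruning and clustering. It is not BigQuery's implementation: the `PartitionedTable` class, its methods, and the sample rows are all hypothetical and exist only to show why a filter on the partition column and the cluster key lets a query skip most of the data without an index.

```python
from datetime import date


class PartitionedTable:
    """Toy table: one partition per day, rows in each partition sorted by a cluster key."""

    def __init__(self, cluster_key):
        self.cluster_key = cluster_key
        self.partitions = {}  # date -> list of row dicts, kept sorted

    def insert(self, day, row):
        rows = self.partitions.setdefault(day, [])
        rows.append(row)
        # Keeping rows sorted by the cluster key stands in for clustering.
        rows.sort(key=lambda r: r[self.cluster_key])

    def total_rows(self):
        return sum(len(rows) for rows in self.partitions.values())

    def query(self, day, key_value):
        """Return (matching rows, rows scanned) for one day and one key value."""
        hits, scanned = [], 0
        # Partition pruning: only the partition for `day` is touched.
        for row in self.partitions.get(day, []):
            scanned += 1
            if row[self.cluster_key] == key_value:
                hits.append(row)
            elif row[self.cluster_key] > key_value:
                # Rows are sorted, so nothing later can match.
                break
        return hits, scanned


table = PartitionedTable(cluster_key="customer_id")
for cid in (3, 1, 2):
    table.insert(date(2024, 1, 1), {"customer_id": cid, "amount": 10 * cid})
for cid in (1, 2):
    table.insert(date(2024, 1, 2), {"customer_id": cid, "amount": 5 * cid})

hits, scanned = table.query(date(2024, 1, 1), key_value=1)
print(hits, "scanned", scanned, "of", table.total_rows(), "rows")
```

In this sketch the query reads two of the five stored rows: the other day's partition is never opened, and the sort order lets the scan stop early.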

  • @visva2005
    @visva2005 3 years ago

    @Arpit Agrawal, good. Can you tell me what database is behind this console?

    • @elastiqai
      @elastiqai 3 years ago

      Google Cloud BigQuery 😁

  • @ungeedh
    @ungeedh 3 years ago

    Nicely explained.

  • @davidlean8674
    @davidlean8674 1 year ago

    This is nice but not that impressive. Obviously, the table is being stored using columnstore compression techniques, so you only need to read the columns in the select list. The values are typically grouped in blocks of 1 million or more, and the block header pages keep row-count values. So you are not reading every row, just the block headers of a single column.
    If your query forced a scan of all rows in the "block", requiring them to be combined with other fields in the same row or in other tables before you could filter, you would no longer be in the columnstore sweet spot, and the difference in query speed would be more striking.
    Still good though, as that is a common use case.
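The point in this comment can be sketched with a toy column store in Python. This is a minimal model of the commenter's description, not of BigQuery's actual storage format: `ColumnBlock`, `ColumnStoreTable`, the block size, and the sample data are all assumptions made for illustration.

```python
class ColumnBlock:
    """A block of values from one column, with a row count kept in its header."""

    def __init__(self, values):
        self.values = list(values)
        self.row_count = len(self.values)  # header metadata


class ColumnStoreTable:
    def __init__(self, columns, block_size):
        # columns: dict of column name -> list of values, all the same length
        self.blocks = {
            name: [
                ColumnBlock(vals[i:i + block_size])
                for i in range(0, len(vals), block_size)
            ]
            for name, vals in columns.items()
        }
        self.values_read = 0  # counts individual values touched by queries

    def count_rows(self):
        """COUNT(*): sum the header row counts of a single column, read no values."""
        first_column = next(iter(self.blocks.values()))
        return sum(block.row_count for block in first_column)

    def sum_where(self, sum_col, filter_col, predicate):
        """SUM(sum_col) WHERE predicate(filter_col): must read both columns row by row."""
        total = 0
        for sum_block, filter_block in zip(self.blocks[sum_col], self.blocks[filter_col]):
            for s, f in zip(sum_block.values, filter_block.values):
                self.values_read += 2
                if predicate(f):
                    total += s
        return total


table = ColumnStoreTable(
    {"amount": list(range(1, 11)), "region": ["EU", "US"] * 5},
    block_size=4,
)
print(table.count_rows(), "rows,", table.values_read, "values read")
print(table.sum_where("amount", "region", lambda r: r == "EU"), "total,",
      table.values_read, "values read")
```

The count is answered from block headers alone, while the filtered sum has to touch every value in two columns, which is the contrast the comment describes.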

  • @arthurrodrigues5382
    @arthurrodrigues5382 2 years ago

    Amazing!

  • @himanish2006
    @himanish2006 2 years ago

    This is good...

  • @MdRakib-rc6ub
    @MdRakib-rc6ub 2 years ago

    I need your help

  • @muhamadridwan4766
    @muhamadridwan4766 2 years ago

    wow!

  • @sconnell194
    @sconnell194 3 years ago

    👍