AWS Tutorials - Amazon Athena Query Cost Optimization

Поделиться
HTML-код
  • Опубликовано: 30 июл 2024
  • Our website - aws-dojo.com
    Amazon Athena is a Serverless and interactive query service to query and analyze data in Amazon S3 using standard SQL. Amazon Athena cost depends on amount of data scanned to bring the query result. The amount of data scanned can be optimized using data partitioning and parquet format based storage. The video explain these two methods and then does a live demo to show the result of the optimization.
  • НаукаНаука

Комментарии • 9

  • @RoryDavidWatts
    @RoryDavidWatts 6 месяцев назад

    Great tutorial, thank you

  • @victorfeight9644
    @victorfeight9644 Год назад

    great summary

  • @venkatasubbareddy9184
    @venkatasubbareddy9184 11 месяцев назад

    Can you share me the data which you used in this tutorial ?

  • @mkalicharan
    @mkalicharan 2 года назад +1

    Excellent video, loved it. So going by this what will be the cost of the athena query? if the sum of all data scanned on multiple attemps across days is 1tb then we are charged 5$ isnt it?

  • @kingzao9046
    @kingzao9046 Год назад

    What happens if you do a JOIN ? How is the cost computed ?

    • @AWSTutorialsOnline
      @AWSTutorialsOnline  Год назад

      It will scan files of both data sets and sum of both scans will determine the cost.

  • @ivenk1976
    @ivenk1976 2 года назад

    If there are multiple partition like 'State' and other as 'likeSports' Then will my data storage size will double?

    • @AWSTutorialsOnline
      @AWSTutorialsOnline  2 года назад

      partition is like hierarchy so the first partition on "state" and then on "likesports". So first it will create multiple folders for "state" and then within "state" folder; create multiple folders for "likesports". So the storage size is not doubled.