AWS re:Invent 2021 - Building a data lake on Amazon S3

Поделиться
HTML-код
  • Опубликовано: 23 июл 2024
  • Flexibility is key when building and scaling a data lake, and by choosing the right storage architecture, you will have the agility to quickly experiment and migrate to AWS. This session explores best practices for building a data lake on Amazon S3, which allows you to leverage industry-leading AWS, open-source, and third-party analytics and ML tools and gain insights from your data. This session also explores how to optimize your storage on Amazon S3 for data lakes, including information on storage classes, S3 access points, and running HPC workloads with Amazon FSx for Lustre.
    Learn more about re:Invent 2021 at bit.ly/3IvOLtK
    Subscribe:
    More AWS videos bit.ly/2O3zS75
    More AWS events videos bit.ly/316g9t4
    ABOUT AWS
    Amazon Web Services (AWS) hosts events, both online and in-person, bringing the cloud computing community together to connect, collaborate, and learn from AWS experts.
    AWS is the world’s most comprehensive and broadly adopted cloud platform, offering over 200 fully featured services from data centers globally. Millions of customers-including the fastest-growing startups, largest enterprises, and leading government agencies-are using AWS to lower costs, become more agile, and innovate faster.
    #AWS #AmazonWebServices #CloudComputing
  • НаукаНаука

Комментарии • 16

  • @MrDottyrock
    @MrDottyrock Год назад +2

    This is very eye opening. I used AWS for several years and never thought S3 could serve such purpose. this is fantastic!

    • @awssupport
      @awssupport Год назад

      We're glad to see you enjoyed it, Jamiu! 👀 ^SA

  • @djohnjimmy
    @djohnjimmy 2 года назад

    This is a very helpful intro into days lake design with S3. Thank you

  • @samuel_william
    @samuel_william Год назад

    This video clearly explains about the storage-s3. Very good video to learn about s3

  • @umairqamar2672
    @umairqamar2672 Год назад +1

    This was super duper amazingly wonderful to watch !

  • @mbaapohelviszonepoh1284
    @mbaapohelviszonepoh1284 Год назад +1

    This is very helpful. Thanks very much

  • @alexfaith5562
    @alexfaith5562 2 года назад

    What an awesome video!

  • @rifkiamil
    @rifkiamil Год назад

    We had 200gb of data in MS OLAP in 2008 coming out of terabyte ERP system. Not sure where getting his numbers from. 9:46

  • @severtone263
    @severtone263 2 года назад +1

    Thank you for this, this was very helpful

    • @masterek1998
      @masterek1998 Год назад

      Ijiibjj😅jiiiihbiijiiiiijiibijiiiiiiii

  • @yogenderpal
    @yogenderpal 2 года назад

    Hadoop replicates data in three different nodes, so we need to lose 2 nodes before we start to worry about data loss. He said we need to lose 3 data nodes.:)

    • @rbr951
      @rbr951 Год назад

      Good catch

  • @user-sw9kd9pv4n
    @user-sw9kd9pv4n 2 года назад +3

    24:24

  • @lordlee6473
    @lordlee6473 2 года назад +3

    Not much on data lake, more of a talk about S3 features for what it’s intended for, data storage. Actual data analysis and reporting is done with other AWS services.

    • @rbr951
      @rbr951 Год назад +1

      True that. To that extent a little disappointing. Datalake != s3