AWS Tutorials - Build Enterprise Scale Python ETL Jobs using AWS Glue on Ray

Поделиться
HTML-код
  • Опубликовано: 4 окт 2024
  • AWS Glue comes with a new engine option called Ray. Ray allows to process large amount of data using python script and python libraries. Ray is based on open-source compute framework and it helps build enterprise level scalable jobs as it leverages distributed processing of the data.

Комментарии • 9

  • @jvmunhoz98
    @jvmunhoz98 Год назад +1

    Great video.
    I'm currently refactoring some python shell jobs into Ray cause most of them were getting a bit too big for just 1 DPU.
    But I'm having problems importing job parameters into the script. I usually import the getResolvedOptions function from the awsglue.utils library, but ray it doesn't support awsglue, which is odd. Should I add it manually or just use another method for importing job paramenters?

    • @AWSTutorialsOnline
      @AWSTutorialsOnline  Год назад

      Approach to retrieve the parameter is different. Check this link - docs.aws.amazon.com/glue/latest/dg/author-job-ray-job-parameters.html

  • @KannanPoompatham
    @KannanPoompatham Год назад

    Hi, I watched some of your videos and liked it, but I didn't see any videos that covers AWS Amplify. Do you want to cover that as well?

    • @AWSTutorialsOnline
      @AWSTutorialsOnline  Год назад

      Hi, unfortunately, I don't have expertise in AWS Amplify. Sorry about that.

    • @KannanPoompatham
      @KannanPoompatham Год назад

      @@AWSTutorialsOnline No Problem. Thanks

  • @cheluvesha
    @cheluvesha Год назад

    Brother Thanks for the video, When to use Glue and when to use EMR?

    • @AWSTutorialsOnline
      @AWSTutorialsOnline  Год назад

      Glue can be used only with Apache Spark and Python. While, EMR support additional frameworks such as Hadoop, Hive, Presto etc. So use EMR when you want to work on these framework other than Apache Spark and Python. Hope it helps.

    • @cheluvesha
      @cheluvesha Год назад

      @@AWSTutorialsOnline thanks it makes sense now.

  • @unw1718
    @unw1718 Год назад

    is there a way to add a vpc?