AWS Tutorials - Joining Datasets in AWS Glue ETL Job

Поделиться
HTML-код
  • Опубликовано: 25 фев 2023
  • Joining two or more datasets to create a curated dataset for a business purpose is a very common requirement one would find when building an ETL job. Learn how you can build an ETL Glue Job using AWS Glue Studio which joins two datasets, transforms the joined dataset and finally writes to the destination location.
  • НаукаНаука

Комментарии • 7

  • @SusinMahe
    @SusinMahe Год назад +2

    Yes, exactly this is what I was looking for for the last few days. Thank you for making this.!!

  • @narens4471
    @narens4471 3 месяца назад

    Thanks for the video, Can you describe how this job was run behind the scenes and any way to control the parquet file size per block size?

  • @theotherdude1998
    @theotherdude1998 Год назад

    I am having an issue in aws glue where aws glue is not saving the on conditions in the join. I have no idea how to fix this and could use any help.

  • @anandrane9439
    @anandrane9439 Год назад

    sir can you please kindly provide the thease sample dataset you are using in the s3 bucket so we can practice on that it will be more convenient for us to practice ,and thank you for making this kind of unique and knowledge oriented videos,again thank you sir.

    • @AWSTutorialsOnline
      @AWSTutorialsOnline  Год назад +1

      I can try but generally I source my data from kaggle.com. If you use this site, you find loads of sample data there including the one I use.