AWS Tutorials - AWS Glue Data Quality - Automated Data Quality Monitoring

Поделиться
HTML-код
  • Опубликовано: 17 дек 2022
  • AWS Data Quality is an automated serverless services to monitor and evaluate data qualilty both at rest and in move within the ETL job. It can evaludate qualilty for both statistics and values of the data. Learn how to use AWS Data Quality to evaluate data at rest as well as in move.
  • НаукаНаука

Комментарии • 20

  • @mranaljadhav8259
    @mranaljadhav8259 Год назад +3

    Welcome back sir, waiting for your more videos .. I learned alot from you... Thanks for providing this tutorials for free

  • @lucasoliveira7309
    @lucasoliveira7309 4 месяца назад

    Great video, i was already going to resolve that with a lambda, so more easy with glue data quality, thank you

  • @chengchangyu
    @chengchangyu Год назад

    thanks for the video. very details.

  • @arunr2265
    @arunr2265 Год назад

    Welcome back brother. waiting for your videos. Hope everything is fine

  • @pathakhemant-eb3du
    @pathakhemant-eb3du Год назад +1

    hey I love your tutorials, Thank You for making our life simpler. so I want know that can we do data warehouse testing with this tool when tables is in Redshift

  • @arun.ayilliath
    @arun.ayilliath Год назад +1

    Great demo! The retry count should have been 0 to prevent re-running.

  • @hsz7338
    @hsz7338 Год назад

    Thank you for taking us through the new feature that AWS Glue offers. Do you see Glue Data Quality replacing Glue Data Brew, at least from the Data Quality perspective?

    • @AWSTutorialsOnline
      @AWSTutorialsOnline  Год назад +1

      I don't think data quality in Brew will be replaced. Both will exist. Brew is more for adhoc data preparation and Glue job for automated. Both need data quality feature for their purposes.

  • @yinggamonkulsarapitak7948
    @yinggamonkulsarapitak7948 Год назад +1

    Great vid! Thanks!
    Can this Data Quality integrated with CI/CD and Terraform?

    • @AWSTutorialsOnline
      @AWSTutorialsOnline  Год назад

      You mean to be able to configure Data Quality using infrastructure as Code. I am not sure - I did not check CloudFormation or Terraform. But it does support APIs for sure.

  • @scotter
    @scotter Год назад +1

    In your Glue demo, it *seemed* you skipped showing a part. How did you get from a file being dropped into the S3 bucket/sales to it becoming a table? I'm looking for the most code-light way to set this up so my lambda will somehow be triggered once the file is turned into a table, so my lambda can then run the rules defined in console and then write a log file to other S3 bucket/folder of which rows/columns failed. Thank you!

    • @AWSTutorialsOnline
      @AWSTutorialsOnline  Год назад

      The table is created using AWS Crawler. I did not mention that because I have covered than my other tutorials.

  • @user-tm2dw4iv9k
    @user-tm2dw4iv9k Год назад +1

    Is it possible to this entire thing using boto3 in python

  • @cheluveshab9525
    @cheluveshab9525 Год назад

    Hi Brother, I’m a big fan of yours. I have learned many things from your channel and thanks a lot.
    Please provide your LinkedIn.