AWS Tutorials - Flat nested data with “Flatten” Transform in AWS Glue Studio
HTML-код
- Опубликовано: 4 окт 2024
- Many times, the data platforms work with nested data and it needs to flat the nested data for the business need. AWS Glue Studio Flatten transformation can flatten the nested structure at any level. Learn how to use flatten transform in an ETL job.
- Наука
Thanks, I just tried it with Glue 4.0 version. The data is available in the Glue data catalog with multiple nested levels of values. I did use the Flatten and store it as a CSV in S3. But got the following error.
Error Category: QUERY_ERROR; AnalysisException: Ambiguous reference to fields StructField(Status,StringType,true), StructField(status,StringType,true)
What is the difference between flatten and relationalize transform in AWS Glue? Can we use flatten tranform on highly nested xml files? Please explain
Is this a new transform that was recently added to Glue Studio ?
yes. it is.
I think it doesn't work with Glue 4.0
I did not find it anywhere in documentation. I will test for it.
It's strange. I dont see this transformation in my console. I dont see UUID transformation that you see either.
Sounds strange. Checked documentation. It is not limited to any region.
Can we catalog the nested data from s3 bucket or not?
yes it can.