60. Databricks & Pyspark: Delta Lake Audit Log Table with Operation Metrics

  • Published: 31 Dec 2024

Comments • 40

  • @mohitupadhayay1439
    @mohitupadhayay1439 7 months ago +3

    I had this exact requirement right now, and the video popped up at just the right time.
    Thank you, Raja.
    Databricks for DE wouldn't be easy to learn if you weren't around.

  • @ruinmaster5039
    @ruinmaster5039 1 year ago +1

    The best explanation for such an important problem!
    Please make as many videos like this as possible.

  • @blacknwhitenblue
    @blacknwhitenblue 1 year ago +1

    Very good content, concise and clear.

  • @seasql
    @seasql 2 years ago +2

    What about parallel transactions? How can we catch the latest records with history(1) if the table is inserted/deleted/updated from multiple sessions? We cannot ensure that history(1) returns the latest statistics...
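
    A minimal sketch of one way to handle this, assuming the audit pipeline tracks the last audited version instead of relying on history(1); the table names (target_table, audit_log) are illustrative, not from the video:

    from delta.tables import DeltaTable
    from pyspark.sql import functions as F

    dt = DeltaTable.forName(spark, "target_table")

    # Highest version already captured in the audit table (-1 on the first run)
    last_v = spark.table("audit_log").agg(F.max("version").alias("v")).first()["v"]
    last_audited = last_v if last_v is not None else -1

    # Every commit after that version is picked up, whichever session produced it
    new_commits = (dt.history()
                     .filter(F.col("version") > last_audited)
                     .select("version", "timestamp", "operation", "operationMetrics"))

    new_commits.write.mode("append").saveAsTable("audit_log")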

  • @vipinkumarjha5587
    @vipinkumarjha5587 3 years ago +1

    This is superb... very useful

  • @mohitupadhayay1439
    @mohitupadhayay1439 7 months ago

    Where did the path to delta_merge come from?

  • @purnimasharma9734
    @purnimasharma9734 2 years ago

    The video is awesome, thanks for sharing! Can you please provide the notebook?

  • @sravankumar1767
    @sravankumar1767 3 years ago +1

    Superb bro👌👍

  • @rajunaik8803
    @rajunaik8803 1 year ago +1

    Hi Raja, this is great, but just a quick question: in your case, history(1) will always give you the one latest record irrespective of the operation performed on the delta table, which is not correct, right?
    Is there any way we can skip reading the record from history when there has been no operation on the delta table?

    • @rajasdataengineering7585
      @rajasdataengineering7585  1 year ago

      Hi Raju, good question.
      Even table creation is considered an operation, so we will always have at least one operation for any table.

    • @rajunaik8803
      @rajunaik8803 1 year ago +1

      @@rajasdataengineering7585 Thanks Raja for the reply. I was wondering: let's say my notebook keeps performing SCD Type 1 merges and keeps inserting audit data into the audit log table.
      Suppose a given run produces no operation on the delta table (no insert, no delete, no update, nothing). My audit logic will still read the latest record from history (already inserted into the audit log as part of the previous run) and create a duplicate in the audit log table, right?
      Hope I am making sense here :)

    • @rajasdataengineering7585
      @rajasdataengineering7585  1 year ago +1

      The only scenario where no insert/update/delete is performed is when the source dataframe is empty, which means there is no input file for your Databricks pipeline.
      When there is no input data, what is the need to run the pipeline?

    • @rajunaik8803
      @rajunaik8803 1 year ago

      @@rajasdataengineering7585 Understood. Thanks, Raja!!
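
      A minimal sketch of the guard discussed in this thread, assuming the audit_log table records the delta version of each commit; the names (target_table, audit_log) are illustrative:

      from delta.tables import DeltaTable

      dt = DeltaTable.forName(spark, "target_table")
      latest = dt.history(1).select("version", "timestamp", "operation", "operationMetrics")

      # Drop any version audit_log has already recorded, so a run with no new
      # delta operation appends nothing instead of duplicating the previous row
      audited = spark.table("audit_log").select("version")
      fresh = latest.join(audited, on="version", how="left_anti")

      fresh.write.mode("append").saveAsTable("audit_log")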

  • @subbua4331
    @subbua4331 2 years ago +1

    Thanks, very helpful. Is there a way to enable auditing on delta table reads (for example, who selected which columns from a delta table)?

    • @rajasdataengineering7585
      @rajasdataengineering7585  2 years ago +1

      Hi Subbu, as far as I know that option is not yet available, but I need to check to confirm... will check and let you know.

    • @subbua4331
      @subbua4331 2 years ago +1

      @@rajasdataengineering7585 Thanks Raj!
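
      A hedged sketch for the read-audit question above: on newer Databricks workspaces with Unity Catalog system tables enabled, read events can reportedly be queried from system.access.audit; the table and column names below are assumptions to verify against your workspace, not something shown in the video:

      # Recent audit events; action names and columns vary by Databricks release
      reads = spark.sql("""
          SELECT event_time, user_identity.email AS user, action_name, request_params
          FROM system.access.audit
          WHERE service_name = 'unityCatalog'
          ORDER BY event_time DESC
      """)
      reads.show(truncate=False)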

  • @ranjansrivastava9256
    @ranjansrivastava9256 1 year ago +1

    After the merge operation, are you inserting these metrics into the audit_log table, or are they captured in audit_log automatically after merge/insert/delete operations? Please share your thoughts.
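
    A minimal sketch of the distinction being asked about: Delta writes operationMetrics into its own transaction log automatically, but a separate audit_log table is only populated by an explicit read-and-append step; table names here are illustrative:

    from delta.tables import DeltaTable

    dt = DeltaTable.forName(spark, "target_table")

    # Latest commit: version, operation (MERGE/WRITE/DELETE, ...) and its metrics
    latest = dt.history(1).select("version", "timestamp", "operation", "operationMetrics")

    # The explicit step -- nothing lands in audit_log automatically
    latest.write.mode("append").saveAsTable("audit_log")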

  • @krishnamurthy9720
    @krishnamurthy9720 3 years ago +1

    Thanks for the video.

  • @manigandang6921
    @manigandang6921 1 year ago

    Hi bro, nice! But one doubt: how did you insert those values into the audit table?

  • @SurajKumar-hb7oc
    @SurajKumar-hb7oc 1 year ago

    Hi,
    where do you create the audit_log table, and how does the table show that data?
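
    A minimal sketch of one way to create the audit table up front, with a schema mirroring the delta history columns; the name audit_log and the column set are illustrative, not taken from the video's notebook:

    spark.sql("""
        CREATE TABLE IF NOT EXISTS audit_log (
            version BIGINT,
            timestamp TIMESTAMP,
            operation STRING,
            operationMetrics MAP<STRING, STRING>
        ) USING DELTA
    """)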

  • @krishnamurthy9720
    @krishnamurthy9720 3 years ago +1

    Raj, where does this Delta table definition reside?

    • @rajasdataengineering7585
      @rajasdataengineering7585  3 years ago

      Hi Krishna, to get the DDL script of a delta table you can use the command below:
      %sql
      SHOW CREATE TABLE <table_name>

    • @krishnamurthy9720
      @krishnamurthy9720 3 years ago +1

      @@rajasdataengineering7585 Will it show us the location of the Delta table?

    • @rajasdataengineering7585
      @rajasdataengineering7585  3 years ago

      Yes, it gives the location as well in the CREATE DDL script.

    • @Learn2Share786
      @Learn2Share786 3 years ago

      @@rajasdataengineering7585 Does that mean the delta table schema metadata is stored on Databricks, i.e. DBFS?
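
      A minimal sketch relevant to the metadata question: DESCRIBE DETAIL reports where a delta table's data lives, and the schema itself sits in the _delta_log under that location; the table name is illustrative:

      # Location, format and creation time of the table's delta files
      detail = spark.sql("DESCRIBE DETAIL target_table")
      detail.select("format", "location", "createdAt").show(truncate=False)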

  • @sravankumar1767
    @sravankumar1767 3 years ago +1

    Can you please start covering real-time scenarios in PySpark?

    • @rajasdataengineering7585
      @rajasdataengineering7585  3 years ago +1

      Sure, Sravan. If you are looking for any particular real-time scenario, please share it with me and I will create a video. Thank you.

    • @sravankumar1767
      @sravankumar1767 3 years ago +3

      @@rajasdataengineering7585 I am actually attending interviews. I don't have experience with ADB, only ADF, and without ADB I am not being considered. They ask about ADB in great depth, and I can answer just from watching your videos, but when they ask real-time scenarios I get confused.

    • @rajasdataengineering7585
      @rajasdataengineering7585  3 years ago +1

      I will help you with real-time scenarios.

    • @prabhatgupta6415
      @prabhatgupta6415 1 year ago

      @@rajasdataengineering7585 Yes, Raja sir, please!

  • @shwetac2929
    @shwetac2929 1 year ago

    I want this notebook. Can you share it with us?

  • @seshaiahambati1798
    @seshaiahambati1798 8 months ago

    May I have the Delta history explode function video?

  • @niteshsoni2282
    @niteshsoni2282 1 year ago +2

    Bro, please change the BGM... it's very irritating.