Bro, bring more real-time interview questions like these. Thank you so much!
Thanks for sharing 👍, very informative
Great... fortunate to be your subscriber.
You are doing excellent work. It helps a lot!!
One of the best explanations. Bro, please make more videos on PySpark.
Very good, detailed explanation. Thanks for your efforts, keep it up.
Thanks Man. This was some detailed explanation. Kudos
You're welcome 👍
Awesome video... Cleared my doubts 👍👍👍
Very clean explanation, thank you sir.
Awesome video.
Could you please share the notebook? It would really help.
Nice explanation. Please attach the CSV or JSON file in the description for practice.
Great, thank you.
Can you please explain how Spark filtered those 2 columns as bad data? I don't see any where condition mentioned for the corrupt column.
Nice explanation 👌 thanks
Nice one
Please upload all the PySpark interview question videos.
Please share a basic big data video.
Please share the notebook in .dbc format.
Seems querying _corrupt_record is not working. I tried it today and it's not allowing me to query by that column name: cust_df.filter("_corrupt_record is not null") throws:

AnalysisException: Since Spark 2.3, the queries from raw JSON/CSV files are disallowed when the referenced columns only include the internal corrupt record column (named _corrupt_record by default). For example: spark.read.schema(schema).csv(file).filter($"_corrupt_record".isNotNull).count() and spark.read.schema(schema).csv(file).select("_corrupt_record").show(). Instead, you can cache or save the parsed results and then send the same query. For example, val df = spark.read.schema(schema).csv(file).cache() and then df.filter($"_corrupt_record".isNotNull).count().
cust_df.cache()
Cache the DataFrame and it won't raise the exception.
@TRRaveendra Yes I did, but even after that it still doesn't allow a query on _corrupt_record is null or is not null.
Seems badRecordsPath is the only solution.
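For anyone hitting the same wall, here is a minimal sketch of the badRecordsPath approach (a Databricks-specific reader option; the output path below is just an example, and `spark` is the notebook's SparkSession):

# DDL-style schema; no _corrupt_record column is needed because bad rows go to badRecordsPath instead
schema = "cust_id INT, cust_name STRING, manager STRING, city STRING, phno BIGINT"

clean_df = (spark.read
            .option("header", True)
            .option("badRecordsPath", "dbfs:/tmp/bad_records")   # example path; rejected rows are written here
            .schema(schema)
            .csv("dbfs:/FileStore/tables/csv_with_bad_records.csv"))

clean_df.show()   # only the rows that matched the schema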
Woah, what an explanation!
Are any such mode options available while reading Parquet files?
cust_df.select("_corrupt_record").show() works, but it doesn't allow is null or is not null, e.g. cust_df.select("_corrupt_record is null").show() fails. Let me know if this is working for you. Thank you.
Why do we write inferSchema=True?
inferSchema=True creates the column data types based on the data.
header=True creates the column names from the file's first line.
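A quick illustration of both options (the file path is just an example; `spark` is the notebook's SparkSession):

cust_df = (spark.read
           .option("header", True)        # first line of the CSV becomes the column names
           .option("inferSchema", True)   # Spark scans the data and picks a type for each column
           .csv("dbfs:/FileStore/tables/csv_with_bad_records.csv"))

cust_df.printSchema()   # e.g. cust_id comes out as integer, phno as long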
Hi, please share your contact details. I am looking for Python, PySpark, and Databricks training.
root
|-- cust_id: integer (nullable = true)
|-- cust_name: string (nullable = true)
|-- manager: string (nullable = true)
|-- city: string (nullable = true)
|-- phno: long (nullable = true)
|-- _corrupt_record: string (nullable = true)

display(cust_df.filter("_corrupt_record is not null"))

FileReadException: Error while reading file dbfs:/FileStore/tables/csv_with_bad_records.csv.
Caused by: IllegalArgumentException: _corrupt_record does not exist. Available: cust_id, cust_name, manager, city, phno
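What worked for me on that error, as a sketch (same file path and column names as above, `spark` being the notebook's SparkSession): declare _corrupt_record explicitly in the user schema and cache the parsed DataFrame before filtering on it, which is also what the AnalysisException message suggests.

from pyspark.sql.types import StructType, StructField, IntegerType, LongType, StringType

# _corrupt_record has to be declared in the schema yourself, otherwise the reader
# reports "IllegalArgumentException: _corrupt_record does not exist"
schema = StructType([
    StructField("cust_id", IntegerType(), True),
    StructField("cust_name", StringType(), True),
    StructField("manager", StringType(), True),
    StructField("city", StringType(), True),
    StructField("phno", LongType(), True),
    StructField("_corrupt_record", StringType(), True),
])

cust_df = (spark.read
           .option("header", True)
           .schema(schema)
           .csv("dbfs:/FileStore/tables/csv_with_bad_records.csv")
           .cache())   # cache first, then filtering on only _corrupt_record is allowed

cust_df.filter("_corrupt_record is not null").show(truncate=False)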