5. kpmg pyspark interview question & answer | databricks scenario based interview question & answer

Поделиться
HTML-код
  • Опубликовано: 5 янв 2025

Комментарии • 22

  • @roshnisingh7661
    @roshnisingh7661 11 месяцев назад +1

    Thanks SS Unitech, your videos are very easy to understand 😊

  • @AvinashKasyap-j8o
    @AvinashKasyap-j8o 2 месяца назад +1

    Good job brother 👏

  • @teamdeepugaming2708
    @teamdeepugaming2708 Месяц назад

    The second parameter in to_date() supplied is diferrent from the output. you passed dd-MM-yy but displayed as yyyy-MM-dd.

  • @pranaykiran1780
    @pranaykiran1780 4 месяца назад +1

    The second argument in the to_date function takes the format of how the first argument is. Please correct it

  • @subaramanujam4997
    @subaramanujam4997 Месяц назад

    thanks so much👏

  • @Sujiga
    @Sujiga Месяц назад

    I am getting nonetype object has no attribute 'alias'....How to slove this error

  • @amritasingh1769
    @amritasingh1769 11 месяцев назад +1

    Super video

  • @nikhilasrirama3829
    @nikhilasrirama3829 11 месяцев назад +1

    Thanks susheel

    • @ssunitech6890
      @ssunitech6890  11 месяцев назад

      Thanks.
      Keep learning and sharing

  • @a2zhi976
    @a2zhi976 10 месяцев назад +1

    how to control variable value in dev and prod different values, can you please explain

    • @ssunitech6890
      @ssunitech6890  10 месяцев назад

      You can create one table in any environment or any file which keeps the values of dev, qa and prod, which accessing this table you can use lookup activities and store into variable in adf pipeline

  • @isharkpraveen
    @isharkpraveen 3 месяца назад +1

    iam get null in Newsaldt column

    • @ssunitech6890
      @ssunitech6890  3 месяца назад

      These should be some issue with your formula

  • @sivaani37
    @sivaani37 11 месяцев назад +1

    What about the 7th highest salary ?

    • @ssunitech6890
      @ssunitech6890  11 месяцев назад +1

      Already recorded one video on hiw to get top n salary on each department please this video
      ruclips.net/video/HnN-J8_u2Tc/видео.html

    • @sivaani37
      @sivaani37 11 месяцев назад

      But the link you shared is based on a window. What if I have to find the seventh highest salary irrespective of the dept?

    • @rajakumaranr2998
      @rajakumaranr2998 5 месяцев назад +2

      @@sivaani37 you can use row number without partitioning
      w = Window().orderBy(salary.asc()))
      df1 = df.withColumn("row_num", row_number().over(w))
      df1.filter(row_num==7)
      In that video , he used only 4 records, so it's not possible to show the 7th highest salary

  • @barmalini
    @barmalini 5 месяцев назад

    Thank you sir, this is very useful.
    I did not quite understand the purpose of joining a dataframe with itself, this part: df1.alias('a').join(df1.alias('b')
    Could someone explain?

    • @rajakumaranr2998
      @rajakumaranr2998 5 месяцев назад

      To find the MgrName.
      if you see the Note: MgrId is EmpId of employee,
      In that case Mgrname would be EmpId of Empname. for eg : MgrId is 100 mean then MgrName would be "raj"