08. Databricks | Pyspark: Add, Rename and Drop Columns

Поделиться
HTML-код
  • Опубликовано: 3 фев 2025

Комментарии • 26

  • @sowjanyagvs7780
    @sowjanyagvs7780 4 месяца назад

    What is Lit() : whenever we want to add a constant literal value to entire data frame, then we go with LIT(). we can also add these values only to certain records using when and otherwise. Eg: EMPDF = df.withcolname("Bonus",when(df.sal>50k, lit(sal*10)).otherwise(lit(sal*20))....Thanks for the amazing session Raj sir

  • @Jaipreksha
    @Jaipreksha Год назад +1

    Thank you Raja. Very nice video

  • @gulsahtanay2341
    @gulsahtanay2341 11 месяцев назад +1

    Great content

  • @anthony281190
    @anthony281190 2 года назад +2

    Excelent, From Dominican REpublic

  • @bonasiuday
    @bonasiuday 2 года назад

    Very nice explanation 👏

  • @tusharagarwal3553
    @tusharagarwal3553 2 месяца назад

    Where can we get these notebook..please share link for all the notebook.. so that we can revise

  • @kartikeshsaurkar4353
    @kartikeshsaurkar4353 3 года назад +1

    Thank you Raja. Keep it up.
    Please zoom a little bit or increase the font size in lab sessions to reduce the eye strain

  • @Ravishekar-qd4fc
    @Ravishekar-qd4fc 8 месяцев назад

    I have to modify the column values of existing record like location is India that alter to AUS. How to handle

  • @sravankumar1767
    @sravankumar1767 3 года назад +1

    superb bro.........

  • @lakshminarayana3168
    @lakshminarayana3168 Год назад

    hi Raja can you share the list of content uploaded to this playlist

  • @HPan-y2v
    @HPan-y2v Год назад

    Two things to add over here :
    1. If we have a column name that doesn't exist in the spark dataframe, then drop will just ignore it and not throw any error: df.drop("XYZ"). printSchema() won't throw any error in spark 3.0
    2. If you want to drop multiple columns , columns should be passed ONLY as strings :
    df.drop('firstname', 'lastname'). printSchema()
    If we will pass them as column then there would be an error. The below code would run into an error:
    df.drop(col('firstname'), col('lastname')). printSchema()

    • @supriyakoura7755
      @supriyakoura7755 6 месяцев назад

      not for me
      emp_DF.drop(col('gender'),col('salary')).show()

  • @mohammadumair7521
    @mohammadumair7521 Год назад

    will you upload all the notebook in you google drive and provide us the link of the notebook it will be the greater help of all I guess.

  • @unbeatableinsaan4926
    @unbeatableinsaan4926 3 года назад +1

    Please make whole basics playlist by topic wise

  • @GurudasKundukar
    @GurudasKundukar 2 года назад +1

    how to add identity coumn

  • @agamgupta-ed9ye
    @agamgupta-ed9ye Год назад

    Hi Raja,
    can you kindly share data sets with us so that we can practice also

  • @srinivasarao416
    @srinivasarao416 2 года назад

    Hi Sir, I am kindly requesting could you please share notebooks

  • @pavanrocks9960
    @pavanrocks9960 2 года назад

    Hi raja can you kindly share data sets and codes for better practice

  • @patnaik476
    @patnaik476 Год назад

    Very nice playlist. I have only one suggestion. Please don't use the ear deafening music in the start. Its very very loud and annoys everyone around.

    • @rajasdataengineering7585
      @rajasdataengineering7585  Год назад +1

      Agreed. I have already changed this music in later videos. Thanks for your suggestion