what is data warehouse | Lec-1

  • Published: Jan 6, 2025

Comments • 43

  • @hritikapal683
    @hritikapal683 1 year ago +7

    The Spark series was great content, thank you for that. I'm really excited to follow along with the DW series!

  • @sankuM
    @sankuM 1 year ago +4

    10:32 Hey @manish_kumar_1, I feel there shouldn't be any comparison between DW and Spark because the two are very different things. A DW stores processed data, while Spark processes the data that lands in the DW. Don't you think so? BTW, what was the work that kept you away for a month?

    • @manish_kumar_1
      @manish_kumar_1 1 year ago +3

      You are correct. I compared them only because some people may wonder: if parallel processing is available and the data is stored in a columnar file format, why can't we just use Spark? That was the motivation behind comparing the two.
      I was out of station for work.

    • @sankuM
      @sankuM 1 year ago

      Got it... 🙌🙌 @manish_kumar_1

  • @rakeshverma6867
    @rakeshverma6867 1 year ago +1

    Very good and easy-to-understand content. Keep it up, Manish.

  • @pankajrao6895
    @pankajrao6895 1 year ago +1

    Wow, hats off to you, guruji

  • @shananwar3199
    @shananwar3199 1 year ago

    Good one.

  • @vijayvhanale8749
    @vijayvhanale8749 1 year ago

    Very well explained ❤

  • @sunitamaurya5893
    @sunitamaurya5893 1 year ago

    Manish ji, I really love your content. I am following your Azure series. I really appreciate the way you explained everything to all of us in such easy language. A lot of topics had been unclear to me for a long time, but after joining your channel they have become clear in just the right way. I really wanted to connect with you. Thanks a lot for doing this for us. You also motivate us. It's not that easy, but with a good mentor the journey becomes joyful, and that is what you are making it for us. :)

  • @rawat7203
    @rawat7203 1 year ago

    Great to see you after a long gap

  • @rishav144
    @rishav144 1 year ago +1

    Very well explained. Please upload videos regularly.

  • @DataAnalystt
    @DataAnalystt 1 year ago

    Thank you

  • @webdeveloper-q1i
    @webdeveloper-q1i 1 year ago +1

    Commenting first, as the first viewer, just to say your work is awesome!!!

    • @webdeveloper-q1i
      @webdeveloper-q1i 1 year ago +1

      I have talked to you over LinkedIn a few times, and you are as nice a person as you are a teacher.

    • @manish_kumar_1
      @manish_kumar_1 1 year ago

      Thank you so much 😀

  • @karthikkumar432
    @karthikkumar432 1 year ago

    Finally you are back after a long gap.

  • @raviyadav-dt1tb
    @raviyadav-dt1tb 11 months ago

    Do we need to learn data warehousing in spite of knowing Spark SQL, Scala, and Hive? Please suggest.

  • @mayankpatni5639
    @mayankpatni5639 1 year ago

    Which is better, data analyst or data engineer? Who earns more and has a better future? And can someone from a non-IT background become a data engineer as a fresher?

  • @Aman-lv2ee
    @Aman-lv2ee 8 months ago +1

    In today's scenario, Snowflake has all the AI/ML capabilities, streaming, and even handles all types of data, so there is very little difference from Spark now.

  • @prabhatgupta6415
    @prabhatgupta6415 1 year ago +1

    Bring something on Delta Lake

  • @pmmodi3583
    @pmmodi3583 1 year ago

  • @blutoo1363
    @blutoo1363 1 year ago

    Hello brother! Love your content. I was going through your Spark playlist and I have a question. I have 2 CSV files that I have stored partitioned on a key. Both files are partitioned on the same key or key combination (the values are the same, although the actual column names could differ). Now I want to join the two by reading them with spark.read, creating temp views, and running basic SQL against the temp views. Is there a way to ensure that the partitions with the same key from both files are stored on the same node when Spark reads them, to increase join speed?

  • @mohitmanna7308
    @mohitmanna7308 1 year ago +1

    Why do we need a data warehouse when the lakehouse architecture is there?

  • @navjotsingh-hl1jg
    @navjotsingh-hl1jg 1 year ago

    Very good, Manish bhai. Bhai, when will you upload the next video?

  • @gauravsingh-gn4zz
    @gauravsingh-gn4zz 1 year ago

    The data is updated between 12 and 2 pm. How do we avoid duplicates while writing to the data warehouse?

  • @zahidalam7831
    @zahidalam7831 18 days ago

    Is your data warehouse series enough to crack an Azure data engineering data warehouse interview?

  • @arunmalali7768
    @arunmalali7768 1 year ago

    Brother, I have a doubt. I am in a tier-2 college. I want to pursue data engineering, but do big tech companies hire data engineers who are just beginners, or do they prefer people with a master's degree and experience? After competitive coding, should I follow a data engineering roadmap or go into development? Please reply, bhaiya ❤

    • @gauravsingh-gn4zz
      @gauravsingh-gn4zz 1 year ago

      After one year you can move into the data engineering domain. As a fresher it is hard, but not impossible.

    • @manish_kumar_1
      @manish_kumar_1 1 year ago

      Yes, keep learning all the skills required for DE and you will get a role. There is no need to go for a master's; just keep in mind that there may be fewer openings for beginners.

  • @shreemanthmamatavbal7468
    @shreemanthmamatavbal7468 1 year ago

    Are the videos uploaded daily or not?

  • @raghavendrakulkarni3920
    @raghavendrakulkarni3920 1 year ago

    You took a long time; we have been waiting for a month

  • @VishalSharma-lz6ky
    @VishalSharma-lz6ky 7 months ago

    Why are you comparing a data warehouse to Spark?
    Spark is a different thing.
    This is awkward; here you are talking about Spark.

    • @manish_kumar_1
      @manish_kumar_1 7 months ago +1

      I think you don't want to understand the similarities and differences between these two tech stacks. I don't find anything wrong here.
      As I said, both use distributed computing to solve business use cases.
      There are multiple points on which we can compare the two. Even architects do the same thing before designing a solution, to decide which will serve better.

    • @VishalSharma-lz6ky
      @VishalSharma-lz6ky 7 months ago

      @manish_kumar_1 Yes, you are right, both use a distributed computing framework.
      My question is how we can use Spark as a data warehouse solution.
      Spark is a general-purpose in-memory compute engine.
      But we can't use a data warehouse on top of Spark.

  • @ruchidahiya4260
    @ruchidahiya4260 9 months ago

    Thank you @manish_kumar_1 for such a simplified explanation 🙂