The four levels of data engineering!

  • Published: Feb 4, 2025

Comments • 87

  • @The-KP
    @The-KP 10 months ago +44

    Level 4 is really an attribute of Level 1, 2 and 3. If you're not communicating with stakeholders, even when just being assigned a ticket, you're not being a team player.

    • @pbxmy4521
      @pbxmy4521 2 months ago +2

      I've learned this lesson the hard way by spending months developing solutions only for them to never be used. Get context first before you develop was my takeaway.

  • @fantsepants1747
    @fantsepants1747 10 months ago +187

    How does one practice beyond level 1? It feels impossible to get beyond level 1 unless you're a level 1 in a company that also deals beyond that.

    • @elOtorongo96
      @elOtorongo96 10 months ago +17

      broooo I know, I was wondering the same, like: "damn I'm level 1 then". How do I step up, switching jobs?

    • @khado9793
      @khado9793 4 months ago +10

      @elOtorongo96 You can set up a single-node Hadoop cluster on your local machine, do a project using big data technology, then leverage that to get into a company.

    • @Bobthetomado
      @Bobthetomado 3 months ago

      @khado9793 TY King

    • @pbxmy4521
      @pbxmy4521 2 months ago

      @elOtorongo96 Start at level 4. Start developing connections and relationships with stakeholders, make an effort to understand the business context behind decisions, and work on developing solid architecture. You fill the gaps with tools made available to you by your employer. You can do a lot with Python and SQL. There is no need for distributed compute if you are not at a scale to leverage it. Most companies will never reach that scale.
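
To illustrate the point about how far plain Python and SQL can go without distributed compute, here is a minimal sketch using only Python's built-in `sqlite3` module. The `orders` table, its columns, and the sample rows are hypothetical and chosen purely for illustration; they do not come from the video or the thread.

```python
import sqlite3

# Hypothetical sample data: (order_id, customer, amount)
ORDERS = [
    (1, "acme", 120.0),
    (2, "acme", 80.0),
    (3, "globex", 50.0),
]


def revenue_by_customer(rows):
    """Load rows into an in-memory SQLite table and aggregate them with SQL."""
    conn = sqlite3.connect(":memory:")
    try:
        conn.execute(
            "CREATE TABLE orders ("
            "order_id INTEGER PRIMARY KEY, customer TEXT, amount REAL)"
        )
        conn.executemany("INSERT INTO orders VALUES (?, ?, ?)", rows)
        cursor = conn.execute(
            "SELECT customer, SUM(amount) FROM orders "
            "GROUP BY customer ORDER BY customer"
        )
        return cursor.fetchall()
    finally:
        conn.close()


print(revenue_by_customer(ORDERS))
```

Everything here runs on a single machine; whether that is enough depends on data volume, which is the trade-off the comment describes.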

    • @AryanPatel-wb5tp
      @AryanPatel-wb5tp 2 months ago +2

      AWS gives a free trial. Use the free trial and make a project.

  • @deafmute2501
    @deafmute2501 10 months ago +41

    Communication is tough. A couple of days ago, a stakeholder wanted me to explain what was going on with a data issue using train car and passenger analogies.

    • @pluto8404
      @pluto8404 9 months ago +6

      I am going to have to start requesting that people explain themselves using analogies of train cars and people. "explain to me stack overflow using trains and people", "explain to me the teachings of Adam Smith using trains and people", "what do you mean the ice cream machine is broken, explain that to me as if the ice cream were people and the cones were train cars"

    • @riomouris4767
      @riomouris4767 9 months ago +2

      That's like Michael Scott's 'explain it to me like I'm 5 years old'

    • @getbetterben
      @getbetterben 6 months ago

      @pluto8404 Imagine an ice cream train station where flavors are passengers and cones are train cars. The main people-mover (ice cream machine) has broken down! The conveyor belt that helps ice cream board is stuck. Some flavors try climbing into cones themselves, while others are stranded. Staff scramble to assist, but it's slow and messy. The station manager urgently calls for repairs as ice cream passengers risk melting. Human customers watch helplessly, hoping their favorite flavors will somehow make the journey. This breakdown has thrown the whole sweet transportation system into chaos, leaving everyone in a frustratingly sticky situation!

  • @derekoreborn
    @derekoreborn 10 months ago +24

    I’m a bachelor learning SQL. I want to date-a-model.

  • @za1ruc
    @za1ruc 10 months ago +97

    Date-a-model 😂

    • @adityavikram36
      @adityavikram36 10 months ago

      😂😂

    • @themichaelw
      @themichaelw 9 months ago +8

      I, too, would like to acquire this skill.

    • @SustainaBIT
      @SustainaBIT 8 months ago +1

      Your imagination is scary good 😂😂

  • @shubhanjandash5017
    @shubhanjandash5017 10 months ago +2

    Hey dude, been following you for a long time on LinkedIn. Glad you're making video content for us. Bless you!

  • @supercompooper
    @supercompooper 10 months ago +25

    My data modeling should appear on the cover of Vogue magazine.

  • @ZAGoL_Channel
    @ZAGoL_Channel 10 months ago +3

    Thanks, Zach! I love your tips.

  • @chrisgarty
    @chrisgarty 10 months ago +12

    Did not see that plot twist coming: wizard level 4 is communication 😆

  • @bouzie8000
    @bouzie8000 10 months ago +1

    I’m a software engineer but I’m tryna get like you. I love data so much

  • @SM-vz1ek
    @SM-vz1ek 2 months ago +3

    Data modeling was all fun and games until I discovered your free bootcamp 😂.. thank you very much 🎉

  • @prime8krish
    @prime8krish 8 months ago +2

    Very well said. Talking to stakeholders before building anything should be the no. 1 job of any engineer.

  • @oriarsenal
    @oriarsenal 21 hours ago

    Lmao level 4 was gold. I’m a data analyst myself and that actually made me laugh out loud

  • @Alex_1729
    @Alex_1729 10 months ago +1

    Can you give some pointers when moving from lvl1 to lvl2? I am self-employed. Love the content 👍

  • @Geoff_the_Chum
    @Geoff_the_Chum 10 months ago +9

    Unknowingly at level 2 because my job forces me to acquire skills to keep up 😂 now how do I learn more about.

  • @GambillDataEngineering
    @GambillDataEngineering 1 month ago

    Level 4 is the most important!! 🎉❤

  • @Gaby000999
    @Gaby000999 2 months ago +1

    So I am a mix of 1 and 4 because the company I work in is still on a very basic level, but going to levels 2 and 3 so I can switch into a more senior level seems impossible.😅😅

  • @AbsolutelyNOW
    @AbsolutelyNOW 1 day ago

    Wowww this is excellent info. Thanks a lot.

  • @patientson
    @patientson 2 months ago +1

    Python (essentials, defensive, forensics, and offensive), Intro to Snowflake, mastery-level Snowflake distributed compute, and Data Modelling (fact and dimension tables - numbers + letters; star & snowflake schemas).... operational and analytical needs. Finally, permission from stakeholders to carry out what you do best.
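
For readers unfamiliar with the fact and dimension tables mentioned above, here is a minimal star-schema sketch using Python's built-in `sqlite3` module. The table names (`dim_customer`, `dim_date`, `fact_sales`), columns, and sample rows are hypothetical and only meant to show the shape of the idea: numeric measures live in the fact table, descriptive attributes live in the dimension tables.

```python
import sqlite3


def build_star_schema():
    """Create a tiny in-memory star schema with hypothetical sample rows."""
    conn = sqlite3.connect(":memory:")
    conn.executescript(
        """
        -- Dimension tables hold descriptive attributes ("letters").
        CREATE TABLE dim_customer (
            customer_id INTEGER PRIMARY KEY, name TEXT, region TEXT
        );
        CREATE TABLE dim_date (
            date_id INTEGER PRIMARY KEY, iso_date TEXT
        );
        -- The fact table holds measures ("numbers") plus keys to dimensions.
        CREATE TABLE fact_sales (
            sale_id INTEGER PRIMARY KEY,
            customer_id INTEGER REFERENCES dim_customer(customer_id),
            date_id INTEGER REFERENCES dim_date(date_id),
            amount REAL
        );
        INSERT INTO dim_customer VALUES (1, 'Acme', 'EU'), (2, 'Globex', 'US');
        INSERT INTO dim_date VALUES
            (20250101, '2025-01-01'), (20250102, '2025-01-02');
        INSERT INTO fact_sales VALUES
            (1, 1, 20250101, 100.0),
            (2, 1, 20250102, 50.0),
            (3, 2, 20250101, 75.0);
        """
    )
    return conn


def sales_by_region(conn):
    """Join the fact table to a dimension and aggregate the measure."""
    return conn.execute(
        "SELECT c.region, SUM(f.amount) "
        "FROM fact_sales AS f "
        "JOIN dim_customer AS c ON c.customer_id = f.customer_id "
        "GROUP BY c.region ORDER BY c.region"
    ).fetchall()


conn = build_star_schema()
print(sales_by_region(conn))
```

A snowflake schema would further normalize the dimensions, for example by moving `region` into its own table referenced from `dim_customer`.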

  • @dennisirorere
    @dennisirorere 8 months ago +3

    The scope of data engineering is much broader than the categories you've mentioned.
    There are different types of data engineering: business-facing and platform-facing.
    Business-facing data engineers interact with stakeholders, gather requirements, and focus on how data can drive impact within the organization.
    Platform-facing data engineers handle systems, source data from various external sources, and ensure that business data engineers have the data they need every day.
    Some organizations also have a data enablement team that provides data tools across the company.
    While technical depth varies, it's not a one-size-fits-all situation.
    Great video, by the way.

  • @jpucaval
    @jpucaval 9 months ago

    I'm getting into 2 but making the big jump to 4 at the same time

  • @johnnycastillo9139
    @johnnycastillo9139 10 months ago +3

    What do you recommend to learn distributed compute?

    • @EcZachly_
      @EcZachly_ 10 months ago +6

      Designing Data-Intensive Applications is a good book

  • @Sindoku
    @Sindoku 6 months ago

    There is also me: a senior dev with mostly FED experience, who knows the basics of BE application dev (Node/Python/Go), but won't risk swapping to a BE job because I'm worried that I'll suck at data engineering and will end up running suboptimal queries along with many other mistakes. I can definitely handle API creation and SQL/NoSQL as long as the ORM takes care of optimizing the more advanced queries for me, but to me, a decent BE dev is at least level 2 with data engineering, granted in reality I think most BE devs are actually level 1.

  • @samsspam1524
    @samsspam1524 10 months ago +7

    I’m proud of myself. I know some of these words!

  • @zahidc2838
    @zahidc2838 9 months ago +1

    Thanks, this was useful in gauging my own abilities.

  • @Ak47Hangu
    @Ak47Hangu 9 months ago +1

    You should bring your course and experience to YouTube. It would be something new, and people would get more information from a person who has already worked at all levels.

  • @caceresmauro9767
    @caceresmauro9767 7 months ago +1

    this is good info, thanks man

  • @AaneiMarco
    @AaneiMarco 10 months ago +1

    I'm starting at level 1 : Python and SQL

  • @PrashantDwivedi121
    @PrashantDwivedi121 10 months ago +1

    Thanks for the info

  • @rembautimes8808
    @rembautimes8808 10 months ago +2

    Very funny 😂. Can you share what the challenge is in writing a PB pipeline?

  • @JustinLietz
    @JustinLietz 10 months ago +2

    I’m an intern barely at level 1 😂😂

  • @Shwill
    @Shwill 10 months ago +3

    Hey Zach, hoping to get your input here - so I know both SQL and Python. My team does all their modeling with PySpark. I use both (mainly SQL tho): SQL for data transformations, and PySpark only for cleaning up unstructured data and writing to storage accounts. Our only job is to model / create tables for analysts and the business to use. They're pressuring me to only do PySpark because they don't understand SQL well. I feel like they have it all backwards and more so just don't want to learn SQL. What do you think? Am I in the wrong here?

    • @Shwill
      @Shwill 10 months ago +3

      For more context, we're doing all of our work out of Databricks Azure. The majority of source formats we tap into for modeling are Delta, Parquet, XLSX and CSV - NO API calls

    • @EcZachly_
      @EcZachly_ 10 months ago +9

      You're right. Keeping more in SQL makes sense. Facebook DE said, "try your very hardest to use SQL, only use PySpark if you can't express what you need in SQL"

    • @Shwill
      @Shwill 10 months ago

      @EcZachly_ awesome, thanks for the reply!

  • @NatureAndMyself
    @NatureAndMyself 8 months ago +1

    I am at level 2 but my stakeholders want me to be on level 1 🙃 communication is hard 😶

  • @nachobek
    @nachobek 9 months ago +1

    Good to know I’m level 1 and 4 👌

  • @_stokyo_
    @_stokyo_ 10 months ago

    Level 5 is flexing a sick wardrobe of flamboyant hoodies whilst remaining calm, confident and collected executing levels 1-4

  • @caraziegel7652
    @caraziegel7652 2 months ago +1

    Does this mean that smaller companies that don't use distributed compute don't have anything past level 1?

  • @f_r_a.n_c_o
    @f_r_a.n_c_o 6 months ago

    I'm level 1. Where are the best opportunities for finding mentorships and even landing entry level positions?

  • @praveennaik62
    @praveennaik62 3 months ago +2

    Dude you spoke like
    Civilization 1
    Civilization 2
    .
    Civilization 4 😂😂
    I am trying to figure out where I am...
    On a good day in civ 2 but on a depressing day civ 0-1

  • @TerrenceLP
    @TerrenceLP 8 months ago +1

    Level 4 🎉😊 true master

  • @hanifmckagan4448
    @hanifmckagan4448 10 months ago +1

    damn stem people are amazing

  • @AnudeepKolluri
    @AnudeepKolluri 1 month ago +1

    Level 2

  • @obtbe
    @obtbe 3 months ago

    Hey Zach. Are data engineering certifications really worth taking?

  • @KRISHNAKUMAR-yk3tt
    @KRISHNAKUMAR-yk3tt 9 months ago

    Can you make a comparison of when to go for Teradata vs Snowflake?

  • @db1jdm
    @db1jdm 3 months ago +1

    Is it advisable to begin as a data analyst?

  • @ThatGuy_Nick
    @ThatGuy_Nick 8 months ago +1

    Any books on this?

  • @ilovetensor
    @ilovetensor 8 months ago +1

    To what level should a college fresher learn to get a first job?

  • @taranjitlotey
    @taranjitlotey 5 months ago

    By modelling, do you mean machine learning models?

  • @Applications069
    @Applications069 3 months ago +1

    Hey bro, I'm getting a path problem while installing Kafka. I have done everything and watched the YouTube videos, but it's not getting solved. Can you help me run Kafka?

    • @EcZachly_
      @EcZachly_ 3 months ago

      Just use Confluent, bro

  • @umair0119
    @umair0119 10 months ago +1

    ha, he said 'Date a model' :)

  • @Alex_1729
    @Alex_1729 10 months ago +1

    If a smurf dies and no one can hear it, does it still scream?

    • @EcZachly_
      @EcZachly_ 10 months ago

      This takes the cake as the strangest comment I've received in 2024

    • @Alex_1729
      @Alex_1729 10 months ago +1

      @EcZachly_ So you DO read the comments... You just don't reply to the honest questions but to the strange ones

    • @EcZachly_
      @EcZachly_ 10 months ago

      @Alex_1729 I'm broken in the head. My ADHD only finds dopamine in peculiarity

    • @Alex_1729
      @Alex_1729 10 months ago +1

      @EcZachly_ It's easy to hide behind words... You enjoy your day.

  • @Karma-xz4si
    @Karma-xz4si 2 months ago +1

    Did I skip level 3 🥹

  • @hiiifidelity
    @hiiifidelity 2 months ago +1

    mans said petabyte, yea im at level 0

  • @redghost433
    @redghost433 2 months ago +1

    Did he say petabytes a day😱

  • @ranganathan223
    @ranganathan223 9 months ago +1

    💯

  • @Testing567
    @Testing567 10 months ago +1

    the last one xD

  • @aliencommander
    @aliencommander 9 months ago +1

    damn skipped level 2 and 3 apparently.

  • @rehabcityman
    @rehabcityman 10 months ago +1

    Very good. I see cat hair on your mic though

    • @EcZachly_
      @EcZachly_ 10 months ago

      Dog hair

    • @rehabcityman
      @rehabcityman 10 months ago +1

      @EcZachly_ 😄👍🏻

  • @davidk7212
    @davidk7212 1 month ago

    99% of data engineers don't work with petabyte or even terabyte pipelines. And even lvl 1 needs to talk to stakeholders in smaller companies.

  • @TheSergWolf
    @TheSergWolf 3 months ago +1

    Two + 5level = 7. I want 700k
    ))))

  • @Alex_1729
    @Alex_1729 10 months ago +1

    Nothing worse than a youtuber who ♥ every single comment but never replies to anything...

  • @climentea
    @climentea 4 months ago

    Guess I'm level 0 :))

  • @Storytelling_Central
    @Storytelling_Central 10 months ago +1

    There's just 1 level of "I don't care"

  • @bodaddy6771
    @bodaddy6771 25 days ago

    I also do a lot of pipeline modeling... am I a data engineer?