Building ETL Pipelines Using Cloud Dataflow in GCP

  • Published: 11 Dec 2024

Comments • 42

  • @rrafaelpaz
    @rrafaelpaz 1 year ago +4

    Very nice mate! Very well explained! Cheers from Brazil brotha!

  • @JS-kj1rc
    @JS-kj1rc 2 months ago +1

    Very helpful. Thanks

  • @venkatvlogs07
    @venkatvlogs07 8 months ago +9

    Too rushed. I couldn't follow because you keep switching tabs and doing everything without mentioning where you are writing the code. The course should be designed so that even a beginner can understand it. Please make a step-by-step explanation video so that everyone can follow. Thanks in advance ❤

  • @chaithuchinna94
    @chaithuchinna94 11 months ago +1

    Is there a course available to learn GCP, sir? If so, please share the details.

    • @cloudaianalytics6242
      @cloudaianalytics6242  11 months ago

      Course link: www.udemy.com/course/gcp-professional-dataengineer-certification-a-complete-guide
      Reach out for a coupon code: www.linkedin.com/in/vignesh-sekar-sujatha-02aa9b125/

  • @pournimaambikar5857
    @pournimaambikar5857 9 months ago

    I am getting the error below while trying to run a Dataflow job:
    import apache_beam as beam
    ModuleNotFoundError: No module named 'apache_beam'
    This happens in both the Cloud SDK and Cloud Shell, even though apache_beam is installed.

    • @RajDas-uy2ro
      @RajDas-uy2ro 9 months ago

      pip install apache-beam[gcp]

    • @cloudaianalytics6242
      @cloudaianalytics6242  9 months ago

      Run pip install apache-beam[gcp], or try creating a virtual environment in Cloud Shell, installing Apache Beam there, and running your Dataflow jobs from it.
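
A common cause of this error is that pip installed the package into a different Python interpreter than the one running the pipeline script. The sketch below is a small diagnostic using only the standard library; the helper names `module_available` and `install_command` are hypothetical and not part of Apache Beam.

```python
import importlib.util
import sys


def module_available(name: str) -> bool:
    """Return True if `name` can be imported by the interpreter running this script."""
    return importlib.util.find_spec(name) is not None


def install_command(pip_spec: str) -> str:
    """Build a pip command bound to the current interpreter, to avoid PATH mix-ups."""
    return f'"{sys.executable}" -m pip install "{pip_spec}"'


if __name__ == "__main__":
    print("Interpreter:", sys.executable)
    if module_available("apache_beam"):
        print("apache_beam is importable from this interpreter.")
    else:
        print("apache_beam is missing here; try:")
        print(install_command("apache-beam[gcp]"))
```

Running this with the same `python` command used to launch the pipeline shows which interpreter is active. Inside a virtual environment, that interpreter should be the environment's own.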

  • @nathaniasantanigels
    @nathaniasantanigels 1 month ago +1

    How can I do this if the data comes from Google Sheets?

  • @ashishvats1515
    @ashishvats1515 1 year ago +1

    Great video. I want to read a table over a JDBC connection and load it into BigQuery… Could you please share any documentation on how to take a table as input from JDBC and load it into BigQuery?

    • @cloudaianalytics6242
      @cloudaianalytics6242  1 year ago +1

      beam.apache.org/releases/pydoc/2.24.0/apache_beam.io.jdbc.html
      beam.apache.org/releases/pydoc/current/apache_beam.io.jdbc.html

    • @ashishvats1515
      @ashishvats1515 1 year ago +1

      @@cloudaianalytics6242 Thanks. If I face any issues, can I ping you on LinkedIn or Telegram?

    • @cloudaianalytics6242
      @cloudaianalytics6242  1 year ago

      @@ashishvats1515 😊 Sure

    • @ashishvats1515
      @ashishvats1515 1 year ago

      @@cloudaianalytics6242 I tried, but I'm facing some errors… Could you please share example code for this or make a video on it?
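
For readers following this thread, here is a minimal sketch of a JDBC-to-BigQuery pipeline built on `ReadFromJdbc` from the documentation linked above. It is not the channel author's code. The PostgreSQL driver default, the `jdbc_read_kwargs` and `run` helper names, and the assumption that rows arrive as NamedTuple-like objects are all illustrative. Per the linked docs, `ReadFromJdbc` is a cross-language transform, so a Java runtime is expected on the machine that launches the job.

```python
from typing import Any, Dict, List, Optional


def jdbc_read_kwargs(table: str, jdbc_url: str, user: str, password: str,
                     driver: str = "org.postgresql.Driver") -> Dict[str, Any]:
    """Collect keyword arguments for apache_beam.io.jdbc.ReadFromJdbc.

    The default driver class is an assumption; swap in the one for your database.
    """
    return {
        "table_name": table,
        "driver_class_name": driver,
        "jdbc_url": jdbc_url,
        "username": user,
        "password": password,
    }


def run(read_kwargs: Dict[str, Any], bq_table: str, bq_schema: str,
        pipeline_args: Optional[List[str]] = None) -> None:
    """Read a table over JDBC and append its rows to a BigQuery table."""
    # Imported lazily so the helper above can be used without Beam installed.
    import apache_beam as beam
    from apache_beam.io.jdbc import ReadFromJdbc
    from apache_beam.options.pipeline_options import PipelineOptions

    with beam.Pipeline(options=PipelineOptions(pipeline_args)) as p:
        (
            p
            | "ReadJdbc" >> ReadFromJdbc(**read_kwargs)
            # Assumption: each row is NamedTuple-like, so _asdict() yields a dict.
            | "ToDict" >> beam.Map(lambda row: row._asdict())
            | "WriteBQ" >> beam.io.WriteToBigQuery(
                bq_table,  # e.g. "my-project:my_dataset.my_table" (hypothetical)
                schema=bq_schema,  # e.g. "id:INTEGER,name:STRING" (hypothetical)
                write_disposition=beam.io.BigQueryDisposition.WRITE_APPEND,
                create_disposition=beam.io.BigQueryDisposition.CREATE_IF_NEEDED,
            )
        )
```

Keeping the connection settings in a plain dictionary makes them easy to load from a config file or a secret manager instead of hard-coding credentials.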

  • @sumitdwivedi9474
    @sumitdwivedi9474 1 year ago

    Can you create this pipeline and do the transformations within GCP Dataflow itself?

  • @ashwinjoshi3331
    @ashwinjoshi3331 1 year ago

    Thanks for the video. One question: if the source is an on-premises Oracle database and the sink is BigQuery, what changes are required?

    • @cloudaianalytics6242
      @cloudaianalytics6242  1 year ago

      I need to do a bit of research on this. We can definitely use JDBC or ODBC connectors.

    • @neharas
      @neharas 1 year ago

      What is on-premise? Is it traditional computers, or some type of cloud?

  • @sanketgurnalkar5813
    @sanketgurnalkar5813 1 year ago

    How do I pass runtime parameters? Can you share the code?

    • @cloudaianalytics6242
      @cloudaianalytics6242  1 year ago

      Sure, I'll make a video on it. Meanwhile, you can get it from my GitHub repo:
      github.com/vigneshSs-07?tab=repositories
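
Until that video is available, one common pattern for passing parameters to a Beam pipeline at launch is to parse your own flags with `argparse` and hand the remaining flags to `PipelineOptions`. This is a sketch under assumptions, not the code from the repo above. The `--input` and `--output` flag names and the `parse_args` and `run` helpers are hypothetical.

```python
import argparse
from typing import List, Optional, Tuple


def parse_args(argv: Optional[List[str]] = None) -> Tuple[argparse.Namespace, List[str]]:
    """Split the command line into pipeline-specific flags and Beam/Dataflow flags."""
    parser = argparse.ArgumentParser()
    parser.add_argument("--input", required=True, help="Path or gs:// URI to read")
    parser.add_argument("--output", required=True, help="Output path prefix")
    # Unrecognized flags (e.g. --runner, --project) are returned untouched.
    return parser.parse_known_args(argv)


def run(argv: Optional[List[str]] = None) -> None:
    known_args, pipeline_args = parse_args(argv)

    # Imported lazily so parse_args can be used without Beam installed.
    import apache_beam as beam
    from apache_beam.options.pipeline_options import PipelineOptions

    options = PipelineOptions(pipeline_args)
    with beam.Pipeline(options=options) as p:
        (
            p
            | "Read" >> beam.io.ReadFromText(known_args.input)
            | "Strip" >> beam.Map(str.strip)
            | "Write" >> beam.io.WriteToText(known_args.output)
        )


if __name__ == "__main__":
    run()
```

A launch might then look like `python pipeline.py --input gs://my-bucket/in.csv --output gs://my-bucket/out --runner DataflowRunner --project my-project --region us-central1 --temp_location gs://my-bucket/tmp`, where the bucket and project names are placeholders.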

  • @honeylokesh2340
    @honeylokesh2340 8 months ago

    How do I enroll in your training?

    • @cloudaianalytics6242
      @cloudaianalytics6242  1 month ago

      Please send an email to cloudaianalytics@gmail.com
      If you are interested in self-paced learning, take a look at this self-paced course on Udemy:
      www.udemy.com/course/gcp-professional-dataengineer-certification-a-complete-guide/

  • @ashraf_isb
    @ashraf_isb 7 months ago

    thanks man!

  • @tommedcouk
    @tommedcouk 1 year ago

    Dataflow isn’t the most widely used component in the Google Cloud Platform. Even if you Google this question, the sensible answer is Compute Engine, because it runs underneath pretty much all the other services, and also because a lot of companies do a lift and shift to the cloud before integrating with the other services. You make this claim twice at the beginning of the video, but it’s incorrect.

    • @klgulen650
      @klgulen650 11 months ago

      What about Airflow?

    • @Rajdeep6452
      @Rajdeep6452 9 months ago

      Can’t integrate Airflow (Cloud Composer) with VM instances on GCP.

    • @cloudaianalytics6242
      @cloudaianalytics6242  1 month ago

      Apologies for the wrong information. Yes, Compute Engine is the base for everything, I agree. It really depends on the business use case.

    • @cloudaianalytics6242
      @cloudaianalytics6242  1 month ago

      Airflow is widely used to orchestrate big data pipelines. In GCP, Airflow is built into Cloud Composer, but you can also run it independently.

  • @AnantPradhan-y7m
    @AnantPradhan-y7m 5 months ago +1

    Couldn't understand. Complicated...

    • @cloudaianalytics6242
      @cloudaianalytics6242  1 month ago

      Sorry to hear that. I'll try to break it down in upcoming videos. Please keep an eye out for them.

  • @pm4306
    @pm4306 1 year ago

    Very confusing... as you keep jumping from one screen to another...

    • @cloudaianalytics6242
      @cloudaianalytics6242  1 year ago

      Sorry to hear that. You can use the playback speed option in YouTube to slow the video down. Hope it helps.

  • @1itech
    @1itech 1 year ago

    Please make it a little slower.

  • @shamilak1
    @shamilak1 5 months ago +1

    Please share the head_usa_names file.