#78. Azure Data Factory - Execute Python script from ADF

Поделиться
HTML-код
  • Опубликовано: 8 ноя 2024

Комментарии • 69

  • @azurelearner4055
    @azurelearner4055 2 года назад +2

    Great!! Best videos on ADF covering almost all the concepts and are very easy to understand

  • @mmmm_yesverynice
    @mmmm_yesverynice 3 года назад

    Why is this better than an automation account

    • @AllAboutBI
      @AllAboutBI  3 года назад

      Sorry, what's an automation acct

  • @robertwiley2957
    @robertwiley2957 2 года назад +1

    This was a great overview! I was thinking about this today and was happy to find your overview. It's a great starting point. Thank you!

  • @stephenmartin6995
    @stephenmartin6995 Год назад

    Very informative. I am currently attempting to execute in-process python to alter a files contents. This is very helpful.

  • @metechieandsachinfan768
    @metechieandsachinfan768 4 года назад +2

    Thanks for a wonderful explanation. Indeed a great flow

  • @ranadeepbasani8890
    @ranadeepbasani8890 3 года назад +1

    Hi Madam, Great video. I am using ADF for Bulk and Incremental loads. In an instance i need to do a REST API call for bulk loads. per call i am able to retrieve only 20 records from REST API. where the max can be at 50 records which i am not able to apply in ADF. so for this i started to look into adding a custom python module to hit the API. Is there way we can do this ? please let me know madam

  • @NeumsFor9
    @NeumsFor9 2 года назад

    Remember Microsoft's PROJECT REAL from 2005? In it they write a script component to work around the limitations of the Lookup Transform when it comes to implementing Inferred Members without having to keep making calls to the RDBMS for each value not in a lookup transform. They wanted the best of the full cache mode and partial cache mode.
    If you remember it, this would be a GREAT USE CASE for , upon encountering a new value that is not in the cache of either an SSIS lookup or ADF Lookup transform.....and routing the value to another branch.......call the RDBMS, insert the new value, bring back the surrogate key in the pipeline, add it to the cache as a sorted in-memory dictionary and continue to the next row in the pipeline. The next time this same value is encountered in the same pipeline run, it can then reference that dicitionary, pick up the surrogate key, and rejoin the main ETL branch without AGAIN having to make ANOTHER RDBMS call.
    Does this make sense? It would be an awesome demo.....if, indeed you could use the external call transform to either run an Azure Function or call a custom activity or REST API that did the same. Not sure we could achieve the same speed as the SSIS script component, but it would still be cool to see.

  • @prakashchinni6126
    @prakashchinni6126 8 месяцев назад

    pls upload english subtitles for this video . very good concept

  • @tiananmensquareentertainme7361
    @tiananmensquareentertainme7361 3 года назад +1

    Thanks a lot madam. This helped a lot for my project. Only the last step the custom output part is not working. If there is some modification in those points do let me know. It's not a show stopper for me anyways

    • @AllAboutBI
      @AllAboutBI  3 года назад

      Glad to know. Did u use any library? Like pandas in ur script.

    • @tiananmensquareentertainme7361
      @tiananmensquareentertainme7361 3 года назад

      @@AllAboutBI Hello madam. It's working perfectly fine now. The write file path I had missed out. I thought that was irrelevant. Added those and it's working perfectly

  • @azurelearner4055
    @azurelearner4055 11 месяцев назад

    Thanks for the video and can you run the same code in azure batch itself without the azure data factory

  • @Maddychin08
    @Maddychin08 2 года назад

    Great video !! Can we execute shell script as well ??

  • @dukefler
    @dukefler Месяц назад

    Hello Ma'am.... I'll be really grateful if u cud extend this topic further and display Pandas and ML libraries etc in action on Batch Pool... Was just doing an interview task and found it difficult. In the end I went ahead with Databricks Python activity.

  • @kathirsubramaniam4597
    @kathirsubramaniam4597 3 года назад

    Thanks a lot. Could you please put a video on how to execute a PowerShell script from ADF?

  • @tiananmensquareentertainme7361
    @tiananmensquareentertainme7361 3 года назад

    Hello madam. Is there any automated way to down scale and up scale the dedicated nodes in a pool? This would help reduce the azure cost. Please let me know your thoughts

  • @clotoscar
    @clotoscar 3 года назад

    Thanks a lot :) from chile

  • @abhishekkumar-es1wl
    @abhishekkumar-es1wl 3 года назад

    Your videos are the best material on Azure, Thanks a lot...I have a question, can we restart the ADF pipeline from its point of failure? kind of checkpointing...

    • @AllAboutBI
      @AllAboutBI  3 года назад

      Thanks for the feedback.
      Yes it's possible using powershe'll. Pl check microsoft documentation on it

    • @abhishekkumar-es1wl
      @abhishekkumar-es1wl 3 года назад

      @@AllAboutBI Thanks for the swift reply mam...For example I have 500 files to copy from source to destination and after 400 files pipeline or Copy activity got failed, then is it possible that pipelines resumes for exactly for the remaining 100 files?

    • @AllAboutBI
      @AllAboutBI  3 года назад

      Yes possible

    • @abhishekkumar-es1wl
      @abhishekkumar-es1wl 3 года назад

      @@AllAboutBI Thanks again!!!

    • @abhishekkumar-es1wl
      @abhishekkumar-es1wl 3 года назад

      Hi mam, I tried to search the same on net but could not get it. If you have idea then could you pls share any link or steps to achieve this?

  • @thepakcolapcar
    @thepakcolapcar Год назад

    Hello Madam
    Good informational video
    Does setting up of pool and batch service adds cost to azure billing?
    Does batch pool resources are counted only when a python script from ADF is run or is it billed for all the time it is there even if no adf batch executions are invoked?

    • @AllAboutBI
      @AllAboutBI  Год назад

      From the moment you spin up a vm charges start to accumulate even if the nodes are idle.

    • @thepakcolapcar
      @thepakcolapcar Год назад

      @@AllAboutBI
      Thank you very much for the quick response. I wish azure allowed using existing VM already set up for some other activities. This is where all these cloud providers try get revenue 🙂
      So its just the VM gets charged or are there any charges such as pool etc? I am just asking as to get an idea, the info need not be accurate.

  • @ankitpancholi5739
    @ankitpancholi5739 3 года назад +2

    Good video. Very clear. However when executing these steps I am getting an error
    {"errorCategory":0,"code":"CommandProgramNotFound","message":"The specified command program is not found","details":[{"Name":"Message","Value":"The system cannot find the file specified."}]}
    Unable to run the program. Any pointers please. stderr.txt is empty.

    • @AllAboutBI
      @AllAboutBI  3 года назад

      Pls watch latest video 114 I have given steps

    • @olegserdyuk5082
      @olegserdyuk5082 11 месяцев назад +2

      @@AllAboutBI it is not that easy to find video 114. Can you post a full name or the link?

  • @magdhalsmith2724
    @magdhalsmith2724 2 года назад

    maam in order run python script in ADF using custom activity, should we enable anything?

  • @shilpashilpa2099
    @shilpashilpa2099 3 года назад +1

    Very helpful content.... I tried to experiment and I'm getting the error "CommandProgramNotFound"; did I miss something in my python script?

    • @AllAboutBI
      @AllAboutBI  3 года назад +1

      Seems like the python file is not found. Pls check the path of the file you are trying to invoke

  • @adidasravi
    @adidasravi 3 года назад

    Hello madam. Had a question. I have a customized job created in batch service but how can I make adf to make use of that job. Eventhough I've created the job for the pool adf uses, the adf creates its own new job and uses it. I want adf to make use of the existing job in pool. Please let me know if you've any idea

  • @nico_mano
    @nico_mano Год назад

    Thanks a lot

  • @guptaashok121
    @guptaashok121 2 года назад

    When we try to manipulate ads data from python script. Do we need to establish any authentication.. between Pool and adls.. if you have any video for that..

  • @justinrey08
    @justinrey08 9 месяцев назад

    Will this incur any cost?

  • @santhoshns2932
    @santhoshns2932 3 года назад

    Thanks a lot for the Vedio.Could you please share how to import numpy and pandas through adf.Can you please make a Vedio and upload it for us.

  • @nandanagurrapu6201
    @nandanagurrapu6201 3 года назад +1

    Hi
    I have tried this example of executing python script. I have used same script as yours, but the pipeline failed saying
    The system can not find file specified
    Any help on this.
    Thanks

    • @shilpashilpa2099
      @shilpashilpa2099 3 года назад +1

      Hi Nandana, were you able to fix this error? I'm also getting same error...can you please help?

    • @AllAboutBI
      @AllAboutBI  2 года назад

      Hi check this video. ruclips.net/video/_3_eiHX3RKE/видео.html

  • @harnesh007
    @harnesh007 3 года назад

    Thanks for video. Is it possible to handle CSV file with custom + python script ?

    • @abhishekmitra6315
      @abhishekmitra6315 2 года назад

      I want to know this as well. Kindly let me know solution if found

  • @jayanthjayaraman2627
    @jayanthjayaraman2627 3 года назад

    How do i import the python libraries like shareplum and pandas, i have a python script but i need to integrate it to a pipeline , so i can grab the files from sharepoint using python

  • @mahendhirang1226
    @mahendhirang1226 2 года назад

    Hi
    I need details, how I can pass one file as a parameter into our python scripts. I have my source file in storage account..what my script doing converting a parquet(source file) to Xml file..can u pls tell me..

  • @swaminathanteamsak7420
    @swaminathanteamsak7420 4 года назад

    Thanks a lot. Iam trying to do some numpy operations. Can you pls make a video to download numpy and run some numpy codd

  • @hajaaa100
    @hajaaa100 2 года назад

    what should i do if my python code imports something like below,
    import pandas as pd
    import os
    from glob import glob
    How to bring that pandas refrence

  • @thepakcolapcar
    @thepakcolapcar Год назад

    after setting up the batch account and all other details, when i run the pipeling the custom activity errors out saying "Cannot create Shared Access Signature unless Account Key credentials are used." Any clue might help. Thanks

    • @AllAboutBI
      @AllAboutBI  Год назад

      What are you trying to do with the batch job.. if u can send details to funlearn0007@gmail.com i will see if I know something

  • @oriono9077
    @oriono9077 4 года назад

    Great

    • @AllAboutBI
      @AllAboutBI  4 года назад

      Thanks 🙏

    • @chharsha4932
      @chharsha4932 3 года назад

      @@AllAboutBI Madam can you please make a video for doing SQL to SQl copy activity using python

  • @technologyexcitesme8829
    @technologyexcitesme8829 4 года назад

    Please show output . That will help us

  • @skselva403
    @skselva403 2 года назад

    We have a Virtual machine, Pool level Virtual machine is different or the same, it is the same meaning how do I use it?

    • @AllAboutBI
      @AllAboutBI  2 года назад +1

      Sorry, I don't understand your question selva. Can you pls explain a bit

    • @skselva403
      @skselva403 2 года назад

      ​@@AllAboutBI Hi Mam thanks for the replay, Pool Irukura Virtual machine and Azure Virtual machine
      different illa same ma mam,
      Ethu same na Pool Azure Virtual machine na use panalama ?

  • @elanty1
    @elanty1 3 года назад

    Hi mam, can I execute power shell code using same method

  • @techhunter877
    @techhunter877 4 года назад

    Great content. But I get some error. Can u help

    • @AllAboutBI
      @AllAboutBI  4 года назад

      Sure. Mail me the details to funlearn0007@gmail.com

  • @aadhilimam8253
    @aadhilimam8253 5 месяцев назад

    can we try this on free tier account ?

  • @guruprasadramanaboina8592
    @guruprasadramanaboina8592 2 года назад

    Hi mam