azure-data-factory

Execute Python scripts in Azure Data Factory

牧云@^-^@ posted on 2020-07-05 11:04:28
Question: I have my data stored in blobs, and I have written a Python script that does some computations and creates another CSV. How can I execute this in Azure Data Factory? Answer 1: Mighty. You could use the Azure Data Factory V2 Custom activity for your requirements. You can directly execute a command to invoke the Python script using a Custom activity. Please refer to this sample on GitHub. Hope it helps. Answer 2: Another option is using a DatabricksSparkPython activity. This makes sense if you want to scale out
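
For illustration, here is a minimal sketch of what the Custom activity from Answer 1 might look like, built as a Python dict and printed as pipeline JSON. The linked service names (AzureBatchLS, BlobStorageLS), the script name compute.py and the folder path are assumptions, not taken from the original answer or the sample it links to.

```python
import json


def custom_activity(name, command, batch_ls, storage_ls, folder_path):
    """Return an ADF V2 Custom activity definition as a plain dict."""
    return {
        "name": name,
        "type": "Custom",
        # Azure Batch linked service that provides the compute pool (hypothetical name).
        "linkedServiceName": {
            "referenceName": batch_ls,
            "type": "LinkedServiceReference",
        },
        "typeProperties": {
            # Command line executed on the Batch node.
            "command": command,
            # Storage linked service and folder that hold the script (hypothetical names).
            "resourceLinkedService": {
                "referenceName": storage_ls,
                "type": "LinkedServiceReference",
            },
            "folderPath": folder_path,
        },
    }


activity = custom_activity(
    name="RunPythonScript",
    command="python compute.py",
    batch_ls="AzureBatchLS",
    storage_ls="BlobStorageLS",
    folder_path="scripts",
)
print(json.dumps(activity, indent=2))
```

The printed JSON could be pasted into the activities array of a pipeline definition, assuming the Batch pool nodes have Python installed.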

Is it possible to connect to Databricks Delta Lake tables from ADF?

廉价感情. posted on 2020-07-03 10:10:30
Question: I'm looking for a way to connect to Databricks Delta Lake tables from ADF and other Azure services (such as Data Catalog). I don't see a Databricks data store listed among the ADF data sources. On a similar question, "Is possible to read an Azure Databricks table from Azure Data Factory?", @simon_dmorias seems to have suggested using an ODBC connection to connect to Databricks tables. I tried to set up the ODBC connection, but it requires an IR to be set up. There are 2 options I see when creating the IR.

Azure Data Factory: use variables in query

↘锁芯ラ posted on 2020-06-29 04:29:08
Question: I have created a copy activity that copies data from an on-premises database to an Azure SQL Database. I need to modify the query dynamically so that it takes a date range, so I created two variables, inidate and enddate, that I want to use inside the WHERE clause, but I don't know how to reference the variables. I tried this, but it doesn't work: "SELECT * FROM tableOnPrem WHERE dateOnPrem BETWEEN '@variable('inidate')' AND '@variable('enddate')' Please help. Many thanks. Answer 1: In a Pipeline (like for
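
As a hedged illustration (not necessarily what the truncated answer goes on to say): the ADF expression function is variables, not variable, and a value can be embedded in a string with the @{...} interpolation syntax. The small Python helper below, whose function names are made up for this sketch, only builds the query text that would go into the copy activity's source query.

```python
def interpolate_variable(name):
    """Return the ADF string-interpolation token for a pipeline variable."""
    return "@{variables('" + name + "')}"


def between_query(table, column, start_var, end_var):
    """Build a source query whose date bounds come from two pipeline variables."""
    return (
        f"SELECT * FROM {table} WHERE {column} "
        f"BETWEEN '{interpolate_variable(start_var)}' "
        f"AND '{interpolate_variable(end_var)}'"
    )


query = between_query("tableOnPrem", "dateOnPrem", "inidate", "enddate")
print(query)
```

ADF replaces each @{...} token at run time, so the database receives plain quoted date literals.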

Azure Data Factory: Handling inner failure in until/for activity

可紊 posted on 2020-06-29 03:48:14
Question: I have an Azure Data Factory v2 pipeline containing an Until activity. Inside the Until is a Copy activity; if this fails, the error is logged, exactly as in this post (Azure Data Factory Pipeline 'On Failure'), and I want the loop to continue. Although the inner Copy activity's error is handled, the Until activity is deemed to have failed because an inner activity has failed. Is there any way to configure the Until activity to continue when an inner activity fails? Answer 1: Solution: Put the error

Azure Data Factory - Limit the number of Databricks pipeline running at the same time

假如想象 posted on 2020-06-27 19:26:19
Question: I am using ADF to execute a Databricks notebook. Currently I have 6 pipelines, and they are executed consecutively. Specifically, after the former is done, the latter is executed with multiple parameters by the loop box, and this keeps going. For example, after the first pipeline is done, it will trigger 3 instances of the second pipeline with different parameters, and each of these instances will trigger multiple instances of the third pipeline. As a result, the deeper I go, the more
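
Two settings that may help here, sketched under assumptions since the question is cut off and no answer is shown: a pipeline-level concurrency value caps how many runs of that pipeline execute at once (extra runs are queued), and a ForEach activity's batchCount caps how many iterations run in parallel. The pipeline name, parameter name, linked service name and notebook path below are hypothetical.

```python
import json


def throttled_pipeline(name, max_parallel_runs, batch_count, notebook_path):
    """Return a pipeline definition with run-level and loop-level parallelism caps."""
    return {
        "name": name,
        "properties": {
            # Maximum simultaneous runs of this pipeline.
            "concurrency": max_parallel_runs,
            "parameters": {"items": {"type": "Array"}},
            "activities": [
                {
                    "name": "ForEachItem",
                    "type": "ForEach",
                    "typeProperties": {
                        "isSequential": False,
                        # Maximum iterations executed in parallel.
                        "batchCount": batch_count,
                        "items": {
                            "value": "@pipeline().parameters.items",
                            "type": "Expression",
                        },
                        "activities": [
                            {
                                "name": "RunNotebook",
                                "type": "DatabricksNotebook",
                                # Hypothetical Databricks linked service name.
                                "linkedServiceName": {
                                    "referenceName": "DatabricksLS",
                                    "type": "LinkedServiceReference",
                                },
                                "typeProperties": {"notebookPath": notebook_path},
                            }
                        ],
                    },
                }
            ],
        },
    }


pipeline = throttled_pipeline("SecondPipeline", 2, 3, "/Shared/process")
print(json.dumps(pipeline, indent=2))
```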

How to call a REST API using Azure Data Factory Pipelines?

故事扮演 posted on 2020-06-25 06:41:26
Question: I would like to execute a REST API with OAuth authentication using an ADF pipeline. Without OAuth2, I could call any REST API. So my question is: does an ADF pipeline support REST APIs with OAuth2 authentication? If yes, please provide a solution. The API I am trying is documented at this URL: https://docs.microsoft.com/en-us/rest/api/datafactory/pipelineruns/querybyfactory Thanks, Binu. Answer 1: You could call the REST API with a Web activity in the pipeline, select the Authentication
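
To make the OAuth2 flow concrete, here is a rough Python sketch of the two HTTP requests involved when the API is called outside ADF: a client-credentials token request to Azure AD, then the Query By Factory call with a bearer token. It only builds the requests; the tenant, client, subscription, resource group and factory values are placeholders, and the service principal is assumed to have access to the factory.

```python
import json
import urllib.parse
import urllib.request


def token_request(tenant_id, client_id, client_secret):
    """Build the client-credentials token request for Azure Resource Manager."""
    url = f"https://login.microsoftonline.com/{tenant_id}/oauth2/token"
    body = urllib.parse.urlencode({
        "grant_type": "client_credentials",
        "client_id": client_id,
        "client_secret": client_secret,
        "resource": "https://management.azure.com/",
    }).encode()
    return urllib.request.Request(url, data=body, method="POST")


def query_runs_request(token, subscription_id, resource_group, factory,
                       updated_after, updated_before):
    """Build the Pipeline Runs 'Query By Factory' request."""
    url = (
        f"https://management.azure.com/subscriptions/{subscription_id}"
        f"/resourceGroups/{resource_group}/providers/Microsoft.DataFactory"
        f"/factories/{factory}/queryPipelineRuns?api-version=2018-06-01"
    )
    body = json.dumps({
        "lastUpdatedAfter": updated_after,
        "lastUpdatedBefore": updated_before,
    }).encode()
    headers = {
        "Authorization": f"Bearer {token}",
        "Content-Type": "application/json",
    }
    return urllib.request.Request(url, data=body, headers=headers, method="POST")


# Placeholder values; sending would be urllib.request.urlopen(req).
req = query_runs_request(
    "PLACEHOLDER_TOKEN", "SUB_ID", "my-rg", "my-factory",
    "2020-06-01T00:00:00Z", "2020-06-25T00:00:00Z",
)
print(req.get_method(), req.full_url)
```

Inside a pipeline, the same two steps would correspond to Web activities, with the access_token field of the first response passed into the Authorization header of the second.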

The client with object id does not have authorization to perform action 'Microsoft.DataFactory/datafactories/datapipelines/read' over scope

為{幸葍}努か posted on 2020-06-24 06:03:09
Question: I was trying to invoke a Data Factory pipeline from an Azure Function programmatically. It's throwing the following error. Link: http://eatcodelive.com/2016/02/24/starting-an-azure-data-factory-pipeline-from-c-net/ AuthorizationFailed: The client 'XXXX-XXXXX-XXXX' with object id 'XXX829e05'XXXX-XXXXX' does not have authorization to perform action 'Microsoft.DataFactory/datafactories/datapipelines/read' over scope '/subscriptions/XXXXXX-4bf5-84c6-3a352XXXXXX/resourcegroups/fffsrg/providers/Microsoft

Use dynamic value as table name of a table storage in Azure Data Factory

我的未来我决定 posted on 2020-06-17 14:19:05
Question: I have an ADF pipeline that uses a Copy Data activity to copy data from blob storage to table storage. This pipeline runs on a trigger once every day. I have provided the table name in the table storage dataset as 'Table1'. Instead of providing a hard-coded table name (Table1), is it possible to provide a dynamic value, such that the run ID of the pipeline run is used as the table name and the data is copied from the blob to that table in table storage?
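
One point worth checking before using the run ID directly: Azure Table storage table names must be alphanumeric, must not start with a digit, and must be 3 to 63 characters long, while a pipeline run ID is a GUID containing hyphens. The sketch below mirrors in Python what an ADF expression such as @concat('Run', replace(pipeline().RunId, '-', '')) would produce; the 'Run' prefix and the sample run ID are assumptions.

```python
import re

# Azure Table storage naming rule: a letter first, alphanumeric only, 3-63 characters.
TABLE_NAME = re.compile(r"^[A-Za-z][A-Za-z0-9]{2,62}$")

# Equivalent ADF dynamic content for the dataset's table name (hypothetical prefix).
ADF_EXPRESSION = "@concat('Run', replace(pipeline().RunId, '-', ''))"


def table_name_from_run_id(run_id, prefix="Run"):
    """Derive a valid table name from a pipeline run ID (a GUID string)."""
    name = prefix + run_id.replace("-", "")
    if not TABLE_NAME.match(name):
        raise ValueError(f"not a valid table name: {name!r}")
    return name


example = table_name_from_run_id("0b1c2d3e-4f50-4a6b-8c7d-9e0f1a2b3c4d")
print(example)
```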

How to target a DevOps branch when scripting object creation in ADFv2?

不羁的心 posted on 2020-06-12 17:51:02
Question: Using Azure Data Factory v2 with Git / Azure DevOps integration: if you, for example, create a trigger using Set-AzDataFactoryV2Trigger via PowerShell according to the documentation, the trigger is created directly in the adf_publish branch. This is an issue, as it results in a mismatch between the master branch and adf_publish, meaning you will not be able to publish going forward because this raises an error. How do I get the cmdlet to create the trigger in a new or specific branch,