azure-data-flow

Split a json string column or flatten transformation in data flow (ADF)

那年仲夏 提交于 2021-02-11 14:36:31
问题 I copy the following csv file to a data flow in ADF. The column Data has json format, but it is considered string. I want to flatten Data column into individual rows. I tried the flatten transformation, it did not work as Data column is not json. How do I deal with it? I also tried split expression, and it did not work either. Thank you 回答1: Just from your screenshot, We can find that : The data in Data are not JSON format. Data most look like an Array. The 'array' has 9 elements. Me must

Multiple Data flows vs all Transformations in one

南楼画角 提交于 2021-02-05 09:27:18
问题 Hi I am new to Azure data factory and not all familiar with the back-end processing that run behind the scenes. I am wondering if there is a performance impact to running couple of data flows in parallel when compared to having all the transformations in one data flow. I am trying to stage some data with a not exists transformation. i have to do it for multiple tables. when i test ran two data flows in parallel the clusters were brought up together for both the data flows simultaneously. But

Azure data factory data flow silently NULLing date column

只愿长相守 提交于 2021-01-29 09:57:35
问题 I'm trying to use Azure Data Factory to upsert a CSV into an Azure SQL table. All seemed well until I checked the results. One of the columns is a nullable date. The CSV contains a value like so 1/2/2020 12:00:00 AM . The data flow silently inserts a NULL instead of throwing an error because it didn't like the input. So how can I get my data flow to convert the string to a datetime properly, and then to error out on issues like this in the future? I really don't want silent failures and bad

Azure Data Flow creating / managing keys for identity relationships

微笑、不失礼 提交于 2021-01-29 02:04:16
问题 Curious to find out what the best way is to generate relationship identities through ADF. Right now, I'm consuming JSON data that does not have any identity information. This data is then transformed into multiple database sink tables with relationships (1..n, etc.). Due to FK constraints on some of the destination sink tables, these relationships need to be "built up" one at a time. This approach seems a bit kludgy, so I'm looking to see if there are other options that I'm not aware of. Note

How to write Filter Expression in Azure Data Flow?

十年热恋 提交于 2021-01-05 10:30:32
问题 I have filter in Data Flow of Azure Data Factory where I need to write expression for the following SQL logic(Where statement): Where ID NOT IN (Select PID from Table1 where ptype_c in (100, 2000)) 回答1: If your source is SQL Database, you can just choose Query option. If no, you can try like this. 1.create a Filter activity, get the data which ptype_c is in (100, 2000). 2.create a Select activity, change the column name. 3.new a branch of Source. 4.create a Exist activity. 来源: https:/

How to create a table in SQL Database from a CSV file in Blob which contain all the column name with its data type through Data Flow or ADF pipeline?

只愿长相守 提交于 2020-04-30 07:13:19
问题 I am having a CSV file in my Azure Blob Storage which contain all the column name with its data Data type of respective tables. I want to create a table in SQL Database from this Blob file with the same column name with its corresponding datatype without doing the mapping. I have created a table through data flow but I have to set the data type of each column manually. But I don't want to do this. When I create a table it should accept the same data types in the source as well as synch which

shaping json data in the sink

自作多情 提交于 2019-12-25 01:38:56
问题 How do union these two streams into a single JSON output using data factory/data flow? I have two streams of data. Stream 1 (csv) : 123,alex,03/18/1985 Stream 2 (csv) : 123,blue,new 123,purple,old Desired output: { "Stream1": { "id": 123, "name": "alex", "dob": "03/18/1985" }, "Stream2": [ { "id": 123, "color": "blue", "status": "new" }, { "id": 123, "color": "purple", "status": "old" } ] } How do union these two streams into a single JSON output? 来源: https://stackoverflow.com/questions