stream-analytics

How to find Duplicate documents in Cosmos DB

元气小坏坏 提交于 2020-01-25 04:19:12
问题 I have seen like a huge amount of data write to cosmos DB from stream analytics job on a particular day. It was not supposed to write huge amount of documents in a day. I have to check if there is duplication of documents on that particular day. Is there any query/any way to find out duplicate records in cosmos DB? 回答1: Is there any query/any way to find out duplicate records in cosmos DB? Quick answer is YES.Please use distinct keyword in the cosmos db query sql.And filter the _ts (System

ffprobe Fastest way to read video packets

感情迁移 提交于 2019-12-20 06:30:27
问题 I've been using ffprobe to read packets from video streams for analysis. However it's painfully slow. A 30min 720p video takes roughly 8min to read. I'm using the following command at the moment: ffprobe.exe -i video.mp4 -show_entries packet=size,pts_time,flags,duration_time This gives me the following information for every packet: [PACKET] pts_time=1981.392000 duration_time=0.032000 size=1536 flags=K_ [/PACKET] I did read a fair bit in the documentation but it looks like this is just it or

Convert Apache Flink Datastream to a Datastream that makes tumbling windows of 2 events and sum on a value

删除回忆录丶 提交于 2019-12-11 16:59:42
问题 I have a Flink Table with the following columns: final String[] hNames = {"mID", "dateTime", "mValue", "unixDateTime", "mType"}; I want to create a DataStream in Apache Flink that makes tumbling windows of a length of 2 each and calculates the average mValue for that window. Below I've used the SUM function since it seems there isnt a AVG function. These windows must be grouped on the mID (is a Integer) or dateTime column. I key the windows by the column mType , since these represent a

Wrapping JSON into output Stream Analytics query

我的未来我决定 提交于 2019-12-08 09:37:13
问题 I am using Stream Analytics query to filter my inputted Complex Json object. Input: { "id" : "001", "firstArray":[ { "tid" : 9, "secondArray":[ { "key1" : "value1", "key2" : "value2" }, {...} ] }, { "tid" : 8, "secondArray":[ { "key1" : "value1", "key2" : "value2" }, {...} ] } ] } This is my query: WITH T1 AS ( SELECT FirstArray.ArrayValue.Tid as Tid, FirstArray.ArrayValue.secondArray as SecondArray FROM inputfromeventhub MySource OUTER APPLY GetElements(MySource.firstArray) AS FirstArray )

ffprobe Fastest way to read video packets

杀马特。学长 韩版系。学妹 提交于 2019-12-02 13:31:24
I've been using ffprobe to read packets from video streams for analysis. However it's painfully slow. A 30min 720p video takes roughly 8min to read. I'm using the following command at the moment: ffprobe.exe -i video.mp4 -show_entries packet=size,pts_time,flags,duration_time This gives me the following information for every packet: [PACKET] pts_time=1981.392000 duration_time=0.032000 size=1536 flags=K_ [/PACKET] I did read a fair bit in the documentation but it looks like this is just it or is there a way to make this faster? Or would there be any alternative CLI tool to get the above