Specifying dynamically generated table name based on line contents

后端未结

关注

 1  1633

爱一瞬间的悲伤 2021-01-17 07:31

I would like to setup Dataflow pipeline that reads from file in a gcs bucket, and writes to bigquery table. Caveat being, table to write to should be decided based on conten

1条回答

执念已碎 (楼主)

2021-01-17 07:58

On your first question: see Writing different values to different BigQuery tables in Apache Beam

On your second question: one way to accomplish that would be to have your appengine app publish every change notification to Cloud Pubsub, and have a constantly running streaming Dataflow pipeline watching the pubsub topic and writing to BigQuery.

On your third question: yes, assuming your data representation on GCS is fixed, the rest seems like a reasonable ingestion architecture to me :)

0 讨论(0)
发布评论:

提交评论
- 加载中...