Google DataFlow Cannot read and write in different locations (Python SDK v0.5.5)

暖寄归人 2021-01-14 22:20

I'm writing a very basic DataFlow pipeline using the Python SDK v0.5.5. The pipeline uses a BigQuerySource with a query passed in, which is querying BigQuery tables from da

1 answer
  • 2021-01-14 23:25

    Thanks for reporting this issue. I assume you are using DirectRunner. For SDK versions 0.5.1 and later, we changed the implementation of the BigQuery read transform for DirectRunner so that it creates a temporary dataset, in order to support large datasets. It seems we are not setting the region of that temporary dataset correctly here. We'll look into fixing this.

    This issue should not occur if you use DataflowRunner, which creates temporary datasets in the correct region.
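
As a rough illustration of the workaround above, here is a minimal sketch of the command-line style flags that would select DataflowRunner instead of DirectRunner. The helper name `dataflow_runner_args`, the project ID, and the bucket paths are placeholders invented for this example, and how the flags are handed to the SDK depends on the SDK version you have installed.

```python
def dataflow_runner_args(project, temp_location, staging_location):
    """Build pipeline flags that select DataflowRunner.

    Hypothetical helper for illustration only; it is not part of the SDK.
    """
    return [
        '--runner=DataflowRunner',                   # run on the service, not locally
        '--project=' + project,                      # GCP project that owns the job
        '--temp_location=' + temp_location,          # GCS path for temporary files
        '--staging_location=' + staging_location,    # GCS path for staged packages
    ]


if __name__ == '__main__':
    # Placeholder values; substitute your own project and bucket.
    args = dataflow_runner_args(
        'my-project', 'gs://my-bucket/tmp', 'gs://my-bucket/staging')
    print(args)
    # Assumption: the list is then passed to the pipeline constructor,
    # e.g. beam.Pipeline(argv=args), with your existing BigQuerySource
    # read left unchanged.
```

The same flags can also be supplied directly on the command line when launching the pipeline script, which avoids changing the code at all.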
