How to load multiple files (same schema) into a table in BigQuery?

心在旅途 2020-12-20 16:51

I have a folder of CSV files with the same schema that I want to load into a BigQuery table.

Is there an option to give a folder path as the input to the bq command to lo

3 Answers
  • 2020-12-20 16:56

    Note this restriction from the documentation:

    You can use only one wildcard for objects (filenames) within your bucket. The wildcard can appear inside the object name or at the end of the object name. Appending a wildcard to the bucket name is unsupported.

    So something like gs://my_bucket/some/*/files* is not supported, because it contains two wildcards.

    Source: https://cloud.google.com/bigquery/docs/loading-data-cloud-storage#load-wildcards
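
The quoted rule can be checked before submitting a load job. The following is a minimal sketch, not part of the bq tool: is_supported_uri is a hypothetical helper that only encodes the rule as quoted above (no wildcard in the bucket name, at most one wildcard in the object name). BigQuery itself remains the authority on what it accepts.

```shell
# Hypothetical helper: returns 0 if the URI follows the quoted wildcard rule,
# 1 otherwise. It does not contact Cloud Storage or BigQuery.
is_supported_uri() {
  uri="$1"
  # Only gs:// URIs are considered.
  case "$uri" in
    gs://*) ;;
    *) return 1 ;;
  esac
  rest="${uri#gs://}"      # drop the scheme
  bucket="${rest%%/*}"     # text before the first slash
  case "$rest" in
    */*) object="${rest#*/}" ;;  # text after the first slash
    *)   object="" ;;
  esac
  # A wildcard in the bucket name is unsupported.
  case "$bucket" in
    *'*'*) return 1 ;;
  esac
  # Two or more wildcards in the object name are unsupported.
  case "$object" in
    *'*'*'*'*) return 1 ;;
  esac
  return 0
}

for uri in 'gs://my_bucket/some/path/files*' 'gs://my_bucket/some/*/files*'; do
  if is_supported_uri "$uri"; then
    echo "ok: $uri"
  else
    echo "unsupported: $uri"
  fi
done
```

The first URI passes and the second is rejected, which matches the example given in this answer.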

  • 2020-12-20 16:58

    If using Cloud Storage is an option, you can put all the files under a common prefix in a bucket and use a wildcard, e.g. gs://my_bucket/some/path/files*, to load multiple inputs in a single load job.
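
As a rough illustration of the common-prefix approach, the sketch below builds the wildcard URI and prints the matching bq load command. The helper names build_uri and load_prefix are hypothetical, and my_bucket, some/path/files, and dataset_name.table_name are placeholders. It assumes the files are CSV and that the destination table already exists with the right schema. By default it only prints the command; setting RUN=1 runs bq instead.

```shell
# Build a wildcard URI that matches every object sharing a common prefix.
build_uri() {
  bucket="$1"
  prefix="$2"
  printf 'gs://%s/%s*' "$bucket" "$prefix"
}

# Print the bq load command for one wildcard URI.
# Set RUN=1 in the environment to execute it instead of printing it.
load_prefix() {
  table="$1"
  uri="$2"
  if [ "${RUN:-0}" = "1" ]; then
    # The URI is quoted so the local shell does not expand the wildcard.
    bq load --source_format=CSV "$table" "$uri"
  else
    printf 'bq load --source_format=CSV %s "%s"\n' "$table" "$uri"
  fi
}

# Placeholder bucket, prefix, and table names.
load_prefix dataset_name.table_name "$(build_uri my_bucket some/path/files)"
```

Quoting the URI matters: without the quotes, the local shell may try to expand the asterisk against local filenames before bq ever sees it.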

  • 2020-12-20 17:15

    The files can be in subdirectories. To recursively include every CSV file under a folder:

    bq load --source_format=CSV \
    dataset_name.table_name \
    "gs://my_bucket/folder/*.csv"
    

    Here the single wildcard covers both the intermediate path and the filename (e.g. * can expand to subfolder/folder2/filename).
