How to convert an 500GB SQL table into Apache Parquet?

前端 未结 2 1928
暖寄归人
暖寄归人 2021-02-05 21:56

Perhaps this is well documented, but I am getting very confused how to do this (there are many Apache tools).

When I create an SQL table, I create the table using the fo

2条回答
  •  醉话见心
    2021-02-05 22:06

    The odbc2parquet command line tool might also be helpful in some situations.

    odbc2parquet \
    -vvv \ # Log output, good to know it is still doing something during large downloads
    query \ # Subcommand for accessing data and storing it
    --connection-string ${ODBC_CONNECTION_STRING} \
    --batch-size 100000 \ # Batch size in rows
    --batches-per-file 100 \ # Ommit to store entire query in a single file
    out.par \ # Path to output parquet file
    "SELECT * FROM YourTable"
    

提交回复
热议问题