Perhaps this is well documented, but I am getting very confused how to do this (there are many Apache tools).
When I create an SQL table, I create the table using the fo
The odbc2parquet command line tool might also be helpful in some situations.
odbc2parquet \
-vvv \ # Log output, good to know it is still doing something during large downloads
query \ # Subcommand for accessing data and storing it
--connection-string ${ODBC_CONNECTION_STRING} \
--batch-size 100000 \ # Batch size in rows
--batches-per-file 100 \ # Ommit to store entire query in a single file
out.par \ # Path to output parquet file
"SELECT * FROM YourTable"