BigQuery connector for Spark on Dataproc - cannot authenticate using service account key file

前端 未结 1 1418
北恋
北恋 2021-01-16 14:06

I have followed Use the BigQuery connector with Spark to successfully get data from a publicly available dataset. I now need to access a bigquery dataset that is owned by on

相关标签:
1条回答
  • 2021-01-16 15:10

    The issue seems to be here:

    Warning: Ignoring non-spark config property: mapred.bq.auth.service.account.json.keyfile=/tmp/keyfile.json

    To fix this, you should set Hadoop properties with spark.hadoop prefix in Spark:

    gcloud dataproc jobs submit pyspark ./bq_pyspark.py \
      --cluster $CLUSTER --region $REGION \
      --properties=spark.hadoop.mapred.bq.auth.service.account.json.keyfile=/tmp/keyfile.json
    
    0 讨论(0)
提交回复
热议问题