avro error on AWS EMR

前端 未结 4 895
轻奢々
轻奢々 2021-01-27 01:52

I\'m using spark-redshift (https://github.com/databricks/spark-redshift) which uses avro for transfer.

Reading from Redshift is OK, while writing I\'m getting



        
4条回答
  •  野趣味
    野趣味 (楼主)
    2021-01-27 02:03

    spark-redshift maintainer here.

    Other EMR users have encountered similar errors when using newer versions of the spark-avro library (which spark-redshift depends on). In a nutshell, the problem seems to be that EMR's older version of Avro takes precedence over the new version required by spark-avro. At https://github.com/databricks/spark-avro/issues/91, an issue that seems to match the exception reported here, one user suggested embedding the Avro JARs with their application code: https://github.com/databricks/spark-avro/issues/91#issuecomment-142543149

提交回复
热议问题