avro error on AWS EMR

前端 未结 4 902
轻奢々
轻奢々 2021-01-27 01:52

I\'m using spark-redshift (https://github.com/databricks/spark-redshift) which uses avro for transfer.

Reading from Redshift is OK, while writing I\'m getting



        
4条回答
  •  花落未央
    2021-01-27 02:03

    just for reference - workaround by Alex Nastetsky

    delete jars from master node

    find / -name "*avro*jar" 2> /dev/null -print0 | xargs -0 -I file sudo rm file
    

    delete jars from slave nodes

    yarn node -list | sed 's/ .*//g' | tail -n +3 | sed 's/:.*//g' | xargs -I node ssh node "find / -name "*avro*jar" 2> /dev/null -print0 | xargs -0 -I file sudo rm file
    

    Setting configs correctly as proposed by Jonathan is worth a shot too.

提交回复
热议问题