avro error on AWS EMR


轻奢々 2021-01-27 01:52

I'm using spark-redshift (https://github.com/databricks/spark-redshift), which uses Avro for data transfer.

Reading from Redshift is OK, while writing I'm getting

4 answers

花落未央 (original poster)

2021-01-27 02:03
Just for reference, here is the workaround by Alex Nastetsky.

Delete the Avro jars from the master node:
```
find / -name "*avro*jar" 2> /dev/null -print0 | xargs -0 -I file sudo rm file
```
Delete the Avro jars from the slave nodes:
```
yarn node -list | sed 's/ .*//g' | tail -n +3 | sed 's/:.*//g' | xargs -I node ssh node 'find / -name "*avro*jar" 2> /dev/null -print0 | xargs -0 -I file sudo rm file'
```
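Since both commands remove files as root and cannot be undone, it may be worth previewing what would be deleted first. The following is only a sketch of a dry run: it reuses the same `find` name pattern but prints the matches instead of removing them. The helper name `list_avro_jars` is hypothetical and not part of the original workaround.

```shell
# Dry run: print the Avro jars that the workaround would delete.
# list_avro_jars is a hypothetical helper name chosen for illustration.
list_avro_jars() {
  # Search root; defaults to / as in the workaround.
  root="${1:-/}"
  # Same name pattern as the workaround, but only prints paths (no rm).
  find "$root" -name "*avro*jar" 2> /dev/null
}
```

Calling `list_avro_jars /` on the master node (or through `ssh` on each slave node) lists the candidate jars, so the output can be checked before running the deletion.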
Setting the configs correctly, as proposed by Jonathan, is worth a shot too.