Query fails in presto-cli for a table created in Hive in ORC format with data residing in S3

Posted by 旧城冷巷雨未停 on 2020-02-25 03:55:37

Question


I set up an Amazon EMR cluster with 1 master and 1 core node (m4.large) and the following versions: EMR 5.5.0, Presto 0.170, Hadoop 2.7.3 HDFS, Hive 2.1.1 Metastore.

My Spark app wrote the data to Amazon S3 in ORC format. I then created the table in Hive (create external table TABLE ... partition() stored as ORC location 's3a://') and tried to query it from presto-cli. For the query SELECT * from TABLE I get the following error: Query 20170615_033508_00016_dbhsn failed: com.facebook.presto.spi.type.DoubleType

The only query that works is: SELECT COUNT(*) from TABLE

Any ideas?


Answer 1:


Found the problem: the column order in the data as it was written to ORC did not match the column order used when the table was created in Hive.
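As far as I know, the Presto Hive connector by default matches ORC columns to table columns by position rather than by name, which would explain why a reordered file fails with a type error while COUNT(*), which reads no column values, still works. A quick way to spot this kind of mismatch is to compare the two schemas position by position. The sketch below is only an illustration: the function name positional_mismatches and both example schemas are made up, and it assumes you have already copied the column lists out by hand (for instance from df.printSchema() in Spark and DESCRIBE in Hive), leaving out the partition columns.

```python
from typing import List, Tuple

Column = Tuple[str, str]  # (column name, type name)


def positional_mismatches(file_cols: List[Column], table_cols: List[Column]) -> List[str]:
    """Compare two schemas position by position and describe each difference."""
    problems = []
    if len(file_cols) != len(table_cols):
        problems.append(
            f"column count differs: file has {len(file_cols)}, table has {len(table_cols)}"
        )
    # zip stops at the shorter schema, so only shared positions are compared.
    for index, (file_col, table_col) in enumerate(zip(file_cols, table_cols)):
        if file_col != table_col:
            problems.append(
                f"position {index}: file has {file_col[0]} {file_col[1]}, "
                f"table declares {table_col[0]} {table_col[1]}"
            )
    return problems


# Hypothetical schemas, invented for illustration only.
orc_file_columns = [("id", "bigint"), ("price", "double"), ("name", "string")]
hive_table_columns = [("id", "bigint"), ("name", "string"), ("price", "double")]

for line in positional_mismatches(orc_file_columns, hive_table_columns):
    print(line)
```

If the function reports differences, either recreate the Hive table with the columns in the order the files use, or have the Spark job select the columns in the table's order before writing.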



Source: https://stackoverflow.com/questions/44558125/query-fails-on-presto-cli-for-a-table-created-in-hive-in-orc-format-with-data-re
