How to access the HIVE ACID table in Spark sql?

后端 未结 4 1622
暗喜
暗喜 2021-01-15 19:07

How could you access the HIVE ACID table, in Spark sql?

4条回答
  •  被撕碎了的回忆
    2021-01-15 19:52

    @aniket Spark doesn't support reading Hive Acid tables directly. (https://issues.apache.org/jira/browse/SPARK-15348/SPARK-16996) The data layout for transactional tables requires special logic to decide which directories to read and how to combine them correctly. Some data files may represent updates of previously written rows, for example. Also, if you are reading while something is writing to this table your read may fail (w/o the special logic) because it will try to read incomplete ORC files. Compaction may (again w/o the special logic) may make it look like your data is duplicated. It can be done (WIP) via LLAP - tracked in https://issues.apache.org/jira/browse/HIVE-12991

提交回复
热议问题