I want to do some \"near real-time\" data analysis (OLAP-like) on the data in a HDFS. My research showed that the three mentioned frameworks report significant performance g
Here is an answer of "How does Impala compare to Shark?" from Reynold Xin, the leader of the Shark development effort at UC Berkeley AMPLab.