Which database to choose (Cassandra, MongoDB, ?) for storing and querying event / log / metrics data?

后端 未结 3 2151
一整个雨季
一整个雨季 2021-02-14 09:29

In sql terms we\'re storing data like this:

table events (
  id
  timestamp
  dimension1
  dimension2
  dimension3
  etc.
)

All dimension value

3条回答
  •  抹茶落季
    2021-02-14 10:20

    "Group by" and "stupidly fast" do not go together. That's just the nature of that beast... Hence the limitations on Mongo's group operation; Cassandra doesn't even support it natively (although it does for Hive or Pig queries via Hadoop... but those are not intended to be stupidly fast).

    Systems like Twitter's Rainbird (which uses Cassandra) doing realtime analytics do it by denormalizing/pre-computing the counts: http://www.slideshare.net/kevinweil/rainbird-realtime-analytics-at-twitter-strata-2011

提交回复
热议问题