I\'m fairly new to Objectify, and I had a quick question for the best way to do something:
Lets say I have an application that allows people to send and receive messag
This is copied from my response on the objectify-appengine google group: https://groups.google.com/forum/?fromgroups#!topic/objectify-appengine/LlOyRJZRbnk
There are three main options when dealing with "aggregation data" like what you describe:
1) Calculate it when you need it
You've concluded, rightly I think, that this is too expensive.
2) Calculate it at batch intervals and store this result
Not very satisfying since it involves a delay. Plus you don't want to comb through your entire database every night.
3) Update the aggregation when the data changes
This approach involves a little more work every time the data changes, but it's almost certainly what you want to do.
Create some sort of collection of contacts for each user. When a message arrives, make sure a sender contact exists for that recipient. Maybe you also want to delete the contact when the recipient deletes the last message from a sender.
Be careful not to bump into entity group transaction rate limits (one write per second). I'll walk you through some options:
1) You could store a list of contacts in each recipient:
class Person {
@Id Long id;
Set> contacts;
}
This would be a distinct problem if, say, the recipient received mail from 20 new people all at once. This is almost certainly a bad idea. On the other hand, it's blazingly fast and efficient to look up who your contacts are. A minor improvement would be to move this into a separate entity parented by the person so you aren't always loading that data:
class Contacts {
@Parent Key owner;
@Id long id = 1; // there's only ever one of these per person, and it should have a predictable key for fetching
Set> contacts;
}
Of course, the Set in a single entity gives you a 50,000 entry limit. It might be slightly smaller than this if you hit the 1M entity size limit first. If your keys are ~20 chars, it'll be about the same. If this is an issue you could allow multiple Contacts entities, at which point you have something that looks like the Relation Index Entity pattern from Brett Slatkin's 2009 Google I/O talk: http://www.youtube.com/watch?v=AgaL6NGpkB8
2) You could store a list of contacts in the other direction
class Person {
@Id Long id;
@Index Set> contactOf;
}
This makes it a bit more expensive to find out who your contacts are - you need a keys-only query, not a simple get-by-key. But you aren't really limited by the entity write rate anymore. People probably don't send more than one message per second, and if they send out 1000 messages in bulk, you can update the contactOf in a single transaction.
As above, you probably want to move this index into a separate entity:
class Contacts {
@Parent Key person;
@Id long id = 1; // there's only ever one of these per person, and it should have a predictable key for fetching
Set> of;
}
3) You could also store these contacts in a completely separate entity
class Contact {
@Parent Key person;
@Id Long id;
@Index Key owner;
}
This is really just a less-space-efficient way of doing solution #2.
The important thing is to keep updating this structure when every message is sent or received.