问题
The graph size is in the billions of nodes, and tens of billions of vertices.
It will store webpages urls, and links between webpages and it will be used for testing ranking algorithms.
Any language is fine but java is prefered.
Solutions i found so far:
- neo4j
- storing in sorted flat files
Yes, i have already read Best Way to Store/Access a Directed Graph.
Update
The data can be distributed on multiple computers and does not need to be fully in-memory.
回答1:
Depending on your implementation, another solution could be Terracotta. I think supports object graphs of this magnitude using a distributed virtual heap.
http://www.terracotta.org/web/display/docs/Concept+and+Architecture+Guide#ConceptandArchitectureGuide-VirtualHeap
来源:https://stackoverflow.com/questions/1526479/how-to-store-a-large-directed-unweighted-graph-with-billions-of-nodes-and-vertic