Data structure for relationships

后端 未结 2 787
隐瞒了意图╮
隐瞒了意图╮ 2021-01-01 08:27

I am converting a VB6 to C# and I want to make my data structure that holds values and relationships more efficient. In VB I have a collection of values and another collecti

相关标签:
2条回答
  • 2021-01-01 08:39

    It sounds like what you have is a Graph. This is a structure with Nodes and Edges. There are many many libraries and tools that deal with Graphs. Microsoft even put out a paper on how to deal with them. I think graphs are great and extremely useful in many situations.

    One big benefit with graphs is the ability to assign priorities to the edges between the nodes. Then when you want to find the path between two nodes, boom, the graph can choose the path with the ideal priority.

    In your situation, your values are the nodes and your relationships are the edges.

    0 讨论(0)
  • 2021-01-01 08:57

    You need to ask yourself (and tell us) what kind of pattern of use you expect. Do these relations get added in order or randomly, do yours queries come in order (as you show them) or randomly, and is this essentially a batch process -- load them up, read off the queries -- or do you expect to do it "on line" in the sense that you may add some, then query some, then add some more and query some more?

    Will you know how many you want to store beforehand, and how many do you expect to store? Dozens? Thousands? Tens of millions?

    Here are some suggestions:

    • if you know beforehand how many you expect to store, it's not a really big number, you don't expect to add them after first loading up, there aren't any duplicates in the left-hand side of the pair, and they're reasonably "dense" in the sense that there aren't big gaps between numbers in the left-hand one of the pair, then you probably want an array. Insertion is O(1), access is O(1), but can't have duplicate indices and expanding it after you build it is a pain.
    • if the number is really large, like > 108, you probably want some kind of database. Databases are relatively very slow -- 4 to 5 orders of magnitude greater than in-memory data structures -- but handle really big data.
    • If you have insertions after the first load, and you care about order, you're going to want some sort of tree, like a 2-3 tree. Insertion and access both O(lg n). You'd probably find an implmentation under a name like "ordered list" (I'm not a C# guy.)
    • Most any other case, you probably want a hash. Average insertion and access both O(1), like an array; worst case [which you won't hit with this data] is O(n)
    0 讨论(0)
提交回复
热议问题