Java large datastructure for storing a matrix

后端 未结 9 1964
北恋
北恋 2021-01-13 02:17

I need to store a 2d matrix containing zip codes and the distance in km between each one of them. My client has an application that calculates the distances which are then s

相关标签:
9条回答
  • 2021-01-13 03:05

    Stephen C. has a good point: if the distances are as-the-crow-flies, then you could probably save memory by doing some calculations on the fly. All you'd need is space for the longitude and latitude for 952 zip codes and then you could use the vicenty formula to do your calculation when you need to. This would make your memory usage O(n) in zipcodes.

    Of course, that solution makes some assumptions that may turn out to be false in your particular case, i.e. that you have longitude and latitude data for your zipcodes and that you're concerned with as-the-crow-flies distances and not something more complicated like driving directions.

    If those assumptions are true though, trading a few computes for a whole bunch of memory might help you scale in the future if you ever need to handle a bigger dataset.

    0 讨论(0)
  • 2021-01-13 03:06

    A 2d array would be more memory efficient. You can use a small hashmap to map the 952 places into a number between 0 and 951 . Then, just do:

    float[][] distances= new float[952][952];
    

    To look things up, just use two hash lookups to convert the two places into two integers, and use them as indexes into the 2d array.

    By doing it this way, you avoid the boxing of floats, and also the memory overhead of the large hashmap.

    However, 906304 really isn't that many entries, you may just need to increase the Xmx maximum heap size

    0 讨论(0)
  • 2021-01-13 03:09

    I upvoted Chi's and Benjamin's answers, because they're telling you what you need to do, but while I'm here, I'd like to stress that using the hashcode of the two strings directly will get you into trouble. You're likely to run into the problem of hash collisions.

    This would not be a problem if you were concatenating the two strings (being careful to use a delimiter which cannot appear in the place designators), and letting HashMap do its magic, but the method you suggested, using the hashcodes for the two strings as a key, that's going to get you into trouble.

    0 讨论(0)
提交回复
热议问题