I have a server process implemented in haskell that acts as a simple in-memory db. Client processes can connect then add and retrieve data. The service uses more memory than I
I have a theory. My theory is that your program is using a lot of something like ByteStrings
. My theory is that because the main content of ByteStrings
is malloc
ated, they are not displayed while profiling. Thus you could run out of heap without the largest content of your heap showing up on the profiling graph.
To make matters even worse, when you grab substrings of ByteStrings
, they by default retain the pointer to the originally allocated block of memory. So even if you are trying to only store a small fragement of some ByteString
you could end up retaining the whole of the originally allocated ByteString
and this won't show up on your heap profile.
That is my theory anyways. I don't know enough facts about how GHC's heap profiler works nor about how ByteStrings
are implemented to know for certain. Maybe someone else can chime in and confirm or dispute my theory.
Edit2: tibbe notes that the buffer used by ByteString
s are pinned. So if you are allocating/freeing lots of small Bytestring
s, you can fragment your heap meaning you run out of useable heap with half of it unallocated.
Edit: JaffaCake tells me that sometimes the heap profiler will not display the memory allocated by ByteStrings.