Does gcc, icc, or Microsoft's C/C++ compiler support or know anything about NUMA?
If I have a multi-processor board that has cache-coherent non-uniform memory access ( NUMA ) , i.e. separate "northbridges" with separate RAM for each processor, does any compiler know how to automatically spread the data across the different memory systems such that processes working on local threads are mostly retrieving their data from the RAM associated with the processor the thread is running on? I have a setup where 1 GB is attached to processor 0, 1 GB is attached to processor 1, et c. up to 4 processors. In the coherent memory space the physical memory for the RAM on the 1st processor