To be precise, I only need to increase a double by another double and want it to be thread safe. I don\'t want to use mutex for that since the execution speed would dramatically
So use the integral atomic as a memory barrier. Here's a page with source and explanation: http://preshing.com/20121019/this-is-why-they-call-it-a-weakly-ordered-cpu/