Consider an array like atomic shared_array[]. What if you want to SIMD vectorize for(...) sum += shared_array[i].load(memory_order_relaxed
atomic shared_array[]
for(...) sum += shared_array[i].load(memory_order_relaxed