OpenMP implementation of reduction

后端未结

关注

 2  2033

小蘑菇 2021-01-29 02:52

I need to implement reduction operation (for each thread the value should be stored in different array entry). However, it runs slower for more threads. Any suggestions?

2条回答

借酒劲吻你 (楼主)

2021-01-29 03:25
Did you try to use reduction?
```
double global_sum = 0.0;
#pragma omp parallel for shared(h,n,a) reduction(+:global_sum) 
for (i = 1; i < n; i++) {
    global_sum += f(a  + i* h);
}
```
Howerver there may be a lot of other reasons why it runs slow. For example you should not create 16 threads if you have only 2 CPU cores and so on.
0 讨论(0)

查看其它2个回答
发布评论:

提交评论
- 加载中...