Python randomly drops to 0% CPU usage, causing the code to “hang up”, when handling large numpy arrays?

隐瞒了意图╮ 2021-01-12 17:04

I have been running some code, a part of which loads in a large 1D numpy array from a binary file, and then alters the array using the numpy.where() method.

Here is

3 Answers
  • 2021-01-12 17:04

    The drops in CPU usage were unrelated to Python or numpy; they were caused by reading from a shared disk, and network I/O was the real culprit. For arrays this large, reading into memory can be a major bottleneck.
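
    One way to confirm this kind of diagnosis is to time the raw file read separately from the numpy processing. The sketch below assumes a binary file of float64 values and uses a clamping step similar to the one discussed in this thread; the path, dtype, and exact `np.where` call are placeholders for the original setup, not the asker's actual code.

    ```python
    import time
    import numpy as np

    def timed_load(path, dtype=np.float64):
        """Load a 1D binary array and clamp it, timing each phase separately."""
        t0 = time.perf_counter()
        arr = np.fromfile(path, dtype=dtype)   # raw binary read (I/O-bound)
        t_read = time.perf_counter() - t0

        t0 = time.perf_counter()
        # Assumed processing step: clamp values >= 1.0 down to 1.0
        arr = np.where(arr >= 1.0, 1.0, arr)   # CPU/memory-bound
        t_proc = time.perf_counter() - t0

        return arr, t_read, t_proc
    ```

    If `t_read` dominates and fluctuates between runs while `t_proc` stays stable, the slowdown is in the I/O path (e.g. a shared or network disk), not in numpy.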

  • 2021-01-12 17:26

    Did you click inside or select text in the console window? On Windows this can "hang" the process: the console enters QuickEdit mode, which suspends the program until input is received. Pressing any key resumes the process.

  • 2021-01-12 17:27

    np.where is creating a copy there and assigning it back into arr. So, we could save memory by avoiding that copying step, like so -

    vol_avg = (np.sum(arr) - (arr[arr >= 1.0] - 1.0).sum())/(num**3)
    

    We are using boolean indexing to select the elements that are greater than or equal to 1.0, computing their offsets from 1.0, summing those up, and subtracting the result from the total sum. Hopefully the number of such exceeding elements is small, so this step won't incur any more noticeable memory requirement. I am assuming this hanging-up issue with large arrays is a memory-based one.
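
    To see that the two expressions agree, here is a small sketch under the assumption that the original code clamped values at 1.0 with something like `np.where(arr >= 1.0, 1.0, arr)` and that `num**3` is the total element count (a voxel grid); both are assumptions, since the question's code is not shown.

    ```python
    import numpy as np

    rng = np.random.default_rng(0)
    num = 8
    arr = rng.uniform(0.0, 2.0, num**3)

    # Copy-based version (assumed original): clamp values >= 1.0 to 1.0, then average.
    clamped = np.where(arr >= 1.0, 1.0, arr)
    vol_avg_copy = clamped.sum() / num**3

    # Copy-light version from this answer: subtract the total excess over 1.0
    # from the full sum, touching only the exceeding elements.
    vol_avg = (np.sum(arr) - (arr[arr >= 1.0] - 1.0).sum()) / num**3

    assert np.isclose(vol_avg, vol_avg_copy)
    ```

    Note that `arr[arr >= 1.0]` still allocates a copy, but only of the exceeding elements, so the peak memory is far below a full-size copy when few elements exceed the threshold.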
