Effect of increasing parallelism on throughput

百般思念 提交于 2019-12-13 03:53:57

问题


I ran a job first with Parallelism 1 and then with Parallelism 3. With Parallelism=1, the kafka source was reading records at rate ~500 records per second. With Parallelism=3, the throughput got divided among the three parallelisms, each reading approximately ~150 records per second. Note that the source is publishing records at a much higher rate (~1000 records per second).

Is this expected? I would imagine the throughput to increase with parallelism, but it is remaining the same. I checked the Backpressure status on the source, it was High.

Screenshots for reference:

Parallelism 1:

Parallelism 3:

来源:https://stackoverflow.com/questions/56601740/effect-of-increasing-parallelism-on-throughput

易学教程内所有资源均来自网络或用户发布的内容,如有违反法律规定的内容欢迎反馈
该文章没有解决你所遇到的问题?点击提问,说说你的问题,让更多的人一起探讨吧!