superscalar

What considerations go into predicting latency for operations on modern superscalar processors and how can I calculate them by hand?

六眼飞鱼酱① 提交于 2019-11-26 11:25:40
I want to be able to predict, by hand, exactly how long arbitrary arithmetical (i.e. no branching or memory, though that would be nice too) x86-64 assembly code will take given a particular architecture, taking into account instruction reordering, superscalarity, latencies, CPIs, etc. What / describe the rules must be followed to achieve this? I think I've got some preliminary rules figured out, but I haven't been able to find any references on breaking down any example code to this level of detail, so I've had to take some guesses. (For example, the Intel optimization manual barely even

What considerations go into predicting latency for operations on modern superscalar processors and how can I calculate them by hand?

依然范特西╮ 提交于 2019-11-26 02:02:14
问题 I want to be able to predict, by hand, exactly how long arbitrary arithmetical (i.e. no branching or memory, though that would be nice too) x86-64 assembly code will take given a particular architecture, taking into account instruction reordering, superscalarity, latencies, CPIs, etc. What / describe the rules must be followed to achieve this? I think I\'ve got some preliminary rules figured out, but I haven\'t been able to find any references on breaking down any example code to this level