I am working on a very low level part of the application in which performance is critical.
While investigating the generated assembly, I noticed the following instructio
Actually, this is not something specific to the lea
instruction.
This type of addressing is called Scaled Addressing Mode. The multiplication is achieved by a bit shift, which is trivial:
You could do 'scaled addressing' with a mov
too, for example (note that this is not the same operation, the only similarity is the fact that ebx*4
represents an address multiplication):
mov edx, [esi+4*ebx]
(source: http://www.cs.virginia.edu/~evans/cs216/guides/x86.html#memory)
For a more complete listing, see this Intel document. Table 2-3 shows that a scaling of 2, 4, or 8 is allowed. Nothing else.
Latency (in terms of number of cycles): I don't think this should be affected at all. A shift is a matter of connections, and selecting between three possible shifts is the matter of 1 multiplexer worth of delay.