Why does shifting 0xff left by 24 bits result in an incorrect value?

问题

I would like to shift 0xff left by 3 bytes and store it in a uint64_t, which should work as such:

uint64_t temp = 0xff << 24;

This yields a value of 0xffffffffff000000 which is most definitely not the expected 0xff000000.

However, if I shift it by fewer than 3 bytes, it results in the correct answer.

Furthermore, trying to shift 0x01 left by 3 bytes does work.

Here's my output:

0xff shifted by 0 bytes: 0xff
0x01 shifted by 0 bytes: 0x1
0xff shifted by 1 bytes: 0xff00
0x01 shifted by 1 bytes: 0x100
0xff shifted by 2 bytes: 0xff0000
0x01 shifted by 2 bytes: 0x10000
0xff shifted by 3 bytes: 0xffffffffff000000
0x01 shifted by 3 bytes: 0x1000000

With some experimentation, left shifting works up to 3 bits for each uint64_t up to 0x7f, which yields 0x7f000000. 0x80 yields 0xffffffff80000000.

Does anyone have an explanation for this bizarre behavior? 0xff000000 certainly falls within the 2⁶⁴ - 1 limits of uint64_t.

回答1:

I suspect the behavior is compiler dependent, but I am seeing the same thing.

The fix is simple. Be sure to cast the 0xff to a uint64_t type BEFORE performing the shift. That way the compiler will handle it as the correct type.

uint64_t temp = uint64_t(0xff) << 24

回答2:

Does anyone have an explanation for this bizarre behavior?

Yes, type of operation always depend on operand types and never on result type:

double r = 1.0 / 2.0; 
    // double divided by double and result double assigned to r
    // r == 0.5

double r = 1.0 / 2; 
    // 2 converted to double, double divided by double and result double assigned to r
    // r == 0.5

double r = 1 / 2; 
    // int divided by int, result int converted to double and assigned to r
    // r == 0.0

When you understand and remenber this you would not fall on this mistake again.

回答3:

Shifting left creates a negative (32 bit) number which then gets filled to 64 bits.

Try

0xff << 24LL;

回答4:

Let's break your problem up into two pieces. The first is the shift operation, and the other is the conversion to uint64_t.

As far as the left shift is concerned, you are invoking undefined behavior on 32-bit (or smaller) architectures. As others have mentioned, the operands are int. A 32-bit int with the given value would be 0x000000ff. Note that this is a signed number, so the left-most bit is the sign. According to the standard, if you the shift affects the sign-bit, the result is undefined. It is up to the whims of the implementation, it is subject to change at any point, and it can even be completely optimized away if the compiler recognizes it at compile-time. The latter is not realistic, but it is actually permitted. While you should never rely on code of this form, this is actually not the root of the behavior that puzzled you.

Now, for the second part. The undefined outcome of the left shift operation has to be converted to a uint64_t. The standard states for signed to unsigned integral conversions:

If the destination type is unsigned, the resulting value is the smallest unsigned value equal to the source value modulo 2n where n is the number of bits used to represent the destination type.

That is, depending on whether the destination type is wider or narrower, signed integers are sign-extended[footnote 1] or truncated and unsigned integers are zero-extended or truncated respectively.

The footnote clarifies that sign-extension is true only for two's-complement representation which is used on every platform with a C++ compiler currently.

Sign-extension means just that everything left of the sign bit on the destination variable will be filled with the sign-bit, which produces all the fs in your result. As you noted, you could left shift 0x7f by 3-bytes without this occurring, That's because 0x7f=0b01111111. After the shift, you get 0x7f000000 which is the largest signed int, ie the largest number that doesn't affect the sign bit. Therefore, in the conversion, a 0 was extended.

Converting the left operand to a large enough type solves this.

uint64_t temp = uint64_t(0xff) << 24

来源：https://stackoverflow.com/questions/46246411/why-does-shifting-0xff-left-by-24-bits-result-in-an-incorrect-value

标签

c++

bit-manipulation

bit-shift