I just started reading Hacker\'s Delight and it defines abs(-231) as -231. Why is that?
I tried printf(\"%x\", abs(0x80000000))
on a f
because it uses the neg instruction to perform this operation.
In the Art of Assembly language programming book they said like this.
If the operand is zero, its sign does not change, although this clears the carry flag. Negating any other value sets the carry flag. Negating a byte containing -128, a word containing -32,768, or a double word containing -2,147,483,648 does not change the operand, but will set the overflow flag. Neg always updates the A, S, P, and Z flags as though you were using the sub instruction
source :http://www.arl.wustl.edu/~lockwood/class/cs306/books/artofasm/Chapter_6/CH06-2.html#HEADING2-313 So it will set the overflow flag and be silently.That's the reason.
Because integers are stored in memory as a two's complement binary number, the positive version of the minimum value overflows back to negative.
That is to say (in .NET, but still applies):
int.MaxValue + 1 == int.MinValue // Due to overflow.
And
Math.Abs((long)int.MinValue) == (long)int.MaxValue + 1
Actually, in C, the behavior is undefined. From the C99 standard, §7.20.6.1/2:
The
abs
,labs
, andllabs
functions compute the absolute value of an integerj
. If the result cannot be represented, the behavior is undefined.
and its footnote:
The absolute value of the most negative number cannot be represented in two’s complement.
0x8000.. is stored as 10000.... (binary). This is known as twos complement, which means that the highest bit (the one at the left) is used to store the sign of the value and negative values are stored with negative binary - 1. The abs() function now checks the signbit, sees that it is set and computes the positive value.
Now this is a negative number again which we didn't want, the reason is a overflow, try the number 0x9000... which is 10010...
With this number the overflow is stopped by the 0 bit on the right
The representation of a two's complement number has the most significant bit as a negative number. 0x80000000 is 1 followed by 31 zeroes, the first 1 represents -2^31 not 2^31. Therefore there is no way to represent 2^31 as the highest positive number is 0x7FFFFFFF, which is 0 followed by 31 ones, which equals 2^31-1.
abs(0x80000000) is therefore undefined in the two's complement since it is too large, due to this the machine just gives up and gives you 0x80000000 again. Typically at least.
Obviously, mathematically, |−231| is 231. If we have 32 bits to represent integers, we can represent at most 232 numbers. If we want a representation that is symmetric about 0, we have a few decisions to make.
For the following, as in your question, I am assuming 32-bit wide numbers. At least one bit pattern must be used for 0. So that leaves us with 232−1 or less bit patterns for the rest of the numbers. This number is odd, so we can either have a representation that's not exactly symmetric about zero, or have one number be represented with two different representations.
0x80000000
is "negative zero" (i.e., zero), and 0x00000000
is "positive zero" or regular zero. In this scheme, the most positive number is 0x7fffffff
(2147483647) and the most negative number is 0xffffffff
(−2147483647). This scheme has the advantage that it is easy for us to "decode", and that it is symmetric. This scheme has a disadvantage in that calculating a + b
when a
and b
are of different signs is a special case, and has to be dealt with specially.0x7fffffff
(2147483647), and the maximum negative number is 0x80000000
(−2147483647). There are again two representations of 0: positive zero is 0x00000000
and negative zero is 0xffffffff
. This scheme also has issues with calculations involving negative numbers.1
to it. In this scheme, there is only one 0, namely 0x00000000
. The most positive number is 0x7fffffff
(2147483647) and the most negative number is 0x80000000
(−2147483648). There is an asymmetry in this representation. The advantage of this scheme is that one doesn't have to deal with special cases for negative number. The representation takes care of giving you the right answer as long as the result doesn't overflow. For this reason, most of the current hardware represents integers in this representation.In two's complement representation, there is no way to represent 231. In fact, if you look at your compiler's limits.h
or equivalent file, you might see a definition for INT_MIN
in such a way:
#define INT_MIN (-2147483647 - 1)
This done rather than
#define INT_MIN -2147483648
because 2147483648 is too large to fit in an int
in a 32-bit two's complement representation. By the time the unary minus operator "gets" the number to operate on, it is too late: overflow has already occurred and you can't fix it.
So, to answer your original question, the absolute value of the most negative number in a two's complement representation cannot be represented in that encoding. Also, from the above, to get from a negative value to a positive value in two's complement representation, you take its ones' complement and then add 1. So, for 0x80000000
:
1000 0000 0000 0000 0000 0000 0000 0000 original number
0111 1111 1111 1111 1111 1111 1111 1111 ones' complement
1000 0000 0000 0000 0000 0000 0000 0000 + 1
you get the original number back.