I read this Questions about Java's String pool and understand the basic concept of string pool but still don\'t understand the behavior.
First: it works if you
Strings are guaranteed to be pooled when you call String.intern()
on a string.
String s1 = "abcd".intern();
String s2 = "abc";
s2 += "d";
s2 = s2.intern();
s1 == s2 // returns true
When compiler sees a constant it's smart enough to optimize and pool the string literal, i.e.:
String s1 = "abcd";
String s2 = "abcd";
s1 == s2 // returns true
Java Language Specification states:
Each string literal is a reference (§4.3) to an instance (§4.3.1, §12.5) of class String (§4.3.3). String objects have a constant value. String literals-or, more generally, strings that are the values of constant expressions (§15.28)-are "interned" so as to share unique instances, using the method String.intern.
So in the case of s2 += "d"
, compiler wasn't as clever as you are and just pooled "d"
.
I'm not sure about this, so this is pretty much speculation, but I suspect that there may be some compiler trickery going on in the first example (where it's inline and pretty obvious what's going on), but it's not clever enough to pull it off in the second example (where it's not so obvious).
If I'm right, either the compiler sees "a" + "bc"
and simply compresses that down at compile time to "abc"
or it's seeing the two lines and pooling the strings because it realizes they will be used. I'm betting on the former..
Not all strings necessarily get pooled.
I think what happens here is: 1. for String s1 = "a" + "bc"; String s2 = "ab" + "c"; Java compiler is smart enough to know that the literal value of s1 and s2 are the same, so the compiler points them to the same literal value in the string pool
there is no way the compiler know if s1 and s2 would end up being the same value, At runtime, unless you call String.intern(), jvm won't check the string literal pool to see if the value is already there.
See the documentation for String#intern(). The last line there states:
All literal strings and string-valued constant expressions are interned.
Your +=
example is neither a literal string nor a string-valued constant expression, so it is not put in the String pool.
The compiler can perform constant evaluation but not in the case where you modify the values
Try instead following and see what happens if you drop final
from either variable.
final String s1 = "abc";
final String s2 = "abc";
System.out.println("s1 == s2? " + (s1 == s2));
String s3 = s1 + "d";
String s4 = s2 + "d";
System.out.println("s3 == s4? " + (s3 == s4));
This is my guess:
String s1 = "a" + "bc"; String s2 = "ab" + "c";
I think that are compile time these are determined to produce the same string and so only one object is made for both.
But when you add "d" to both of them, this is done separately for both strings (since it's done during real time, there could be things like exceptions interrupting it etc, so it can't pre-do it) and so it doesn't automatically make them reference one object.