I need to remove a doubled letter from a string using regex operations in java. Eg: PRINCEE -> PRINCE APPLE -> APLE
Like this:
final String str = "APPLEE";
String replaced = str.replaceAll("(.)\\1", "$1");
System.out.println(replaced);
Output:
APLE
As @Jim comments correctly, the above matches any double character, not just letters. Here are a few variations that just match letters:
// the basics, ASCII letters. these two are equivalent:
str.replaceAll("([A-Za-z])\\1", "$1");
str.replaceAll("(\\p{Alpha})\\1", "$1");
// Unicode Letters
str.replaceAll("(\\p{L})\\1", "$1");
// anything where Character.isLetter(ch) returns true
str.replaceAll("(\\p{javaLetter})\\1", "$1");
For additional reference:
Character.isXyz(char)
enables a pattern named
\p{javaXyz}
(mind the
capitalization). This mechanism is
described in the Pattern
javadocs\p
and
\P
constructs as in Perl. \p{prop}
matches if the input has the
property prop, while \P{prop}
does
not match if the input has that
property. This mechanism is also
described in the Pattern
javadocs