Negate characters in Regular Expression

狂风中的少年 提交于 2019-11-27 14:34:39

The caret inside of a character class [^ ] is the negation operator common to most regular expression implementations (Perl, .NET, Ruby, Javascript, etc). So I'd do it like this:

[^\W\s\d]
  • ^ - Matches anything NOT in the character class
  • \W - matches non-word characters (a word character would be defined as a-z, A-Z, 0-9, and underscore).
  • \s - matches whitespace (space, tab, carriage return, line feed)
  • \d - matches 0-9

Or you can take another approach by simply including only what you want:

[A-Za-z]

The main difference here is that the first one will include underscores. That, and it demonstrates a way of writing the expression in the same terms that you're thinking. But if you reverse you're thinking to include characters instead of excluding them, then that can sometimes result in an easier to read regular expression.

It's not completely clear to me which special characters you don't want. But I wrote out both solutions just in case one works better for you than the other.

In Perl, it would be something like:

$string !~ /[\d \W]/

Of course, it depends on your definition of "special characters". \W matches all non-word characters. A word character is any alphanumeric character plus the space character.

The most important thing to learn about regular expressions is not their syntax, but the ability to clearly explain what you are looking for. That's really 90% of the problem.
Also, it's usually better to state what you want rather than what you don't want.

Try ^[^0-9\p{P} ]$

lei
var StringInputToClean = @"[:(M)?*a',\y<>&a#~%{}+.@\\ /27!;$+]";

var pattern = @"[^a-zA-Z0-9\s]";

string replacement = "";

var result = Regex.Replace(StringInputToClean, pattern, replacement);
易学教程内所有资源均来自网络或用户发布的内容,如有违反法律规定的内容欢迎反馈
该文章没有解决你所遇到的问题?点击提问,说说你的问题,让更多的人一起探讨吧!