Utf8 correct regex for CamelCase (WikiWord) in perl
问题 Here was a question about the CamelCase regex. With the combination of tchrist post i'm wondering what is the correct utf-8 CamelCase . Starting with (brian d foy's) regex: / \b # start at word boundary [A-Z] # start with upper [a-zA-Z]* # followed by any alpha (?: # non-capturing grouping for alternation precedence [a-z][a-zA-Z]*[A-Z] # next bit is lower, any zero or more, ending with upper | # or [A-Z][a-zA-Z]*[a-z] # next bit is upper, any zero or more, ending with lower ) [a-zA-Z]* #