How can I remove characters such as punctuation, commas, and dashes from a string in a multibyte-safe manner?
I will be working with input from many different languages.
I used this:
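// Replace each run of characters that are neither letters nor numbers with a space.
$clean = preg_replace( "/[^\p{L}\p{N}]+/u", " ", $raw );
// Collapse any run of two or more Unicode separators into a single space.
$clean = preg_replace( "/\p{Z}{2,}/u", " ", $clean );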
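For what it's worth, here is a minimal sketch of the same idea wrapped in a function. The name strip_punctuation is made up for illustration, and including \p{M} (combining marks) is my own assumption, so that accents stored as separate code points are not split off from their base letters. Inside a character class a `|` is a literal pipe rather than alternation, so the property classes are simply listed next to each other.

```php
<?php
// Hypothetical helper; the name and the \p{M} addition are assumptions,
// not part of the original snippet.
function strip_punctuation(string $raw): string
{
    // Replace every run of characters that are not letters (\p{L}),
    // combining marks (\p{M}), or numbers (\p{N}) with one space.
    // The /u modifier makes PCRE treat pattern and subject as UTF-8.
    $clean = preg_replace('/[^\p{L}\p{M}\p{N}]+/u', ' ', $raw);

    // preg_replace() returns null on error, e.g. when the input
    // is not valid UTF-8 and /u is in effect.
    if ($clean === null) {
        return '';
    }

    // Drop the leading/trailing space left by punctuation at the edges.
    return trim($clean);
}

echo strip_punctuation("Hello, world!"), "\n";   // Hello world
echo strip_punctuation("日本語、テスト。"), "\n";   // 日本語 テスト
```

Because the first pattern already replaces a whole run with a single space, a second pass to collapse repeated spaces should not be needed in this version. Returning an empty string on invalid UTF-8 is just one possible choice; you may prefer to throw or to inspect preg_last_error() instead.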