How can I use Unicode-aware regular expressions in JavaScript?
For example, there should be something akin to \\w that can match any code-point in Lette
\\w
This will do it:
/[A-Za-z\u00C0-\u00FF ]+/.exec('hipopótamo maçã pólen ñ poção água língüa')
It explicitly selects a range of unicode characters. It will work for latin characters, but other strange characters may be out of this range.