I want to write a regular expression that matches each word in a sentence:
My regular expression: "\b(\w+)\b"
Result:
You can do it with a function that translates the Arabic characters in the pattern. It's very simple to do.
For example:
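function escapeArabicRange(regexStr) {
  // Swap the literal range endpoints for escape sequences, so the pattern
  // source no longer contains right-to-left characters.
  regexStr = regexStr.replace("\u0600", "\\u0600"); // U+0600, first code point of the Arabic block
  regexStr = regexStr.replace("ۿ", "\\u06FF"); // U+06FF, last code point of the Arabic block
  return regexStr;
}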
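Another idea is to write placeholders such as [alf] and [ya] in the pattern, so the text direction displays correctly in your editor, and to replace them with the real characters afterwards:
var regexStr = "[[alf]-[ya]]";
function expandPlaceholders(regexStr) {
  // Swap each placeholder for the code point it stands for.
  regexStr = regexStr.replace("[alf]", "\u0600");
  regexStr = regexStr.replace("[ya]", "\u06FF");
  return regexStr;
}
// Build the actual regular expression from the expanded string.
var arabicRegex = new RegExp(expandPlaceholders(regexStr));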
I'd suggest this (Unicode block syntax, as supported by Java's regex engine, for example):
\p{InArabic}
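JavaScript does not accept \p{InArabic}, but engines that support ES2018 Unicode property escapes offer \p{Script=Arabic} when the u flag is set. Here is a minimal sketch under that assumption; the function name matchArabicWords is made up for illustration, and \p{M} is added because several Arabic diacritics are classified as script "Inherited" rather than "Arabic":

```javascript
// Hypothetical helper: returns every run of Arabic-script characters in the text.
// Assumes an ES2018+ engine (property escapes require the `u` flag).
function matchArabicWords(text) {
  // \p{Script=Arabic} matches characters of the Arabic script;
  // \p{M} also accepts combining marks such as vowel signs.
  var wordRegex = /[\p{Script=Arabic}\p{M}]+/gu;
  // String.prototype.match returns null when nothing matches.
  return text.match(wordRegex) || [];
}
```

Calling matchArabicWords on a mixed Latin/Arabic sentence should return only the Arabic words, one array entry per run.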
Try this:
function HasArabicCharacters(text)
{
  var arregex = /[\u0600-\u06FF]/;
  alert(arregex.test(text));
}
Arabic character ranges as a single pattern:
[\u0600-\u06ff]|[\u0750-\u077f]|[\ufb50-\ufc3f]|[\ufe70-\ufefc]
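To get back to the original question of matching each word, the four ranges can be merged into one character class and repeated with +. This is only a sketch: extractArabicWords is a hypothetical name, and the class also contains Arabic punctuation and digits from those ranges, so a trailing Arabic comma would stay attached to its word.

```javascript
// Hypothetical helper: extracts runs of characters from the ranges listed above.
function extractArabicWords(text) {
  // One character class holding all four ranges, repeated with + and applied globally.
  var wordRegex = /[\u0600-\u06ff\u0750-\u077f\ufb50-\ufc3f\ufe70-\ufefc]+/g;
  // String.prototype.match returns null when nothing matches.
  return text.match(wordRegex) || [];
}
```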
As of Unicode 6.1, the Arabic script is contained in the following blocks:
Arabic (0600—06FF, 225 characters)
Arabic Supplement (0750—077F, 48 characters)
Arabic Extended-A (08A0—08FF, 39 characters)
Arabic Presentation Forms-A (FB50—FDFF, 608 characters)
Arabic Presentation Forms-B (FE70—FEFF, 140 characters)
Rumi Numeral Symbols (10E60—10E7F, 31 characters)
Arabic Mathematical Alphabetic Symbols (1EE00—1EEFF, 143 characters)
Contents are taken from Wikipedia: "Arabic script in Unicode".
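The last two blocks lie above U+FFFF, so a plain \uXXXX range cannot express them. One possible way to cover all seven blocks is to build the character class from a table and use \u{...} escapes, which need the u flag (ES2015 or later). The names ARABIC_BLOCKS, buildArabicRegex and containsArabic below are illustrative, not from any library:

```javascript
// Block ranges copied from the list above (start and end code points).
var ARABIC_BLOCKS = [
  [0x0600, 0x06FF],   // Arabic
  [0x0750, 0x077F],   // Arabic Supplement
  [0x08A0, 0x08FF],   // Arabic Extended-A
  [0xFB50, 0xFDFF],   // Arabic Presentation Forms-A
  [0xFE70, 0xFEFF],   // Arabic Presentation Forms-B
  [0x10E60, 0x10E7F], // Rumi Numeral Symbols
  [0x1EE00, 0x1EEFF]  // Arabic Mathematical Alphabetic Symbols
];

// Builds a regex such as /[\u{600}-\u{6ff}...]/u from the table.
function buildArabicRegex() {
  var classBody = ARABIC_BLOCKS.map(function (range) {
    return "\\u{" + range[0].toString(16) + "}-\\u{" + range[1].toString(16) + "}";
  }).join("");
  // The `u` flag enables \u{...} escapes and code points above U+FFFF.
  return new RegExp("[" + classBody + "]", "u");
}

// Returns true if the text contains at least one character from any block.
function containsArabic(text) {
  return buildArabicRegex().test(text);
}
```

Note that a block range includes unassigned code points too, so this tests block membership rather than whether a code point is an assigned Arabic character.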