I\'m trying to extract a specific information from different html pages. Basically the information is a 10 digits number which may have different forms such :
000
Consider other delimiters besides hyphens, not to mention parentheses.
(?:1\s*?[-.]?\s*)?(?:\(\s*d{3}\s*\)|d{3})\s*?[-.]?\s*\d{3}\s*?[-.]?\s*\d{4}\b
Okay, maybe that's more comprehensive than you need, but really this can get as complicated as you like. You can expand it to look for international phone numbers, extensions, and so forth, but that might not be worth it for you.