Ok, so i\'m working on a regular expression to search out all the header information in a site.
I\'ve compiled the regular expression:
regex = re.compile
This question has been asked in several forms over the last few days, so I'm going to say this very clearly.
Use BeautifulSoup, html5lib or lxml.html. Please.