I scrapped some html via xpath, that I then converted into an etree. Something similar to this:
text1 link text2 <
element.xpath('normalize-space()') also works.