Python: Extracting specific data with html parser
问题 I started using the HTMLParser in Python to extract data from a website. I get everything I wanted, except the text within two tags of HTML. Here is an example of the HTML tag: <a href="http://wold.livingsources.org/vocabulary/1" title="Swahili" class="Vocabulary">Swahili</a> There are also other tags starting with . They have other attributes and values and therefore I do not want to have their data: <a href="http://wold.livingsources.org/contributor#schadebergthilo" title="Thilo Schadeberg"