I am trying to learn how to parse HTML, but as I don't have a lot of experience in either Java or Android, it's a little complicated. I have read the IBM XML parsing tutor
IMO there are two easy ways to parse HTML:
Alternatively, if you want to write your own parser (which I doubt you should for homework, since implementing it properly and completely would be long and complicated), see the specs for parsing HTML.
Check out the following HTML parsers. There are more out there. Maybe one will work for you:
HTMLCleaner: http://htmlcleaner.sourceforge.net/
TagSoup: http://ccil.org/~cowan/XML/tagsoup/
Jericho: http://jericho.htmlparser.net/docs/index.html
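To give an idea of what using one of these looks like, here is a minimal sketch that collects the href of every link through the standard SAX interfaces. It relies on TagSoup's `org.ccil.cowan.tagsoup.Parser` being a SAX `XMLReader`, so it can be passed in where the sketch takes a reader. `LinkCollector` and `extractLinks` are names made up for this example, and the `main` method uses the JDK's built-in XML parser only so the snippet runs without extra jars. That parser accepts only well-formed markup, which is exactly the limitation TagSoup is meant to remove.

```java
import java.io.StringReader;
import java.util.ArrayList;
import java.util.List;
import javax.xml.parsers.SAXParserFactory;
import org.xml.sax.Attributes;
import org.xml.sax.InputSource;
import org.xml.sax.XMLReader;
import org.xml.sax.helpers.DefaultHandler;

// Hypothetical helper (not part of any library): collects href values of <a> tags.
class LinkCollector extends DefaultHandler {
    private final List<String> links = new ArrayList<>();

    @Override
    public void startElement(String uri, String localName, String qName, Attributes atts) {
        // Namespace-aware readers fill in localName; others only provide qName.
        String name = (localName == null || localName.isEmpty()) ? qName : localName;
        if ("a".equalsIgnoreCase(name)) {
            String href = atts.getValue("href");
            if (href != null) {
                links.add(href);
            }
        }
    }

    // Works with any SAX XMLReader. For real-world (possibly malformed) HTML,
    // pass in new org.ccil.cowan.tagsoup.Parser() with the TagSoup jar on the classpath.
    static List<String> extractLinks(XMLReader reader, String html) throws Exception {
        LinkCollector handler = new LinkCollector();
        reader.setContentHandler(handler);
        reader.parse(new InputSource(new StringReader(html)));
        return handler.links;
    }

    public static void main(String[] args) throws Exception {
        // Stand-in reader from the JDK; it only accepts well-formed markup.
        XMLReader reader = SAXParserFactory.newInstance().newSAXParser().getXMLReader();
        String html = "<html><body><a href=\"a.html\">A</a></body></html>";
        System.out.println(extractLinks(reader, html)); // prints [a.html]
    }
}
```

Because the handler only depends on the SAX callbacks, swapping the JDK reader for the TagSoup one should not require changing anything else in the sketch.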