parse meta tags in Java

终归单人心 2021-01-22 21:51

I have a collection of HTML documents for which I need to parse the contents of the <meta> tags in the <head> section. These are the only HTML tags whose values I'm

3 answers
  • 2021-01-22 22:32

    You can likely use the Jericho HTML Parser. In particular, have a look at this to see how you can go about finding specific tags.

  • 2021-01-22 22:51

    If it suits your application you can use Tidy to convert HTML to valid XML, and then use as much XPath as you like!

  • 2021-01-22 22:52

    JTidy should provide a good starting point for this.

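To make the Tidy-then-XPath suggestion concrete, here is a minimal sketch of the XPath half using only the JDK's built-in XML APIs. It assumes the HTML has already been converted to well-formed XML (for example by Tidy or JTidy) and that the result carries no DOCTYPE or namespace declaration. The class name `MetaTagExtractor` and the method `extractMeta` are hypothetical names chosen for illustration, not part of any library.

```java
import java.io.StringReader;
import java.util.LinkedHashMap;
import java.util.Map;
import javax.xml.parsers.DocumentBuilderFactory;
import javax.xml.xpath.XPathConstants;
import javax.xml.xpath.XPathFactory;
import org.w3c.dom.Document;
import org.w3c.dom.Element;
import org.w3c.dom.NodeList;
import org.xml.sax.InputSource;

// Hypothetical helper: collects name/content pairs from <meta> elements.
class MetaTagExtractor {

    // Assumption: xhtml is well-formed XML with no DOCTYPE and no namespace
    // declaration, e.g. the output of a prior Tidy/JTidy clean-up pass.
    static Map<String, String> extractMeta(String xhtml) throws Exception {
        Document doc = DocumentBuilderFactory.newInstance()
                .newDocumentBuilder()
                .parse(new InputSource(new StringReader(xhtml)));

        // Select only <meta> elements in <head> that have both attributes;
        // <meta http-equiv="..."> and <meta charset="..."> are skipped.
        NodeList metas = (NodeList) XPathFactory.newInstance()
                .newXPath()
                .evaluate("/html/head/meta[@name and @content]",
                          doc, XPathConstants.NODESET);

        // LinkedHashMap keeps the tags in document order.
        Map<String, String> result = new LinkedHashMap<>();
        for (int i = 0; i < metas.getLength(); i++) {
            Element meta = (Element) metas.item(i);
            result.put(meta.getAttribute("name"), meta.getAttribute("content"));
        }
        return result;
    }
}
```

Calling `MetaTagExtractor.extractMeta(cleanedXml)` returns a map from each meta tag's `name` to its `content`. If the cleaned document does declare the XHTML namespace, the XPath expression would need a namespace-aware parser and a `NamespaceContext`, which this sketch leaves out.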