parse meta tags in Java

后端未结

关注

 3  955

I have a collection of HTML documents for which I need to parse the contents of the tags in the section. These are the only HTML tags whose values I\'m

相关标签:

3条回答

别跟我提以往

2021-01-22 22:32

You can likely use the Jericho HTML Parser. In particular, have a look at this to see how you can go about finding specific tags.

0 讨论(0)
发布评论:

提交评论
- 加载中...
情深已故

2021-01-22 22:51

If it suits your application you can use Tidy to convert HTML to valid XML, and then use as much XPath as you like!

0 讨论(0)
发布评论:

提交评论
- 加载中...
有刺的猬

2021-01-22 22:52

JTidy should provide a good starting point for this.

0 讨论(0)
发布评论:

提交评论
- 加载中...