jTidy and TagSoup documentation

Submitted by 泪湿孤枕 on 2019-12-10 04:24:46

Question


I'm looking for documentation (official documentation, if possible) for the TagSoup and jTidy libraries.

I want to use these libraries to manipulate HTML "tag soup" files in which XML tags from different namespaces are mixed in among the HTML (HTML, XHTML or HTML5) tags.

I have tested HTMLCleaner, NekoHTML and Jericho, but I can't find documentation for jTidy and TagSoup, apart from the simplest examples of cleaning up a file.

I need documentation on manipulating content, replacing tags, extracting information, and so on.

Thanks

Note: After testing all the options, I used StAX / Woodstox:

  • http://wiki.fasterxml.com/WoodstoxHome

  • https://en.wikipedia.org/wiki/StAX

  • https://docs.oracle.com/javase/tutorial/jaxp/stax/using.html
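
For anyone landing here with the same need, this is a minimal sketch of what "extract info" and "replace tags" can look like with the standard StAX API (javax.xml.stream) that Woodstox implements. It is not taken from the TagSoup or jTidy documentation. The class name StaxSketch, the namespace urn:example:data and the element names item and entry are all made up for the illustration. StAX only accepts well-formed XML, so the sketch assumes the input has already been cleaned up into XHTML; real tag soup would need a cleaning pass first.

```java
import java.io.StringReader;
import java.io.StringWriter;
import java.util.ArrayList;
import java.util.List;
import javax.xml.namespace.QName;
import javax.xml.stream.XMLEventFactory;
import javax.xml.stream.XMLEventReader;
import javax.xml.stream.XMLEventWriter;
import javax.xml.stream.XMLInputFactory;
import javax.xml.stream.XMLOutputFactory;
import javax.xml.stream.XMLStreamException;
import javax.xml.stream.events.StartElement;
import javax.xml.stream.events.XMLEvent;

class StaxSketch {

    // Hypothetical namespace, used only for this illustration.
    static final String DATA_NS = "urn:example:data";

    // Collects the text of every text-only element with the given namespace and local name.
    static List<String> extractText(String xml, String ns, String local) throws XMLStreamException {
        XMLEventReader reader = XMLInputFactory.newInstance().createXMLEventReader(new StringReader(xml));
        QName wanted = new QName(ns, local); // QName equality ignores the prefix
        List<String> out = new ArrayList<>();
        while (reader.hasNext()) {
            XMLEvent e = reader.nextEvent();
            if (e.isStartElement() && e.asStartElement().getName().equals(wanted)) {
                out.add(reader.getElementText());
            }
        }
        return out;
    }

    // Copies the document event by event, renaming every `from` element to `to`.
    static String renameElement(String xml, QName from, QName to) throws XMLStreamException {
        XMLEventReader reader = XMLInputFactory.newInstance().createXMLEventReader(new StringReader(xml));
        StringWriter sink = new StringWriter();
        XMLEventWriter writer = XMLOutputFactory.newInstance().createXMLEventWriter(sink);
        XMLEventFactory events = XMLEventFactory.newInstance();
        while (reader.hasNext()) {
            XMLEvent e = reader.nextEvent();
            if (e.isStartElement() && e.asStartElement().getName().equals(from)) {
                StartElement s = e.asStartElement();
                // Keep the original attributes and namespace declarations.
                e = events.createStartElement(to.getPrefix(), to.getNamespaceURI(),
                        to.getLocalPart(), s.getAttributes(), s.getNamespaces());
            } else if (e.isEndElement() && e.asEndElement().getName().equals(from)) {
                e = events.createEndElement(to.getPrefix(), to.getNamespaceURI(), to.getLocalPart());
            }
            writer.add(e);
        }
        writer.close();
        return sink.toString();
    }

    public static void main(String[] args) throws XMLStreamException {
        String xml = "<html xmlns=\"http://www.w3.org/1999/xhtml\" xmlns:d=\"" + DATA_NS + "\">"
                + "<body><p>Hello</p><d:item id=\"1\">first</d:item>"
                + "<d:item id=\"2\">second</d:item></body></html>";
        System.out.println(extractText(xml, DATA_NS, "item"));
        System.out.println(renameElement(xml,
                new QName(DATA_NS, "item", "d"), new QName(DATA_NS, "entry", "d")));
    }
}
```

Matching on QName rather than on the raw tag name is what keeps the custom-namespace elements apart from the XHTML ones, whatever prefix the file happens to use.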


Answer 1:


The answer to a similar question on the tagsoup-friends Google group may help:

Documentation for TagSoup

You've probably already seen them, but the javadoc for JTidy is available here: http://jtidy.sourceforge.net/apidocs/index.html



Source: https://stackoverflow.com/questions/4452566/jtidy-and-tagsoup-documentation
