Which is the best HTML tidy pack? Is there any option in HTML agility pack to make HTML webpage tidy?

我只是一个虾纸丫 提交于 2019-11-27 03:05:30

问题


I am using html agility pack to parse html tabular information. Now there is some html content with missing ending tags and from such page because of missing ending tags html agility pack does not parse information properly.So I want to insert ending tags where there are missing ending tags so html agility pack parse information properly. So to insert the missing ending tags what should I do ?Should I do write my own code for that or use html tidy pack to do that ?

If html tidy pack then which is the best html tidy pack,and how to use it any example if possible ? And if my own code than what it can be like ?

Is there any option in html agility pack which can make us able to first make the html page tidy and then parse the webpage.


回答1:


In Html Agility Pack I could not find any option that make html page tidy.There is one option that inserts the missing closing tags but it works in some html page only.That Option in html agility pack is,

  HtmlAgilityPack.HtmlDocument doc = new HtmlAgilityPack.HtmlDocument();
      doc.OptionFixNestedTags=true;

I have also tried regex for that but it also work for some html page only.

So I found the best html tidy pack is :

http://www.devx.com/dotnet/Article/20505/1763/page/2.

We can see there : how to import the dll and how to use that tidy pack, there is sample code also available. It is great at all.It can insert the missing closing tags and makes your html page tidy .

Thanks for helping everyone..



来源:https://stackoverflow.com/questions/2490765/which-is-the-best-html-tidy-pack-is-there-any-option-in-html-agility-pack-to-ma

易学教程内所有资源均来自网络或用户发布的内容,如有违反法律规定的内容欢迎反馈
该文章没有解决你所遇到的问题?点击提问,说说你的问题,让更多的人一起探讨吧!