HTMLAgilityPack - Remove Node with out stripping the inner text

混江龙づ霸主 提交于 2019-12-20 01:49:09

问题


My html content is

<a href="#asdf">asdf</a>
<H5 align="left"><A href="#d570525d497.htm#toc">Table 
of Contents</A><br></H5>

I'm using HTML Agility Pack to load the html. I want to find <a> nodes and remove the node without removing its inner text as mentioned below

asdf
<H5 align="left">Table 
of Contents<br></H5>

I'm using below code,

var htmlDocument = new HtmlDocument();
htmlDocument.LoadHtml(htmlPage);
var Nodes = htmlDocument.DocumentNode.SelectNodes("//a");
foreach (var Node in Nodes)
{
    Node.InnerText.Trim();
}

It's not working. Something wrong with the code?


回答1:


Remove the node from parent while keeping the grand children

foreach (var Node in Nodes)
{
    Node.ParentNode.RemoveChild(Node, true); //<-- keepGrandChildren
}

var newhtml = htmlDocument.DocumentNode.OuterHtml;

output:

asdf
<h5 align="left">Table 
of Contents<br></h5>


来源:https://stackoverflow.com/questions/18734828/htmlagilitypack-remove-node-with-out-stripping-the-inner-text

易学教程内所有资源均来自网络或用户发布的内容,如有违反法律规定的内容欢迎反馈
该文章没有解决你所遇到的问题?点击提问,说说你的问题,让更多的人一起探讨吧!