问题
My html content is
<a href="#asdf">asdf</a>
<H5 align="left"><A href="#d570525d497.htm#toc">Table
of Contents</A><br></H5>
I'm using HTML Agility Pack to load the html. I want to find <a>
nodes and remove the node without removing its inner text as mentioned below
asdf
<H5 align="left">Table
of Contents<br></H5>
I'm using below code,
var htmlDocument = new HtmlDocument();
htmlDocument.LoadHtml(htmlPage);
var Nodes = htmlDocument.DocumentNode.SelectNodes("//a");
foreach (var Node in Nodes)
{
Node.InnerText.Trim();
}
It's not working. Something wrong with the code?
回答1:
Remove the node from parent while keeping the grand children
foreach (var Node in Nodes)
{
Node.ParentNode.RemoveChild(Node, true); //<-- keepGrandChildren
}
var newhtml = htmlDocument.DocumentNode.OuterHtml;
output:
asdf
<h5 align="left">Table
of Contents<br></h5>
来源:https://stackoverflow.com/questions/18734828/htmlagilitypack-remove-node-with-out-stripping-the-inner-text