C# Html Agility Pack ( SelectSingleNode )

做~自己de王妃 提交于 2019-12-13 13:01:08

问题


I'm trying to parse this field, but can't get it to work. Current attempt:

var name = doc.DocumentNode.SelectSingleNode("//*[@id='my_name']").InnerHtml;


<h1 class="bla" id="my_name">namehere</h1>

Error: Object reference not set to an instance of an object.

Appreciate any help.

@John - I can assure that the HTML is correctly loaded. I am trying to read my facebook name for learning purposes. Here is a screenshot from the Firebug plugin. The version i am using is 1.4.0.

http://i54.tinypic.com/kn3wo.jpg

I guess the problem is that profile_name is a child node or something, that's why I'm not able to read it?


回答1:


The reason your code doesn't work is because there is JavaScript on the page that is actually writing out the <h1 id='profile_name'> tag, so if you're requesting the page from a User Agent (or via AJAX) that doesn't execute JavaScript then you won't find the element.

I was able to get my own name using the following selector:

string name = 
    doc.DocumentNode.SelectSingleNode("//a[@id='navAccountName']").InnerText;



回答2:


Try this:

var name = doc.DocumentNode.SelectSingleNode("//@id='my_name'").InnerHtml;



回答3:


HtmlAgilityPack.HtmlNode name = doc.DocumentNode.SelectSingleNode("//h1[@id='my_name']").InnerText;



回答4:


 public async Task<List<string>> GetAllTagLinkContent(string content)
    {


        string html = string.Format("<html><head></head><body>{0}</body></html>", content);
        HtmlDocument doc = new HtmlDocument();
        doc.LoadHtml(html);
        var nodes = doc.DocumentNode.SelectNodes("//[@id='my_name']");
        return nodes.ToList().ConvertAll(r => r.InnerText).Select(j => j).ToList();

    }

It's ok with ("//a[@href]"); You can try it as above.Hope helpful



来源:https://stackoverflow.com/questions/3818459/c-sharp-html-agility-pack-selectsinglenode

易学教程内所有资源均来自网络或用户发布的内容,如有违反法律规定的内容欢迎反馈
该文章没有解决你所遇到的问题?点击提问,说说你的问题,让更多的人一起探讨吧!