html-agility-pack | 易学教程

NullReferenceException with HtmlDocument reference in C#

阅读更多关于 NullReferenceException with HtmlDocument reference in C#

问题 I am using HtmlAgilityPack in order to scrape information off of Google Translate for a translation program. I have downloaded the HtmlAgilityPack dll, and successfully referenced it in my program. I am using Assembly in Unity. Below is my code for the two programs: using UnityEngine; using System.Collections; using System; using System.Collections.Generic; using System.Linq; using System.Text; using System.Net; using HtmlAgilityPack; public class GUIScript : MonoBehaviour { private string

C# Render HTML to get element's height

阅读更多关于 C# Render HTML to get element's height

问题 I need to create an HTML page from a C# code with HTML Agility Pack. What I want is that if the element I'm trying to write cannot fit into a div with a specific height, it must create a new div and close the other one (like a paging if you want). My big concern is how to know the height of my element inside C#, I assume I'm force to render it somewhere with the CSS associated, but how? 回答1: One way to do this is using an HTML renderer like WebKit.NET. You would then be able to call the

C# Html Agility Pack ( SelectSingleNode )

阅读更多关于 C# Html Agility Pack ( SelectSingleNode )

问题 I'm trying to parse this field, but can't get it to work. Current attempt: var name = doc.DocumentNode.SelectSingleNode("//*[@id='my_name']").InnerHtml; <h1 class="bla" id="my_name">namehere</h1> Error: Object reference not set to an instance of an object. Appreciate any help. @John - I can assure that the HTML is correctly loaded. I am trying to read my facebook name for learning purposes. Here is a screenshot from the Firebug plugin. The version i am using is 1.4.0. http://i54.tinypic.com

c# how to download html which loads using ajax

阅读更多关于 c# how to download html which loads using ajax

问题 now a days there are web pages which developed using some ajax based frameworks (dynamically or lazy loading). Just wondering if there is any way to download html contents of such pages as when i try to download using htmlAgilityPack but all i get is header and empty body part but when i try to inspect element then only i can see proper htmls/div but of that page when i try to look into view source i see empty body... is there any third party like htmlAgilityPack or any other way? 回答1: You

c# HtmlAgilityPack Wrapp taking several images, titles and links Windows Phone [closed]

阅读更多关于 c# HtmlAgilityPack Wrapp taking several images, titles and links Windows Phone [closed]

问题 Closed . This question needs details or clarity. It is not currently accepting answers. Want to improve this question? Add details and clarify the problem by editing this post. Closed 5 years ago . How to fetch the data on this page? I need to get the image and its corresponding title and display in a list box. Using class or not. I do not know how to do, please help me! Do not even know where to start! I've tried several things and none work, then turned into a mess, would someone guide me

HtmlAgilityPack produces missing closing tags in OuterHtml

阅读更多关于 HtmlAgilityPack produces missing closing tags in OuterHtml

问题 I am using HtmlAgilityPack to parse and manipulate html text. However it seems the DocumentNode.OuterHtml gives missing closing tags. To isolate the issue now I am doing nothing else just parse and get the OuterHtml (no manipulation): var document = new HtmlDocument(); document.LoadHtml(myHtml); result = document.DocumentNode.OuterHtml; Original: (myHtml) <!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Transitional//EN" "http://www.w3.org/TR/xhtml1/DTD/xhtml1-transitional.dtd"> <html xmlns="http

Extracting a table row with a particular attribute,using HTMLAGILITY pack

阅读更多关于 Extracting a table row with a particular attribute,using HTMLAGILITY pack

问题 Consider this piece of code: <tr> <td valign=top class="tim_new"><a href="/stocks/company_info/pricechart.php?sc_did=MI42" class="tim_new">3M India</a></td> <td class="tim_new" valign=top><a href='/stocks/marketstats/indcomp.php?optex=NSE&indcode=Diversified' class=tim>Diversified</a></td> I want to write a piece of code using HTMLAgility pack which would extract the link in the first line. using System; using System.Collections.Generic; using System.Linq; using System.Text; using

Is there anyway to use “BrowserSession” to download files? C#

阅读更多关于 Is there anyway to use “BrowserSession” to download files? C#

问题 I have a site that requires login before it lets you download files. Currently I am using the BrowserSession Class to login and do all the scraping required (at least for the most part). BrowserSession Class source at bottom of post: The download Links show up on the document nodes. But I don't know how to add download functionality to that class, and If I try to download them with a webclient it fails, I already had to heavily modify the BrowserSession class, (I should have Modified it as a

Not getting correct data from span

阅读更多关于 Not getting correct data from span

问题 I've been making a custom user handler for Jessecar's SteamBot, which is unrelated to the problem I'm having, but essentially what I've done, is I've made it so you can set the bot to play a specific game by App ID, and I've been using this to idle on games for Steam Trading Cards, the only issue is, the only way I can check if it's finished, is by checking its inventory and how many cards are supposed to drop, which isn't too much of a hassle, but the main reason I created this was for

Selecting nodes that have an attribute with spaces using HTMLAgilityPack

阅读更多关于 Selecting nodes that have an attribute with spaces using HTMLAgilityPack

问题 I'm trying to select all the li elements in from the following: <div> <ul> <li class="third left"> ... </li> <li class="third left"> ... </li> </ul> </div> I'm using the following XPath query but this returns nothing. Dim result As HtmlNodeCollection = htmlDoc.DocumentNode.SelectNodes("//li[@class='third left']") Are attributes with spaces treated differently? Using this tool the XPath works correctly. 回答1: As @SimonMourier said, your XPath is correct. I suggest you doing the following: