BeautifulSoup and ASP.NET/C#

前端 未结 3 1638
臣服心动
臣服心动 2021-02-05 20:56

Has anyone integrated BeautifulSoup with ASP.NET/C# (possibly using IronPython or otherwise)? Is there a BeautifulSoup alternative or a port that works nicely with ASP.NET/C#

相关标签:
3条回答
  • 2021-02-05 21:06

    You could try this although it currently has a few bugs:

    http://nsoup.codeplex.com/

    0 讨论(0)
  • 2021-02-05 21:13

    Html Agility Pack is a similar project, but for C# and .NET


    EDIT:

    To extract all readable text:

    document.DocumentNode.InnerText
    

    Note that this will return the text content of <script> tags.

    To fix that, you can remove all of the <script> tags, like this:

    foreach(var script in doc.DocumentNode.Descendants("script").ToArray())
        script.Remove();
    foreach(var style in doc.DocumentNode.Descendants("style").ToArray())
        style.Remove();
    

    (Credit: SLaks)

    0 讨论(0)
  • 2021-02-05 21:26

    I know this is quite old, but I decided to post this for future reference. I came across this searching for a similar solution.

    I found a library built on top of Html Agility Pack called scrapysharp

    I've used it in quite similar manner as I would BeautifulSoup https://bitbucket.org/rflechner/scrapysharp/wiki/Home

    0 讨论(0)
提交回复
热议问题