HTML Agility Pack HtmlDocument Show All Html?

I am using the following to get a web page which works fine

    public static HtmlDocument GetWebPageFromUrl(string url)
    {
        var hw = new HtmlWeb();
        return hw.Load(url);
    }

But how to I spit the entire contents of the HTML out from the HtmlDocument into a string?

I tried HtmlDocument.ToString() but that doesn’t give me all the HTML in the document? Any ideas?

Answers:

Thank you for visiting the Q&A section on Magenaut. Please note that all the answers may not help you solve the issue immediately. So please treat them as advisements. If you found the post helpful (or not), leave a comment & I’ll get back to you as soon as possible.

Method 1

DocumentNode.OuterHtml contains the full html:

HtmlAgilityPack.HtmlDocument doc = new HtmlAgilityPack.HtmlDocument();
doc.Load("sample.html");
string html = doc.DocumentNode.OuterHtml;

In your example:

public static string GetWebPageHtmlFromUrl(string url)
{
    var hw = new HtmlWeb();
    HtmlDocument doc = hw.Load(url);
    return doc.DocumentNode.OuterHtml;
}


All methods was sourced from stackoverflow.com or stackexchange.com, is licensed under cc by-sa 2.5, cc by-sa 3.0 and cc by-sa 4.0

0 0 votes
Article Rating
Subscribe
Notify of
guest

0 Comments
Oldest
Newest Most Voted
Inline Feedbacks
View all comments
0
Would love your thoughts, please comment.x
()
x