SgmlReader is a .NET library that is handy for converting SGML content (like HTML and OFX) into well formed XML via XmlReader, XmlDocument, XDocument or XPathDocument. It runs on Windows and Linux using Mono.
SgmlReader for Portable Library.
SgmlReader is "SGML" markup language parser, and derived from System.Xml.XmlReader in .NET CLR.
But, most popular usage the "HTML" parser. (It's scraper!!)
/* Use SgmlReader in Html parse mode. */
XDocument document = SgmlReader.Parse(stream);
Done!
ReadSharp is a PCL that extracts the main (meaningful) content of a website and returns the extracted content as HTML and it's associated title, as well as other useful information (images, icons, ...).