• HtmlAgilityPack

    By:

    Last Published: | Latest Version: 1.4.9.5

    This is an agile HTML parser that builds a read/write DOM and supports plain XPATH or XSLT (you actually don't HAVE to understand XPATH nor XSLT to use it, don't worry...). It is a .NET code library that allows you to parse "out of the web" HTML files. The parser is very tolerant with "real world" malformed HTML. The object model is very similar to... <a href="/packages/HtmlAgilityPack/1.4.9.5">More information</a>

  • Abot Web Crawler

    By:

    Last Published: | Latest Version: 1.5.1.67

    Abot is an open source C# web crawler built for speed and flexibility. It takes care of the low level plumbing (multithreading, http requests, scheduling, link parsing, etc..). You just register for events to process the page data. You can also plugin your own implementations of core interfaces to take complete control over the crawl process.

  • HtmlAgilityPack for .NET Core

    By:

    Last Published: | Latest Version: 1.5.0.1

    This is a port of HtmlAgilityPack library created by Simon Mourrier and Jeff Klawiter for .NET Core platform. This NuGet package supports can be used with Universal Windows Platform, ASP.NET 5 (using .NET Core) and full .NET Framework 4.6. Original description: This is an agile HTML parser that builds a read/write DOM and supports plain XPATH... <a href="/packages/HtmlAgilityPack.NetCore/1.5.0.1">More information</a>

  • AbotX Web Crawler

    By:

    Last Published: | Latest Version: 1.2.123

    A powerful C# web crawler that makes advanced crawling features easy to use. AbotX builds upon the open source Abot C# Web Crawler by providing a powerful set of wrappers and extensions.

  • SkyScraper

    By:

    Last Published: | Latest Version: 1.0.46

    Web scraper / crawler / spider. Supports robots protocol and user agent.

  • NCrawler

    By:

    Last Published: | Latest Version: 3.0.1

    Crawler and scrapping framework which is written in C#

  • NCrawler.HtmlProcessor

    By:

    Last Published: | Latest Version: 3.0.0

    HTML processing of crawled data for NCrawler

  • Spidy

    By:

    Last Published: | Latest Version: 1.0.0

    FSharp Web Crawler

  • MisterHexCrawler

    By:

    Last Published: | Latest Version: 1.0.0.5

    Simple web crawler that return IObservable using Reactive Extension(Rx) and async await.

  • NCrawler.EsentServices

    By:

    Last Published: | Latest Version: 3.0.0

    This is ESENT storage providers for the NCrawler

  • NCrawler.FileStorageServices

    By:

    Last Published: | Latest Version: 3.0.0

    Storage in file system for the NCrawler

  • NCrawler.EntityFramework

    By:

    Last Published: | Latest Version: 3.0.2

    Provides storing crawler data using EF for NCrawler

  • HtmlAgilityPack.Net45

    By:

    Last Published: | Latest Version: 2.0.20

    This is an agile HTML parser that builds a read/write DOM and supports plain XPATH or XSLT (you actually don't HAVE to understand XPATH nor XSLT to use it, don't worry...). It is a .NET code library that allows you to parse "out of the web" HTML files. The parser is very tolerant with "real world" malformed HTML. The object model is very similar to... <a href="/packages/HtmlAgilityPack.Net45/2.0.20">More information</a>

  • DevZH.HtmlAgilityPack

    By:

    Last Published: | Latest Version: 1.4.9.4-final

    HtmlAgilityPack for .NET Core

  • AbotCore

    By:

    Last Published: | Latest Version: 0.1.13-beta

    .NET Core port of sjdirect/abot. Abot is an open source C# web crawler built for speed and flexibility. It takes care of the low level plumbing (multithreading, http requests, scheduling, link parsing, etc..). You just register for events to process the page data. You can also plugin your own implementations of core interfaces to take complete... <a href="/packages/AbotCore/0.1.13-beta">More information</a>

  • HtmlAgilityPack for .NET Core

    By:

    Last Published: | Latest Version: 1.5.0.2

    This is a port of HtmlAgilityPack library created by Simon Mourrier and Jeff Klawiter for .NET Core platform. This NuGet package supports can be used with Universal Windows Platform, ASP.NET 5 (using .NET Core) and full .NET Framework 4.6. Original description: This is an agile HTML parser that builds a read/write DOM and supports plain XPATH or... <a href="/packages/HtmlAgilityPack.NetCoreCodePages/1.5.0.2">More information</a>

  • BCJobs.HtmlAgilityPack

    By:

    Last Published: | Latest Version: 1.4.11

    This is an agile HTML parser that builds a read/write DOM and supports plain XPATH or XSLT (you actually don't HAVE to understand XPATH nor XSLT to use it, don't worry...). It is a .NET code library that allows you to parse "out of the web" HTML files. The parser is very tolerant with "real world" malformed HTML. The object model is very similar to... <a href="/packages/BCJobs.HtmlAgilityPack/1.4.11">More information</a>

  • HtmlDexterityPack

    By:

    Last Published: | Latest Version: 1.5.0

    This is an agile HTML parser that builds a read/write DOM and supports plain XPATH or XSLT (you actually don't HAVE to understand XPATH nor XSLT to use it, don't worry...). It is a .NET code library that allows you to parse "out of the web" HTML files. The parser is very tolerant with "real world" malformed HTML. The object model is very similar to... <a href="/packages/HtmlDexterityPack/1.5.0">More information</a>

  • Crawler-Lib Engine Test Helper

    By:

    Last Published: | Latest Version: 2.3.5544.21265

    The Crawler-Lib Engine Test Helper simplifies the test of tasks. It can be used to develop unit tests and integration tests for tasks.

  • Crawler-Lib Engine

    By:

    Last Published: | Latest Version: 2.3.5544.21265

    The Crawler-Lib Engine is a general purpose workflow enabled task processor. It has evolved from a web crawler over data mining and information retrieval. It is throughput optimized and can perform thousands of tasks per second on standard hardware. Due to its workflow capabilities it allows to structure and parallelize even complex kind of work.... <a href="/packages/CrawlerLib.Engine/2.3.5544.21265">More information</a>