IronWebScraper 2023.1.22

.NET Standard 2.0
There is a newer version of this package available.
See the version list below for details.
dotnet add package IronWebScraper --version 2023.1.22
NuGet\Install-Package IronWebScraper -Version 2023.1.22
This command is intended to be used within the Package Manager Console in Visual Studio, as it uses the NuGet module's version of Install-Package.
<PackageReference Include="IronWebScraper" Version="2023.1.22" />
For projects that support PackageReference, copy this XML node into the project file to reference the package.
paket add IronWebScraper --version 2023.1.22
#r "nuget: IronWebScraper, 2023.1.22"
The #r directive can be used in F# Interactive and Polyglot Notebooks. Copy this into the interactive tool or the script's source code to reference the package.
// Install IronWebScraper as a Cake Addin
#addin nuget:?package=IronWebScraper&version=2023.1.22

// Install IronWebScraper as a Cake Tool
#tool nuget:?package=IronWebScraper&version=2023.1.22


Iron WebScraper - The C# WebScraping Library


Iron WebScraper is a C# web scraping library that lets developers simulate and automate human browsing behavior to extract content, files, and images from web applications as native .NET objects. Iron WebScraper manages politeness and multithreading in the background, so the developer's own application stays easy to understand and maintain.

Iron WebScraper can be used to migrate content from existing websites, build search indexes, and monitor changes to website structure and content.

Iron WebScraper allows developers to:

  • Read and extract structured content from web pages using the HTML DOM, JavaScript, XPath, and jQuery-style CSS selectors.
  • Run hundreds of simultaneous requests with fast multithreading.
  • Politely avoid overloading remote servers using IP/domain-level throttling, and optionally respect robots.txt.
  • Manage multiple identities, DNS, proxies, user agents, request methods, custom headers, cookies, and logins.
  • Use data exported from websites as native C# objects, which can be stored or used immediately.
  • Rely on managed exception handling everywhere except the developer's own code. Errors and captchas are retried automatically on failure.
  • Save, pause, resume, and autosave scrape jobs.
  • Use the built-in web cache for action replay, crash recovery, and querying existing web scrape data. Change scrape logic on the fly, then replay the job without internet traffic.
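The throttling and robots.txt behavior listed above is configured on the scraper itself. The following is a minimal sketch, not a definitive setup: the property names (MaxHttpConnectionLimit, RateLimitPerHost, OpenConnectionLimitPerHost, ObeyRobotsDotTxt, ThrottleMode) are assumed from the vendor's published tutorials and may differ between package versions, and the PoliteScraper class, the example.com URL, the h1 selector, and the numeric limits are hypothetical placeholders.

```csharp
using System;
using IronWebScraper;

// Hypothetical scraper used only to illustrate politeness settings.
public class PoliteScraper : WebScraper
{
    public override void Init()
    {
        // Assumed property names; check the API reference for your version.
        // Total number of open HTTP requests allowed at once.
        MaxHttpConnectionLimit = 80;
        // Minimum delay between requests to the same host.
        RateLimitPerHost = TimeSpan.FromMilliseconds(50);
        // Concurrent requests allowed per hostname or IP address.
        OpenConnectionLimitPerHost = 25;
        // Honor the target site's robots.txt rules.
        ObeyRobotsDotTxt = true;
        // Apply the limits above per domain host name.
        ThrottleMode = Throttle.ByDomainHostName;

        // Placeholder URL; replace with the site to scrape.
        Request("https://example.com/", Parse);
    }

    public override void Parse(Response response)
    {
        // Record the text of every top-level heading on the page.
        foreach (HtmlNode heading in response.Css("h1"))
        {
            Scrape(new ScrapedData() { { "Heading", heading.TextContentClean } });
        }
    }
}
```

Lower connection limits and a longer per-host delay trade scraping speed for a lighter load on the remote server.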

Iron WebScraper is cross-platform and compatible with:

  • .NET 6, .NET 5, .NET Core, .NET Standard, and .NET Framework
  • Windows, macOS, Linux, Docker, Azure, and AWS


Additionally, our API reference and full licensing information can easily be found on our website.

Using Iron WebScraper

Installing the IronWebScraper NuGet package is quick and easy. Run this command in the Package Manager Console:

PM> Install-Package IronWebScraper

Once installed, you can get started by adding a using IronWebScraper; directive to the top of your C# code. Here is an example:

using IronWebScraper;

namespace YourApp
{
    public class Program
    {
        private static void Main(string[] args)
        {
            // Create the scraper and start the scrape job.
            var scrapeJob = new BlogScraper();
            scrapeJob.Start();
        }
    }

    public class BlogScraper : WebScraper
    {
        // Called once when the job starts: configure the scraper and queue the first request.
        public override void Init()
        {
            LoggingLevel = LogLevel.All;
            Request("https://www.zyte.com/blog/", Parse);
        }

        // Called for each downloaded page.
        public override void Parse(Response response)
        {
            // Record the title of every post listed on the page.
            foreach (HtmlNode titleLink in response.Css(".oxy-post-title"))
            {
                string title = titleLink.TextContentClean;
                Scrape(new ScrapedData() { { "Title", title } });
            }

            // If a pagination link exists, queue the first one and parse it the same way.
            if (response.CssExists("div.oxy-easy-posts-pages > a[href]"))
            {
                string nextPage = response.Css("div.oxy-easy-posts-pages > a[href]")[0].Attributes["href"];
                Request(nextPage, Parse);
            }
        }
    }
}

Licensing & Support available

For code examples, tutorials, and documentation, visit https://ironsoftware.com/csharp/webscraper/

For support directly from our code team, email us at developers@ironsoftware.com. We offer licensing and extensive support for commercial deployment projects.

Product: Versions

.NET: net5.0, net5.0-windows, net6.0, net6.0-android, net6.0-ios, net6.0-maccatalyst, net6.0-macos, net6.0-tvos, net6.0-windows, net7.0, net7.0-android, net7.0-ios, net7.0-maccatalyst, net7.0-macos, net7.0-tvos, net7.0-windows
.NET Core: netcoreapp2.0, netcoreapp2.1, netcoreapp2.2, netcoreapp3.0, netcoreapp3.1
.NET Standard: netstandard2.0, netstandard2.1
.NET Framework: net461, net462, net463, net47, net471, net472, net48, net481
MonoAndroid: monoandroid
MonoMac: monomac
MonoTouch: monotouch
Tizen: tizen40, tizen60
Xamarin.iOS: xamarinios
Xamarin.Mac: xamarinmac
Xamarin.TVOS: xamarintvos
Xamarin.WatchOS: xamarinwatchos

NuGet packages

This package is not used by any NuGet packages.

GitHub repositories

This package is not used by any popular GitHub repositories.

Version Downloads Last updated
2023.2.2 205 2/27/2023
2023.1.22 1,224 1/19/2023
2021.11.0 19,511 10/29/2021
2021.9.0 819 8/24/2021
2021.6.5 560 6/7/2021
4.0.4.3 23,961 4/14/2017
4.0.4 1,124 3/30/2017

* Removes support for .NET Framework 4.5.2; the minimum supported version is now .NET Framework 4.6.2
* Adds support for .NET 6 and .NET 7
* Removes the Installation class; the license key can now be applied and checked for authenticity in the IronWebScraper.License class
* Fixes a bug where fetching a response from a website could encounter an error with unsupported security protocols in .NET Framework
* Fixes some potential deadlocks in async methods during response processing
* IntelliSense documentation improvements
* Package and DLL are now signed
* Removes the dependency on System.Drawing.Common; now uses IronSoftware.System.Drawing
* Fixes a bug where robots.txt on some websites might hang the application while starting the scraper