See the version list below for details.
dotnet add package IronWebScraper --version 220.127.116.11
NuGet\Install-Package IronWebScraper -Version 18.104.22.168
<PackageReference Include="IronWebScraper" Version="22.214.171.124" />
paket add IronWebScraper --version 126.96.36.199
#r "nuget: IronWebScraper, 188.8.131.52"
// Install IronWebScraper as a Cake Addin #addin nuget:?package=IronWebScraper&version=184.108.40.206 // Install IronWebScraper as a Cake Tool #tool nuget:?package=IronWebScraper&version=220.127.116.11
Iron WebScraper is a C# web scraping library, allowing developers to simulate & automate human browsing behavior to extract content, files & images from web applications as native .Net objects. Iron Web Scraper manages politeness & multithreading in the background, leaving a developer’s own application easy to understand & maintain.
Iron Web Scraper can be used to migrate content from existing websites as well as build search indexes and monitor website structure & content changes. It's functionality includes:
» Fast multi threading allows hundreds of simultaneous requests.
» Politely avoid over stalling remote servers using IP/domain level throttling & optionally respecting robots.txt
» Manage multiple identities, DNS, proxies, user agents, request methods, custom headers, cookies & logins.
» Data exported from websites becomes native C# objects which can be stored or used immediately.
» Exceptions managed in all but the developers own code. Errors and captchas auto retried on failure
» Save, pause, resume, autosave scrape jobs.
» Built in web cache allows for action replay, crash recovery, and querying existing web scrape data. Change scrape logic on the fly, then replay job without internet traffic.
Requires .NET 4.5.2 - Licensing & Support available for commercial deployments. For code examples, documentation & more visit http://ironsoftware.com/cshapr/webscraper. For support please email us at firstname.lastname@example.org.
|.NET Framework||net452 net46 net461 net462 net463 net47 net471 net472 net48 net481|
This package has no dependencies.
This package is not used by any NuGet packages.
This package is not used by any popular GitHub repositories.
Yield method changes to Scrape
Autosave is now instananeous, rather than scheduled
Performance on huge scrape jobs (1 Million+ pages improved)