ExcavatorSharp.WebScraper.x64
1.2.8
dotnet add package ExcavatorSharp.WebScraper.x64 --version 1.2.8
NuGet\Install-Package ExcavatorSharp.WebScraper.x64 -Version 1.2.8
<PackageReference Include="ExcavatorSharp.WebScraper.x64" Version="1.2.8" />
paket add ExcavatorSharp.WebScraper.x64 --version 1.2.8
#r "nuget: ExcavatorSharp.WebScraper.x64, 1.2.8"
// Install ExcavatorSharp.WebScraper.x64 as a Cake Addin
#addin nuget:?package=ExcavatorSharp.WebScraper.x64&version=1.2.8
// Install ExcavatorSharp.WebScraper.x64 as a Cake Tool
#tool nuget:?package=ExcavatorSharp.WebScraper.x64&version=1.2.8
ExcavatorSharp is a multi-threaded server for scraping web data. It converts HTML into structured data. The library can scrape multiple sites in parallel within a single running application. You can create scraping tasks and run data extraction on a schedule.
The library is designed for professional extraction and parsing of large volumes of data. Under the hood it supports CSS selectors and XPath, data export to .csv/.xlsx/.sql/.json, online data export, proxy servers, dynamic content crawling, interaction with the site via JavaScript, and much more. The library uses .NET Sockets and the Chromium Embedded Framework (CEF).
The library can be used separately as a crawler or as a parser. It supports the sitemap.xml and robots.txt formats, as well as gzip / deflate compression.
Attention! Only x64 builds are supported for the .NET 4.5.2 and 4.6 platforms. AnyCPU builds are not supported! You will NOT be able to run the library in an application built as AnyCPU. This restriction comes from CEF.
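As a minimal sketch of the x64 requirement above, a consuming project can pin its platform target in the project file. `PlatformTarget` is a standard MSBuild property. The surrounding `PropertyGroup` is a hypothetical excerpt, not taken from the package itself, and this assumes no other setting in your project overrides it.

```xml
<!-- Hypothetical excerpt from the consuming application's .csproj -->
<PropertyGroup>
  <!-- Build for x64 only; AnyCPU builds will not run with this package -->
  <PlatformTarget>x64</PlatformTarget>
</PropertyGroup>
```

In Visual Studio the same setting is available under the project's Build properties as "Platform target".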
Product | Versions |
---|---|
.NET Framework | net452 net46 net461 net462 net463 net47 net471 net472 net48 net481 |
- cef.redist.x64 (>= 79.1.36)
- cef.redist.x86 (>= 79.1.36)
- CefSharp.Common (>= 75.1.360)
- CefSharp.OffScreen (>= 75.1.360)
- EPPlus (<= 4.5.3.3)
- HtmlAgilityPack (>= 1.11.23)
- HtmlAgilityPack.CssSelectors (>= 1.0.2)
- Newtonsoft.Json (>= 12.0.3)
- RestSharp (>= 106.10.1)
NuGet packages
This package is not used by any NuGet packages.
GitHub repositories
This package is not used by any popular GitHub repositories.
Version | Downloads | Last updated |
---|---|---|
1.2.8 | 550 | 8/10/2020 |
1.2.7 | 347 | 8/10/2020 |
1.2.3 | 404 | 5/20/2020 |
1.2.2 | 367 | 5/10/2020 |
1.2.1 | 371 | 5/5/2020 |
1.2.0 | 396 | 4/30/2020 |
1.1.0 | 370 | 4/23/2020 |
1.0.53 | 365 | 4/12/2020 |
1.0.52 | 384 | 4/11/2020 |
1.0.51 | 384 | 4/11/2020 |
1.0.6 | 384 | 4/23/2020 |
1.0.5 | 367 | 4/11/2020 |
1.0.4 | 407 | 4/3/2020 |
1.0.3 | 381 | 2/12/2020 |
1.0.2 | 429 | 1/30/2020 |
1.0.1 | 394 | 1/30/2020 |
1.0.0 | 354 | 1/23/2020 |
1) Added the ability to extract data from iframe blocks
2) Added the ability to take a screenshot in project testing mode
3) Fixed known bugs and improved performance