Datafication.WebConnector
1.0.9
dotnet add package Datafication.WebConnector --version 1.0.9
NuGet\Install-Package Datafication.WebConnector -Version 1.0.9
This command is intended to be used within the Package Manager Console in Visual Studio, as it uses the NuGet module's version of Install-Package.
<PackageReference Include="Datafication.WebConnector" Version="1.0.9" />
For projects that support PackageReference, copy this XML node into the project file to reference the package.
<PackageVersion Include="Datafication.WebConnector" Version="1.0.9" />
<PackageReference Include="Datafication.WebConnector" />
For projects that support Central Package Management (CPM), copy this XML node into the solution Directory.Packages.props file to version the package.
paket add Datafication.WebConnector --version 1.0.9
The NuGet Team does not provide support for this client. Please contact its maintainers for support.
#r "nuget: Datafication.WebConnector, 1.0.9"
#r directive can be used in F# Interactive and Polyglot Notebooks. Copy this into the interactive tool or source code of the script to reference the package.
#:package Datafication.WebConnector@1.0.9
#:package directive can be used in C# file-based apps starting in .NET 10 preview 4. Copy this into a .cs file before any lines of code to reference the package.
#addin nuget:?package=Datafication.WebConnector&version=1.0.9
#tool nuget:?package=Datafication.WebConnector&version=1.0.9
The NuGet Team does not provide support for this client. Please contact its maintainers for support.
Datafication.WebConnector
Web scraping and HTML table extraction connector for Datafication DataBlocks.
Key Features
- HTML Table Extraction: Parse tables with automatic header detection
- CSS Selector Scraping: Extract structured data using CSS selectors
- Link Extraction: Filter and extract links with regex patterns
- Image Extraction: Detect lazy-loaded images, size filtering
- Page Metadata: Open Graph, Twitter Cards, JSON-LD extraction
- Browser Rendering: Puppeteer support for JavaScript-heavy SPAs
- Screenshot/PDF Export: Render DataBlocks as images or PDFs
Installation
dotnet add package Datafication.WebConnector
Quick Start
using Datafication.Connectors.WebConnector.Connectors;
// Extract HTML tables from a web page
var config = new HtmlTableConnectorConfiguration
{
Source = new Uri("https://example.com/data.html"),
TableSelector = "table.data-table",
FirstRowIsHeader = true
};
var connector = new HtmlTableConnector(config);
var tableData = await connector.GetDataAsync();
// Scrape with CSS selectors
var cssConfig = new CssSelectorConnectorConfiguration
{
Source = new Uri("https://example.com/products"),
Selector = ".product-card",
SubSelectors = new Dictionary<string, string>
{
{ "Name", "h2.title" },
{ "Price", ".price" }
}
};
Documentation
For samples, usage examples, and source code, visit our GitHub repository: github.com/DataficationSDK/Datafication.WebConnector
If you find this library useful, consider giving it a star!
For full API documentation and guides: datafication.co
License
Licensed under the Datafication SDK License Agreement.
- Free: <5 developers AND <$500K annual revenue
- Commercial license required: 5+ developers OR >$500K annual revenue
For licensing inquiries: support@datafication.co
| Product | Versions Compatible and additional computed target framework versions. |
|---|---|
| .NET | net8.0 is compatible. net8.0-android was computed. net8.0-browser was computed. net8.0-ios was computed. net8.0-maccatalyst was computed. net8.0-macos was computed. net8.0-tvos was computed. net8.0-windows was computed. net9.0 was computed. net9.0-android was computed. net9.0-browser was computed. net9.0-ios was computed. net9.0-maccatalyst was computed. net9.0-macos was computed. net9.0-tvos was computed. net9.0-windows was computed. net10.0 was computed. net10.0-android was computed. net10.0-browser was computed. net10.0-ios was computed. net10.0-maccatalyst was computed. net10.0-macos was computed. net10.0-tvos was computed. net10.0-windows was computed. |
Compatible target framework(s)
Included target framework(s) (in package)
Learn more about Target Frameworks and .NET Standard.
-
net8.0
- AngleSharp (>= 1.1.2)
- Datafication.Core (>= 1.0.9)
- PuppeteerSharp (>= 19.0.1)
NuGet packages
This package is not used by any NuGet packages.
GitHub repositories
This package is not used by any popular GitHub repositories.