Mayordomo.Web.Extractor
1.1.14
Use ReadableWeb
dotnet add package Mayordomo.Web.Extractor --version 1.1.14
NuGet\Install-Package Mayordomo.Web.Extractor -Version 1.1.14
<PackageReference Include="Mayordomo.Web.Extractor" Version="1.1.14" />
<PackageVersion Include="Mayordomo.Web.Extractor" Version="1.1.14" />
<PackageReference Include="Mayordomo.Web.Extractor" />
paket add Mayordomo.Web.Extractor --version 1.1.14
#r "nuget: Mayordomo.Web.Extractor, 1.1.14"
#:package Mayordomo.Web.Extractor@1.1.14
#addin nuget:?package=Mayordomo.Web.Extractor&version=1.1.14
#tool nuget:?package=Mayordomo.Web.Extractor&version=1.1.14
Mayordomo.Web.Extractor
Overview
Small, modular .NET 10 library and tools to extract article content, images and metadata from web pages. The solution contains extractors, abstractions, HTML parsing implementations, tests and benchmarks.
Projects
Mayordomo.Web.Extractor.Abstractions� public interfaces for extractors and processorsMayordomo.Web.Extractor� composition and higher-level servicesMayordomo.Web.Extractor.HtmlAgilityPack� HTML Agility Pack based extractor implementationMayordomo.Web.Extractor.AngleSharp� AngleSharp based extractor implementationMayordomo.Web.Extractor.Tests� unit testsMayordomo.Web.Extractor.Benchmarks� benchmark projectsMayordomo.Web.Extractor.TestConsole� sample/test console app
Requirements
- .NET 10 SDK
- Optional:
dotnet-efor other tooling only if needed for local tasks
Build
Restore and build all projects:
dotnet restore
dotnet build --configuration Release
Run tests
Run unit tests from solution root:
dotnet test
Run benchmarks
Benchmarks use BenchmarkDotNet. Run from the benchmark project directory:
dotnet run -c Release -p Mayordomo.Web.Extractor.Benchmarks
Package and publish
This repository includes a GitHub Actions workflow to pack and publish NuGet packages: .github/workflows/publish-nuget.yml.
The workflow builds and packs with a version based on commit count and pushes packages to NuGet when the NUGET_API_KEY secret is provided.
Dependency updates
Dependabot configuration is provided in .github/dependabot.yml to open weekly PRs for NuGet package updates.
Contributing
- Open issues or PRs for bugs and improvements
- Follow existing coding conventions in the repository
- Update or add tests for behavior changes
License
No license file included in the repository. Add a LICENSE file if you intend to open source this code.
Contact
For local development questions, run the sample console app Mayordomo.Web.Extractor.TestConsole or inspect tests in Mayordomo.Web.Extractor.Tests.
| Product | Versions Compatible and additional computed target framework versions. |
|---|---|
| .NET | net10.0 is compatible. net10.0-android was computed. net10.0-browser was computed. net10.0-ios was computed. net10.0-maccatalyst was computed. net10.0-macos was computed. net10.0-tvos was computed. net10.0-windows was computed. |
-
net10.0
- Mayordomo.Web.Extractor.Abstractions (>= 1.1.14)
- Mayordomo.Web.Extractor.HtmlAgilityPack (>= 1.1.14)
- Microsoft.Extensions.Configuration.Abstractions (>= 10.0.1)
- Microsoft.Extensions.Configuration.Binder (>= 10.0.1)
- Microsoft.Extensions.DependencyInjection (>= 10.0.1)
- Microsoft.Extensions.DependencyInjection.Abstractions (>= 10.0.1)
- Microsoft.Extensions.Http (>= 10.0.1)
- Microsoft.Extensions.Logging.Abstractions (>= 10.0.1)
- StackExchange.Redis (>= 2.10.1)
- Validated.Primitives (>= 1.1.67)
NuGet packages
This package is not used by any NuGet packages.
GitHub repositories
This package is not used by any popular GitHub repositories.
| Version | Downloads | Last Updated | |
|---|---|---|---|
| 1.1.14 | 370 | 12/16/2025 |
Initial public package with core extractors, HTML AgilityPack and AngleSharp implementations, unit tests and benchmarks.