Ananke.Documents 0.1.0

Ananke.Documents

Document extractors for the Ananke knowledge pipeline — IDocumentExtractor implementations for PDF and Markdown that feed into DocumentProcessor for ingestion, chunking, embedding, and vector storage.

Install

dotnet add package Ananke.Documents

Quick start

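using Ananke.Orchestration.Knowledge;
using Ananke.Documents;

// OpenAIEmbeddingModel ships in the Ananke.Orchestration.OpenAI package.
// The OPENAI_API_KEY variable name is an assumption; use whatever your setup provides.
var apiKey = Environment.GetEnvironmentVariable("OPENAI_API_KEY")
    ?? throw new InvalidOperationException("OPENAI_API_KEY is not set.");

var embeddingModel = OpenAIEmbeddingModel.Create(apiKey);
var knowledgeStore = new InMemoryKnowledgeStore(embeddingModel);

// Wire the extractors, chunker, and store into a single processor
var processor = new DocumentProcessor(
    new HttpClient(),
    [new PdfExtractor(), new MarkdownExtractor()],
    new SlidingWindowChunker(),
    knowledgeStore);

// Extract, chunk, embed, and store in one call
await using var pdf = File.OpenRead("design-patterns.pdf");
var result = await processor.ProcessAsync(pdf, "application/pdf", "design-patterns");
// result => "8 sections, 42 chunks stored"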

Extractors

| Class | Input | What it does |
| --- | --- | --- |
| PdfExtractor | application/pdf | Extracts text from PDF files using PdfPig, preserving headings, links, and structure as Markdown |
| MarkdownExtractor | text/markdown, text/plain | Parses Markdown structure into normalized sections suitable for chunking |

Both implement IDocumentExtractor — you can add your own by implementing the same interface.
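
As a rough illustration of what a custom extractor might look like, here is a sketch that turns CSV input into a Markdown table. The members of the real IDocumentExtractor are not shown on this page, so the sketch declares its own stand-in interface, ISketchExtractor. That interface, its CanHandle and ExtractAsync members, and the CsvSketchExtractor class are all hypothetical; check the actual interface in Ananke.Orchestration and adapt the method names and signatures before using anything like this.

```csharp
using System;
using System.IO;
using System.Linq;
using System.Text;
using System.Threading;
using System.Threading.Tasks;

// Hypothetical stand-in for illustration only. The real IDocumentExtractor in
// Ananke.Orchestration may declare different members.
public interface ISketchExtractor
{
    bool CanHandle(string contentType);
    Task<string> ExtractAsync(Stream content, CancellationToken ct = default);
}

// Hypothetical extractor: converts simple CSV into a Markdown table.
// Naive on purpose: it does not handle quoted fields or embedded commas.
public sealed class CsvSketchExtractor : ISketchExtractor
{
    public bool CanHandle(string contentType) =>
        string.Equals(contentType, "text/csv", StringComparison.OrdinalIgnoreCase);

    public async Task<string> ExtractAsync(Stream content, CancellationToken ct = default)
    {
        // Leave the stream open so the caller keeps ownership of it.
        using var reader = new StreamReader(content, Encoding.UTF8, leaveOpen: true);
        var text = await reader.ReadToEndAsync(ct);

        var lines = text.Split('\n',
            StringSplitOptions.RemoveEmptyEntries | StringSplitOptions.TrimEntries);

        var markdown = new StringBuilder();
        for (var i = 0; i < lines.Length; i++)
        {
            var cells = lines[i].Split(',', StringSplitOptions.TrimEntries);
            markdown.Append("| ").AppendJoin(" | ", cells).Append(" |\n");

            // The first row is treated as the header, so emit the separator row after it.
            if (i == 0)
                markdown.Append("| ").AppendJoin(" | ", cells.Select(_ => "---")).Append(" |\n");
        }
        return markdown.ToString();
    }
}
```

The sketch emits Markdown because the built-in extractors are described as producing Markdown for the chunker; whether a custom extractor must do the same is an assumption here.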

The pipeline

DocumentProcessor orchestrates the full ingest path:

Stream/URL → IDocumentExtractor → Markdown text
           → IDocumentChunker   → text chunks
           → IEmbeddingModel    → vector embeddings
           → IKnowledgeStore    → stored + indexed

The same processor works from agent tool calls, background jobs, admin scripts, or HTTP endpoints.
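
To make the chunking stage of the pipeline above more concrete, here is a generic character-based sliding window in plain C#. This page does not describe how SlidingWindowChunker actually splits text, so this is not its implementation; SlidingWindowSketch, Chunk, and the size and overlap parameters are hypothetical names chosen for the example.

```csharp
using System;
using System.Collections.Generic;

// Hypothetical helper for illustration; not part of Ananke.
public static class SlidingWindowSketch
{
    // Splits text into windows of up to `size` characters, where each window
    // starts `size - overlap` characters after the previous one.
    public static List<string> Chunk(string text, int size, int overlap)
    {
        if (size <= 0)
            throw new ArgumentOutOfRangeException(nameof(size));
        if (overlap < 0 || overlap >= size)
            throw new ArgumentOutOfRangeException(nameof(overlap));

        var chunks = new List<string>();
        var step = size - overlap;

        for (var start = 0; start < text.Length; start += step)
        {
            var length = Math.Min(size, text.Length - start);
            chunks.Add(text.Substring(start, length));

            // Stop once a window has reached the end of the text, so the tail
            // is not repeated in a shorter trailing chunk.
            if (start + length >= text.Length)
                break;
        }
        return chunks;
    }
}
```

Calling SlidingWindowSketch.Chunk("abcdefghij", 4, 2) yields the four chunks abcd, cdef, efgh, and ghij. The overlap keeps a sentence that straddles a boundary present in both neighbouring chunks, which is the usual reason for preferring a sliding window over fixed, disjoint slices.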

Requirements

  • Ananke.Orchestration (transitive) — provides IDocumentExtractor, DocumentProcessor, IKnowledgeStore, SlidingWindowChunker
  • PdfPig ≥ 0.1.13 (transitive)
  • Markdig ≥ 0.40.0 (transitive)

Related packages

| Package | What it adds |
| --- | --- |
| Ananke.Orchestration | Core knowledge pipeline: DocumentProcessor, IKnowledgeStore, InMemoryKnowledgeStore |
| Ananke.Orchestration.OpenAI | OpenAIEmbeddingModel for generating embeddings |
| Ananke.Qdrant | Qdrant-backed IKnowledgeStore for persistent, distributed storage |
| Ananke | Meta-package that includes everything |

Documentation

Full docs, demos, and architecture: github.com/sevensamurai/Ananke

License

Apache 2.0

Target frameworks

Compatible: net10.0.
Computed: net10.0-android, net10.0-browser, net10.0-ios, net10.0-maccatalyst, net10.0-macos, net10.0-tvos, net10.0-windows.

| Version | Downloads | Last Updated |
| --- | --- | --- |
| 0.1.0 | 39 | 3/3/2026 |