E5Embedding.Net 2.0.2

.NET 8.0

dotnet add package E5Embedding.Net --version 2.0.2

NuGet\Install-Package E5Embedding.Net -Version 2.0.2

This command is intended to be used within the Package Manager Console in Visual Studio, as it uses the NuGet module's version of Install-Package.

<PackageReference Include="E5Embedding.Net" Version="2.0.2" />

For projects that support PackageReference, copy this XML node into the project file to reference the package.

<PackageVersion Include="E5Embedding.Net" Version="2.0.2" />
                    

                            Directory.Packages.props

<PackageReference Include="E5Embedding.Net" />
                    

                            Project file

For projects that support Central Package Management (CPM), copy this XML node into the solution Directory.Packages.props file to version the package.

paket add E5Embedding.Net --version 2.0.2

The NuGet Team does not provide support for this client. Please contact its maintainers for support.

#r "nuget: E5Embedding.Net, 2.0.2"

#r directive can be used in F# Interactive and Polyglot Notebooks. Copy this into the interactive tool or source code of the script to reference the package.

#:package E5Embedding.Net@2.0.2

#:package directive can be used in C# file-based apps starting in .NET 10 preview 4. Copy this into a .cs file before any lines of code to reference the package.

#addin nuget:?package=E5Embedding.Net&version=2.0.2
                    

                            Install as a Cake Addin

#tool nuget:?package=E5Embedding.Net&version=2.0.2
                    

                            Install as a Cake Tool

The NuGet Team does not provide support for this client. Please contact its maintainers for support.

E5Embedding.Net

A high-performance .NET library for generating text embeddings using E5 models with ONNX Runtime. This library provides GPU acceleration support (CUDA/DirectML) with automatic CPU fallback, making it ideal for production environments.

Features

🚀 High Performance: Optimized ONNX Runtime inference with GPU acceleration
🎯 E5 Model Support: Built specifically for E5 embedding models
🔧 Flexible Tokenization: Supports both SentencePiece and BERT-style tokenizers
💻 GPU Acceleration: Automatic GPU detection with CUDA and DirectML support
🔄 Automatic Fallback: Seamless fallback to CPU if GPU initialization fails
📦 Easy Integration: Simple API for embedding single texts or batches
🎨 Production Ready: Comprehensive error handling and logging support

Installation

Install the package via NuGet:

dotnet add package E5Embedding.Net

Or via Package Manager:

Install-Package E5Embedding.Net

Quick Start

Basic Usage

using E5Embedding.Net;
using Microsoft.Extensions.Logging;

string testText = "This is a test text for embedding generation using E5 model.";

var config = new E5EmbeddingConfiguration
{
    OnnxModelPath = "C:/WorkSpace/test/model.onnx",
    SentencePieceModelFile = "C:/WorkSpace/test/sentencepiece.bpe.model",
    TokenizerConfigFile = "C:/WorkSpace/test/tokenizer_config.json",
    TokenizerJsonFile = "C:/WorkSpace/test/tokenizer.json",
    MaxSequenceLength = 512,
    Dimension = 1024,
    BatchSize = 16
};

// Logger is optional and can be null
// If you don't need logging, you can pass null: new OnnxEmbeddingService(config, null)
var loggerFactory = LoggerFactory.Create(builder => builder.AddConsole());
var logger = loggerFactory.CreateLogger<OnnxEmbeddingService>();

using var embeddingService = new OnnxEmbeddingService(config, logger);

var embedding = await embeddingService.EmbedAsync(testText);

Console.WriteLine($"Embedding dimension: {embedding.Length}");
Console.WriteLine($"[{string.Join(", ", embedding.Select(v => v.ToString("F6")))}]");

// Generate embeddings for multiple texts
var texts = new[] { "Text 1", "Text 2", "Text 3" };
var embeddings = await embeddingService.EmbedBatchAsync(texts);

Dependency Injection

using E5Embedding.Net;
using Microsoft.Extensions.DependencyInjection;

// In your Startup.cs or Program.cs
services.AddSingleton<E5EmbeddingConfiguration>(sp =>
{
    var configuration = sp.GetRequiredService<IConfiguration>();
    return new E5EmbeddingConfiguration
    {
        OnnxModelPath = configuration["E5:OnnxModelPath"],
        SentencePieceModelFile = configuration["E5:SentencePieceModelFile"],
        TokenizerConfigFile = configuration["E5:TokenizerConfigFile"],
        TokenizerJsonFile = configuration["E5:TokenizerJsonFile"],
        MaxSequenceLength = configuration.GetValue<int>("E5:MaxSequenceLength"),
        Dimension = configuration.GetValue<int>("E5:Dimension"),
        BatchSize = configuration.GetValue<int>("E5:BatchSize", 16)
    };
});

services.AddSingleton<IEmbeddingService>(sp =>
{
    var config = sp.GetRequiredService<E5EmbeddingConfiguration>();
    var logger = sp.GetService<ILogger<OnnxEmbeddingService>>();
    return new OnnxEmbeddingService(config, logger);
});

Configuration

E5EmbeddingConfiguration Properties

Property	Type	Description	Default
`OnnxModelPath`	`string`	Path to the ONNX model file	Required
`SentencePieceModelFile`	`string`	Path to SentencePiece model file	`"sentencepiece.bpe.model"`
`TokenizerConfigFile`	`string`	Path to tokenizer config JSON	`"tokenizer_config.json"`
`TokenizerJsonFile`	`string`	Path to tokenizer JSON file	`"tokenizer.json"`
`MaxSequenceLength`	`int`	Maximum sequence length for tokenization	Required
`Dimension`	`int`	Expected embedding dimension	`1024`
`BatchSize`	`int`	Batch size for processing multiple texts	`16`

GPU Acceleration

The library automatically detects and uses GPU acceleration when available:

CUDA: Automatically used on systems with NVIDIA GPUs and CUDA support
DirectML: Used on Windows systems with compatible GPUs
CPU Fallback: Automatically falls back to CPU if GPU initialization fails

GPU usage is logged when the service is initialized. Check your logs to see which provider is being used.

Tokenizers

SentencePieceTokenizer

Used by default for E5 models. Supports BPE (Byte Pair Encoding) tokenization.

using E5Embedding.Net.Tokenization;

var tokenizer = new SentencePieceTokenizer(
    sentencePieceModelFile: "path/to/sentencepiece.bpe.model",
    tokenizerConfigFile: "path/to/tokenizer_config.json",
    tokenizerJsonFile: "path/to/tokenizer.json",
    maxSequenceLength: 512
);

var encoding = tokenizer.Encode("Your text here");

BertTokenizer

BERT-style WordPiece tokenizer for models that require it.

using E5Embedding.Net.Tokenization;

var tokenizer = new BertTokenizer(
    tokenizerConfigFile: "path/to/tokenizer_config.json",
    tokenizerJsonFile: "path/to/tokenizer.json",
    maxSequenceLength: 512
);

var encoding = tokenizer.Encode("Your text here");
var pairEncoding = tokenizer.EncodePair("Query text", "Passage text");

Requirements

.NET 8.0 or later
ONNX Runtime (included via NuGet package)
E5 model files (ONNX format)
Tokenizer files (SentencePiece model, config, and JSON)

Model Files

You need the following files from your E5 model:

model.onnx - The ONNX model file
sentencepiece.bpe.model - SentencePiece tokenizer model (for SentencePieceTokenizer)
tokenizer_config.json - Tokenizer configuration
tokenizer.json - Tokenizer vocabulary and metadata

Download Model Files

You can download the ONNX model files from Hugging Face:

Download URL: https://huggingface.co/intfloat/multilingual-e5-large/tree/main/onnx

The repository contains various ONNX model formats:

model.onnx - Standard ONNX model
model.onnx_data - Model weights data
model_O4.onnx - Optimized ONNX model (O4)
model_qint8_avx512_vnni.onnx - Quantized model for AVX512 VNNI
Tokenizer files (sentencepiece.bpe.model, tokenizer_config.json, tokenizer.json)

Performance Tips

Batch Processing: Use EmbedBatchAsync for multiple texts to improve throughput
Batch Size: Adjust BatchSize based on your memory and performance requirements
GPU: Ensure GPU drivers are installed for best performance
Dispose: Always dispose the service when done to free resources

Error Handling

The library provides comprehensive error handling:

ArgumentNullException: When required parameters are null
FileNotFoundException: When model or tokenizer files are missing
InvalidOperationException: When configuration is invalid or dimensions don't match
AggregateException: When both GPU and CPU initialization fail

License

MIT License - see LICENSE file for details

Contributing

Contributions are welcome! Please feel free to submit a Pull Request.

Support

For issues, questions, or contributions, please visit the GitHub repository.

Product	Compatible and additional computed target framework versions.
.NET	net8.0 is compatible. net8.0-android was computed. net8.0-browser was computed. net8.0-ios was computed. net8.0-maccatalyst was computed. net8.0-macos was computed. net8.0-tvos was computed. net8.0-windows was computed. net9.0 is compatible. net9.0-android was computed. net9.0-browser was computed. net9.0-ios was computed. net9.0-maccatalyst was computed. net9.0-macos was computed. net9.0-tvos was computed. net9.0-windows was computed. net10.0 is compatible. net10.0-android was computed. net10.0-browser was computed. net10.0-ios was computed. net10.0-maccatalyst was computed. net10.0-macos was computed. net10.0-tvos was computed. net10.0-windows was computed.

Product

.NET

Compatible target framework(s)

Included target framework(s) (in package)

Learn more about Target Frameworks and .NET Standard.

net10.0
- Microsoft.Extensions.Logging.Abstractions (>= 10.0.8)
- Microsoft.ML.OnnxRuntime.Gpu (>= 1.26.0)
- Microsoft.ML.Tokenizers (>= 2.0.0)
net8.0
- Microsoft.Extensions.Logging.Abstractions (>= 10.0.8)
- Microsoft.ML.OnnxRuntime.Gpu (>= 1.26.0)
- Microsoft.ML.Tokenizers (>= 2.0.0)
net9.0
- Microsoft.Extensions.Logging.Abstractions (>= 10.0.8)
- Microsoft.ML.OnnxRuntime.Gpu (>= 1.26.0)
- Microsoft.ML.Tokenizers (>= 2.0.0)

NuGet packages

This package is not used by any NuGet packages.

GitHub repositories

This package is not used by any popular GitHub repositories.

Version	Downloads	Last Updated
2.0.2	82	5/25/2026
2.0.1	74	5/25/2026
2.0.0	91	5/25/2026
1.0.2	150	2/1/2026
1.0.1	122	12/30/2025
1.0.0	121	12/30/2025