ANcpLua.Agents.Hosting.BitNet 1.4.11-alpha.1

This is a prerelease version of ANcpLua.Agents.Hosting.BitNet.

.NET CLI:

dotnet add package ANcpLua.Agents.Hosting.BitNet --version 1.4.11-alpha.1

Package Manager (run inside the Visual Studio Package Manager Console; uses the NuGet module's version of Install-Package):

NuGet\Install-Package ANcpLua.Agents.Hosting.BitNet -Version 1.4.11-alpha.1

PackageReference (for projects that support PackageReference, copy this XML node into the project file):

<PackageReference Include="ANcpLua.Agents.Hosting.BitNet" Version="1.4.11-alpha.1" />

Central Package Management (CPM): copy this XML node into the solution Directory.Packages.props file to version the package:

<PackageVersion Include="ANcpLua.Agents.Hosting.BitNet" Version="1.4.11-alpha.1" />

then reference it from the project file without a version:

<PackageReference Include="ANcpLua.Agents.Hosting.BitNet" />

Paket:

paket add ANcpLua.Agents.Hosting.BitNet --version 1.4.11-alpha.1

F# Interactive and Polyglot Notebooks (copy the #r directive into the interactive tool or script source):

#r "nuget: ANcpLua.Agents.Hosting.BitNet, 1.4.11-alpha.1"

C# file-based apps (starting in .NET 10 preview 4; place before any lines of code in the .cs file):

#:package ANcpLua.Agents.Hosting.BitNet@1.4.11-alpha.1

Cake Addin:

#addin nuget:?package=ANcpLua.Agents.Hosting.BitNet&version=1.4.11-alpha.1&prerelease

Cake Tool:

#tool nuget:?package=ANcpLua.Agents.Hosting.BitNet&version=1.4.11-alpha.1&prerelease

ANcpLua.Agents.Hosting.BitNet

Consumer toolkit for Microsoft Agent Framework — local-LLM hosting via Microsoft's BitNet b1.58 over the OpenAI-compatible /v1 surface.

Alpha-channel package. Keep isolated from stable/preview consumers unless explicitly intended.

  • Compatible with: Microsoft.Agents.AI 1.4.x
  • Tested against: Microsoft.Agents.AI 1.4.0 + Microsoft.Extensions.AI 10.5.x
  • Capability tested against: BitNet b1.58 2B-4T weights served by Microsoft's prebuilt bitnet.cpp Docker image

Standing up a BitNet server

The hosting package only speaks HTTP to an OpenAI-compatible endpoint — it never builds, downloads, or spawns the binary. You can satisfy that contract any way you like; Microsoft's prebuilt Docker image is the easiest path:

scripts/bitnet-docker.sh start    # idempotent — stops any prior container first
export BITNET_URL=http://localhost:11434

The script pins the image by digest (sha256:9d5f7f4e...cd243a as of 2026-05-12), so byte-identical runs are guaranteed until you intentionally re-resolve.

What it bundles: bitnet.cpp, the b1.58-2B-4T GGUF weights, and the patched llama-server, all exposed at /v1/chat/completions on port 11434. No Python, cmake, LUT codegen, or git clone involved.

If your environment cannot pull this image (air-gapped hosts, vendor mirrors, custom builds), point BITNET_URL at any other OpenAI-compatible /v1/chat/completions endpoint — LM Studio, vLLM with a BitNet build, your own llama-server build, a private inference gateway. The hosting package does not care how the server got there. See Other OpenAI-compatible servers below.

Health check:

curl -fsS http://localhost:11434/health && echo ok

Stop:

scripts/bitnet-docker.sh stop

Pinning by digest (production / supply-chain)

The scripts/bitnet-docker.sh start command already runs digest-pinned. If you want to re-resolve to a newer Microsoft build, the recipe is:

docker buildx imagetools inspect \
  mcr.microsoft.com/appsvc/docs/sidecars/sample-experiment:bitnet-b1.58-2b-4t-gguf \
  --format '{{json .Manifest.Digest}}'

Edit the IMAGE= constant at the top of scripts/bitnet-docker.sh with the new digest, commit, done.

Other OpenAI-compatible servers

LM Studio, vLLM with a BitNet build, a private inference gateway, your own fork — all work. Point BITNET_URL at them.

Note on Ollama: as of mid-2026 Ollama does not run BitNet b1.58 (upstream llama.cpp lacks the i2_s tensor type that BitNet requires; Ollama inherits that gap — see ollama/ollama#10334).

Why a dedicated hosting package

Stock Microsoft.Extensions.AI.OpenAI works against any OpenAI-compatible endpoint — but llama-server builds older than ggml-org/llama.cpp#19831 (merged 2026-02-23) silently ignore the SDK-emitted max_completion_tokens field and generate to the context limit. This package promotes the LegacyMaxTokensPolicy shim out of test-only territory and ships it as part of the runtime path. Once Microsoft's BitNet fork picks up the upstream merge, the policy becomes a no-op self-deleting decorator.
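The shim's core behavior can be pictured as a plain JSON rewrite: if the outgoing request body carries the SDK-emitted max_completion_tokens field, mirror it into the legacy max_tokens field that older llama-server builds do honour. The sketch below is illustrative only, using just System.Text.Json; the class and method names are hypothetical and the real LegacyMaxTokensPolicy sits inside the client pipeline, not on raw strings.

```csharp
using System.Text.Json.Nodes;

static class LegacyMaxTokensSketch
{
    // Hypothetical helper: mirrors max_completion_tokens into the legacy
    // max_tokens field that pre-#19831 llama-server builds read.
    public static string Rewrite(string requestJson)
    {
        var body = JsonNode.Parse(requestJson)!.AsObject();
        if (body.TryGetPropertyValue("max_completion_tokens", out var cap))
        {
            body.Remove("max_completion_tokens");
            body["max_tokens"] = cap; // cap is detached after Remove, so it can be re-parented
        }
        return body.ToJsonString();
    }
}
```

Requests without the field pass through unchanged, which is why the policy degrades to a no-op once the server understands max_completion_tokens natively.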

Zero-ceremony path — ANcpLua.NET.Sdk.BitNet

If your repo already uses an MSBuild SDK from ANcpLua/ANcpLua.NET.Sdk, switch the variant from .Web to .BitNet and you get this hosting package as an implicit PackageReference — no <PackageReference Include="ANcpLua.Agents.Hosting.BitNet" /> line of your own needed. The pinned version lives in the SDK's Version.props and ships in lockstep with releases here.

// global.json
{ "msbuild-sdks": { "ANcpLua.NET.Sdk.BitNet": "3.4.31" } }

<Project Sdk="ANcpLua.NET.Sdk.BitNet">
  <PropertyGroup>
    <TargetFramework>net10.0</TargetFramework>
    <NoWarn>$(NoWarn);ANCPLBITNET001</NoWarn>
  </PropertyGroup>
</Project>

Hard requirement — Central Package Management. The SDK forces ManagePackageVersionsCentrally=true and the enforcement target errors if the consumer overrides it. Ship a Directory.Packages.props at or above the consumer (an empty one with <ManagePackageVersionsCentrally>true</ManagePackageVersionsCentrally> is enough); without it, restore fails with NU1015 on the SDK-injected analyzers.
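As a concrete illustration of the "empty one is enough" escape hatch above, a minimal Directory.Packages.props could be exactly this:

```xml
<!-- Directory.Packages.props: minimal CPM opt-in; the package versions
     themselves are pinned by the SDK's Version.props -->
<Project>
  <PropertyGroup>
    <ManagePackageVersionsCentrally>true</ManagePackageVersionsCentrally>
  </PropertyGroup>
</Project>
```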

Then call builder.AddQylBitNetChatClient() in Program.cs exactly as below.

Usage — four modes

Mode 0 — zero config (defaults to BITNET_URL)

builder.AddQylBitNetChatClient();   // connection name defaults to "bitnet"

Resolves:

public sealed class MyAgent([FromKeyedServices("bitnet")] IChatClient chat) { ... }

Mode 1 — Aspire-style named connection

builder.AddQylBitNetChatClient("bitnet");
// reads ConnectionStrings:bitnet = "http://localhost:11434"

Mode 2 — programmatic configuration (keyed multi-endpoint)

builder.AddQylBitNetChatClient("local", o =>
{
    o.Endpoint = new Uri("http://localhost:11434");
    o.Model    = "bitnet-b1.58-2B-4T";
    o.ApiPath  = "/v1";
});

Mode 3 — assembly attribute + bundled source generator

[assembly: QylBitNetEndpoint("bitnet", "http://localhost:11434", Model = "bitnet-b1.58-2B-4T")]

// Program.cs — one call wires every declared endpoint
builder.AddDiscoveredQylBitNetClients();

What gets registered

  • IChatClient keyed by the connection name (singleton)
  • IOptions<QylBitNetClientOptions> bound to BitNet:<name>:* config section
  • Health check named bitnet:<name> against <endpoint>/health
  • OpenTelemetry decoration via Microsoft.Extensions.AI.UseOpenTelemetry() — emits standard gen_ai.* spans/meters
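For the options-binding bullet above, a configuration shape like the following would bind to BitNet:local:* for a connection named "local". The keys mirror the QylBitNetClientOptions properties shown in Mode 2; the surrounding file layout is an assumption of standard appsettings.json conventions.

```json
{
  "BitNet": {
    "local": {
      "Endpoint": "http://localhost:11434",
      "Model": "bitnet-b1.58-2B-4T",
      "ApiPath": "/v1"
    }
  }
}
```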

Environment overrides

For test scenarios and the BitNetFixture contract (ANcpLua.Agents.Testing.BitNet.BitNetFixture):

  • BITNET_URL — overrides Endpoint. When unset, the fixture auto-starts the Docker image itself (unless BITNET_FIXTURE_NO_DOCKER is truthy).
  • BITNET_API_PATH — overrides ApiPath (default /v1)
  • BITNET_MODEL — overrides Model (default bitnet-b1.58-2B-4T)
  • BITNET_FIXTURE_NO_DOCKER — set truthy to opt the fixture out of auto-Docker

Env vars win over config so existing fixture consumers keep working.
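That precedence is a simple fallback chain; here is a hedged sketch of the idea for the Model setting (the helper and the dictionary standing in for bound configuration are illustrative, not the package's actual code):

```csharp
using System;
using System.Collections.Generic;

static class ResolutionSketch
{
    // Illustrative only: environment variable wins, then bound config,
    // then the documented default.
    public static string ResolveModel(
        IReadOnlyDictionary<string, string?> boundConfig,
        string name = "bitnet")
        => Environment.GetEnvironmentVariable("BITNET_MODEL")
           ?? boundConfig.GetValueOrDefault($"BitNet:{name}:Model")
           ?? "bitnet-b1.58-2B-4T";
}
```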

Troubleshooting

  • Symptom: "BitNet endpoint is not configured" at startup.
    Likely cause: BITNET_URL not set and no ConnectionStrings:<name> / BitNet:<name>:Endpoint bound.
    Fix: Run scripts/bitnet-docker.sh start then export BITNET_URL=http://localhost:11434, or pass configure: o => o.Endpoint = ...

  • Symptom: Health check bitnet:<name> is Unhealthy.
    Likely cause: Server not listening, or wrong port.
    Fix: curl $BITNET_URL/health to confirm; check the container with scripts/bitnet-docker.sh status.

  • Symptom: Model generates until the context fills, ignoring the token cap.
    Likely cause: Running against a llama-server older than llama.cpp PR #19831 without our shim.
    Fix: Confirm Microsoft.Extensions.AI.OpenAI is going through QylBitNetChatClientFactory.Create; the LegacyMaxTokensPolicy is applied there automatically.

  • Symptom: "Unexpected end of input" / GGUF errors on stock Ollama.
    Likely cause: Ollama uses upstream llama.cpp, which lacks i2_s.
    Fix: Ollama is not a target; use the Docker image.
Target frameworks

  • Compatible: net10.0
  • Computed: net10.0-android, net10.0-browser, net10.0-ios, net10.0-maccatalyst, net10.0-macos, net10.0-tvos, net10.0-windows


Version history

  • 1.4.11-alpha.1 (0 downloads, last updated 5/14/2026)
  • 1.4.10-alpha.1 (60 downloads, last updated 5/13/2026)

First publish to nuget.org. Pairs with ANcpLua.Agents.Testing's BitNetFixture, which auto-manages Microsoft's prebuilt bitnet.cpp Docker image (digest-pinned) when BITNET_URL is unset and Docker is on PATH. Includes the LegacyMaxTokensPolicy shim for llama-server builds pre-ggml-org/llama.cpp#19831.