BenchGecko 0.1.1

dotnet add package BenchGecko --version 0.1.1
                    
NuGet\Install-Package BenchGecko -Version 0.1.1
                    
This command is intended to be used within the Package Manager Console in Visual Studio, as it uses the NuGet module's version of Install-Package.
<PackageReference Include="BenchGecko" Version="0.1.1" />
                    
For projects that support PackageReference, copy this XML node into the project file to reference the package.
<PackageVersion Include="BenchGecko" Version="0.1.1" />
                    
Directory.Packages.props
<PackageReference Include="BenchGecko" />
                    
Project file
For projects that support Central Package Management (CPM), copy this XML node into the solution Directory.Packages.props file to version the package.
paket add BenchGecko --version 0.1.1
                    
#r "nuget: BenchGecko, 0.1.1"
                    
#r directive can be used in F# Interactive and Polyglot Notebooks. Copy this into the interactive tool or source code of the script to reference the package.
#:package BenchGecko@0.1.1
                    
#:package directive can be used in C# file-based apps starting in .NET 10 preview 4. Copy this into a .cs file before any lines of code to reference the package.
#addin nuget:?package=BenchGecko&version=0.1.1
                    
Install as a Cake Addin
#tool nuget:?package=BenchGecko&version=0.1.1
                    
Install as a Cake Tool

BenchGecko

C# SDK for BenchGecko -- the data platform for comparing AI model benchmarks, estimating inference costs, and exploring performance across providers.

Overview

BenchGecko provides strongly-typed .NET primitives for working with LLM benchmark data. Build comparison tools, cost calculators, model selectors, and leaderboard UIs with clean C# idioms and full nullable reference type support.

The package includes:

  • Model class with fluent builder pattern for constructing models with scores and pricing
  • BenchmarkCategory enum covering 9 evaluation dimensions (Reasoning, Coding, Knowledge, Instruction, Multilingual, Safety, LongContext, Vision, Agentic)
  • ModelTier classification (S through D) based on aggregate performance
  • ModelComparer static class for head-to-head comparison, ranking, filtering, and value analysis
  • CostEstimator static class for per-request, monthly, and multi-model cost estimation
  • BenchmarkScore and Pricing records for structured data

Installation

dotnet add package BenchGecko

Or via the NuGet Package Manager:

Install-Package BenchGecko

Quick Start

using BenchGecko;

// Define models with benchmark scores and pricing
var gpt4 = new Model("gpt-4o", "OpenAI")
    .WithContextWindow(128_000)
    .WithScore(BenchmarkCategory.Reasoning, 92.3)
    .WithScore(BenchmarkCategory.Coding, 89.1)
    .WithScore(BenchmarkCategory.Knowledge, 88.7)
    .WithPricing(inputPerMTok: 2.50, outputPerMTok: 10.00);

var claude = new Model("claude-sonnet-4", "Anthropic")
    .WithContextWindow(200_000)
    .WithScore(BenchmarkCategory.Reasoning, 94.1)
    .WithScore(BenchmarkCategory.Coding, 93.7)
    .WithScore(BenchmarkCategory.Knowledge, 91.2)
    .WithPricing(inputPerMTok: 3.00, outputPerMTok: 15.00);

// Compare across shared categories
var result = ModelComparer.Compare(gpt4, claude);
Console.WriteLine($"Winner: {result.Winner.Name}");
Console.WriteLine($"Categories compared: {result.CategoriesCompared}");
Console.WriteLine($"GPT-4o wins: {result.AWins.Count}, Claude wins: {result.BWins.Count}");

Cost Estimation

Estimate inference costs for individual requests, monthly budgets, or compare across providers:

using BenchGecko;

var model = new Model("gpt-4o", "OpenAI")
    .WithPricing(inputPerMTok: 2.50, outputPerMTok: 10.00);

// Single request cost
var cost = CostEstimator.EstimateRequest(model, inputTokens: 5_000, outputTokens: 2_000);
Console.WriteLine($"Request cost: ${cost:F4}");

// Monthly budget estimate
var monthly = CostEstimator.EstimateMonthly(
    model, dailyRequests: 1000, avgInputTokens: 3_000, avgOutputTokens: 1_000);
Console.WriteLine($"Monthly estimate: ${monthly:F2}");

// Compare costs across models
var models = new[] { gpt4, claude, gemini };
var ranked = CostEstimator.CompareCosts(models, inputTokens: 10_000, outputTokens: 5_000);
foreach (var (m, c) in ranked)
    Console.WriteLine($"  {m.Name}: ${c:F4}");

Tier Classification

Models are automatically classified into performance tiers:

Tier Average Score Description
S 90+ Elite frontier models
A 80-89 Strong general-purpose models
B 70-79 Capable mid-range models
C 60-69 Budget or older generation
D <60 Entry-level or legacy
// Filter by tier
var eliteModels = ModelComparer.FilterByTier(models, ModelTier.S);

// Rank by specific category
var codingLeaders = ModelComparer.RankByCategory(models, BenchmarkCategory.Coding);

// Find best value (performance per dollar)
var bestDeal = ModelComparer.BestValue(models);
Console.WriteLine($"Best value: {bestDeal?.Name} (score/dollar: {bestDeal?.ValueScore:F1})");

Benchmark Categories

The BenchmarkCategory enum covers the major evaluation dimensions tracked by BenchGecko:

Category Typical Benchmarks
Reasoning GSM8K, MATH, ARC-Challenge
Coding HumanEval, MBPP, SWE-bench
Knowledge MMLU, HellaSwag, TriviaQA
Instruction MT-Bench, AlpacaEval
Multilingual MGSM, XLSum
Safety TruthfulQA, BBQ
LongContext RULER, Needle-in-a-Haystack
Vision MMMU, MathVista
Agentic WebArena, SWE-bench

Data Source

Benchmark data, model metadata, and pricing information are maintained by BenchGecko. Visit the platform for live leaderboards, interactive comparisons, and the full model database covering 300+ models across 50+ providers.

License

MIT

Product Compatible and additional computed target framework versions.
.NET net8.0 is compatible.  net8.0-android was computed.  net8.0-browser was computed.  net8.0-ios was computed.  net8.0-maccatalyst was computed.  net8.0-macos was computed.  net8.0-tvos was computed.  net8.0-windows was computed.  net9.0 was computed.  net9.0-android was computed.  net9.0-browser was computed.  net9.0-ios was computed.  net9.0-maccatalyst was computed.  net9.0-macos was computed.  net9.0-tvos was computed.  net9.0-windows was computed.  net10.0 was computed.  net10.0-android was computed.  net10.0-browser was computed.  net10.0-ios was computed.  net10.0-maccatalyst was computed.  net10.0-macos was computed.  net10.0-tvos was computed.  net10.0-windows was computed. 
Compatible target framework(s)
Included target framework(s) (in package)
Learn more about Target Frameworks and .NET Standard.
  • net8.0

    • No dependencies.

NuGet packages

This package is not used by any NuGet packages.

GitHub repositories

This package is not used by any popular GitHub repositories.

Version Downloads Last Updated
0.1.1 84 3/31/2026
0.1.0 80 3/31/2026