OpenccNetLib 1.0.3

dotnet add package OpenccNetLib --version 1.0.3
                    
NuGet\Install-Package OpenccNetLib -Version 1.0.3
                    
This command is intended to be used within the Package Manager Console in Visual Studio, as it uses the NuGet module's version of Install-Package.
<PackageReference Include="OpenccNetLib" Version="1.0.3" />
                    
For projects that support PackageReference, copy this XML node into the project file to reference the package.
<PackageVersion Include="OpenccNetLib" Version="1.0.3" />
                    
Directory.Packages.props
<PackageReference Include="OpenccNetLib" />
                    
Project file
For projects that support Central Package Management (CPM), copy this XML node into the solution Directory.Packages.props file to version the package.
paket add OpenccNetLib --version 1.0.3
                    
#r "nuget: OpenccNetLib, 1.0.3"
                    
#r directive can be used in F# Interactive and Polyglot Notebooks. Copy this into the interactive tool or source code of the script to reference the package.
#:package OpenccNetLib@1.0.3
                    
#:package directive can be used in C# file-based apps starting in .NET 10 preview 4. Copy this into a .cs file before any lines of code to reference the package.
#addin nuget:?package=OpenccNetLib&version=1.0.3
                    
Install as a Cake Addin
#tool nuget:?package=OpenccNetLib&version=1.0.3
                    
Install as a Cake Tool

OpenccNet

NuGet NuGet Downloads License Release

OpenccNetLib is a fast and efficient .NET library for converting Chinese text, offering support for Simplified ↔ Traditional, Taiwan, Hong Kong, and Japanese Kanji variants. Built with inspiration from OpenCC, this library is designed to integrate seamlessly into modern .NET projects with a focus on performance and minimal memory usage.

Table of Contents

Features

  • Fast, multi-stage conversion using static dictionary caching
  • Supports:
    • Simplified ↔ Traditional Chinese
    • Traditional (Taiwan) ↔ Simplified/Traditional
    • Traditional (Hong Kong) ↔ Simplified/Traditional
    • Japanese Kanji Shinjitai ↔ Traditional Kyujitai
  • Optional punctuation conversion
  • Thread-safe and suitable for parallel processing
  • .NET Standard 2.0 compatible

Installation

  • Add the library to your project via NuGet or reference the source code directly.
  • Add required dependencies of dictionary files to library root.
    • dicts\dictionary_maxlength.zstd Default dictionary file.
    • dicts\*.* Others dictionary files for different configurations.

Install via NuGet:

dotnet add package OpenccNetLib

Or, clone and include the source files in your project.

Usage

Basic Example

using OpenccNetLib;
var opencc = new Opencc("s2t"); // Simplified to Traditional 
string traditional = opencc.Convert("汉字转换测试"); 
Console.WriteLine(traditional);
// Output: 漢字轉換測試

Supported Configurations

Config Description
s2t Simplified → Traditional
t2s Traditional → Simplified
s2tw Simplified → Traditional (Taiwan)
tw2s Traditional (Taiwan) → Simplified
s2twp Simplified → Traditional (Taiwan, phrases)
tw2sp Traditional (Taiwan, phrases) → Simplified
s2hk Simplified → Traditional (Hong Kong)
hk2s Traditional (Hong Kong) → Simplified
t2tw Traditional → Traditional (Taiwan)
tw2t Traditional (Taiwan) → Traditional
t2twp Traditional → Traditional (Taiwan, phrases)
tw2tp Traditional (Taiwan, phrases) → Traditional
t2hk Traditional → Traditional (Hong Kong)
hk2t Traditional (Hong Kong) → Traditional
t2jp Traditional Kyujitai → Japanese Kanji Shinjitai
jp2t Japanese Kanji Shinjitai → Traditional Kyujitai

Example: Convert with Punctuation

var opencc = new Opencc("s2t"); 
string result = opencc.Convert("“汉字”转换。", punctuation: true);
Console.WriteLine(result);
// Output: 「漢字」轉換。

Example: Switching Config Dynamically

using OpenccNetLib;

var opencc = new Opencc("s2t");

// Initial conversion
string result = opencc.Convert("动态切换转换方式");
Console.WriteLine(result);  // Output: 動態切換轉換方式

// Switch config using string
opencc.Config = "t2s";  // Also valid: opencc.SetConfig("t2s")
result = opencc.Convert("動態切換轉換方式");
Console.WriteLine(result);  // Output: 动态切换转换方式

// Switch config using enum (recommended for safety and autocomplete)
opencc.SetConfig(OpenccConfig.S2T);
result = opencc.Convert("动态切换转换方式");
Console.WriteLine(result);  // Output: 動態切換轉換方式

// Invalid config falls back to "s2t"
opencc.Config = "invalid_config";
Console.WriteLine(opencc.GetLastError());  // Output: Invalid config provided: invalid_config. Using default 's2t'.
💡 Tips
  • Use OpenccConfig enum for compile-time safety and IntelliSense support.
  • Use GetLastError() to check if fallback occurred due to an invalid config.
  • You can also validate config strings with Opencc.IsValidConfig("t2tw").

Direct API Methods

You can also use direct methods for specific conversions:

using OpenccNetLib;
var opencc = new Opencc();
opencc.S2T("汉字");      
// Simplified to Traditional opencc.T2S("漢字");      
// Traditional to Simplified opencc.S2Tw("汉字");     
// Simplified to Taiwan Traditional opencc.T2Jp("漢字");     
// Traditional to Japanese Kanji
// ...and more

Error Handling

If an error occurs (e.g., invalid config), use:

string error = opencc.GetLastError();
Console.WriteLine(error); // Output the last error message

Language Detection

Detect if a string is Simplified, Traditional, or neither:

using OpenccNetLib;
int result = Opencc.ZhoCheck("汉字"); // Returns 2 for Simplified, 1 for Traditional, 0 for neither
Console.WriteLine(result); // Output: 2 (for Simplified)

Using Custom Dictionary

Library default is zstd compressed dictionary Lexicon. It can be changed to custom dictionary (JSON, CBOR or "baseDir/*.txt") prior to instantiate Opencc():

using OpenccNetLib;
Opencc.UseCustomDictionary(DictionaryLib.FromDicts()) // Init only onece, dicts from baseDir "./dicts/"
var opencc = new Opencc("s2t"); // Simplified to Traditional 
string traditional = opencc.Convert("汉字转换测试"); 
Console.WriteLine(traditional); // Output: 漢字轉換測試

Performance

  • Uses static dictionary caching and thread-local buffers for high throughput.
  • Suitable for batch and parallel processing scenarios.

📊 Benchmark Results – OpenccNetLib 1.0.3

BenchmarkDotNet v0.15.2 · .NET 9.0.7 · Windows 11 · RyuJIT AVX2
Test: BM_Convert_Sized · Warmup + 10 Iterations

Input Size Mean Time Gen0 (per 1k ops) Gen1 Gen2 Allocated Memory
100 10.79 µs 2.08 21.20 KB
1,000 169.58 µs 22.46 0.98 229.43 KB
10,000 472.20 µs 195.80 57.13 1.99 MB
100,000 8.57 ms 2218.75 531.25 218.75 21.72 MB
1,000,000 88.61 ms 21,833.33 5,666.67 1,000.00 225.30 MB

⏱ Relative Performance Chart

Benchmark: Time vs Memory

✅ Highlights

  • ✅ Preallocated StringBuilder delivers consistent performance across all input sizes.
  • 🚀 Inclusive splitting ensures fewer ConvertBy() calls, improving throughput.
  • 🔁 Parallel processing kicks in for large workloads (≥16 segments, ≥2000 chars) to utilize multicore efficiency.
  • 📉 Memory usage scales linearly with input size — from 21 KB to 225 MB — no spikes.
  • 🧠 GC pressure remains stable and predictable, even at 1M characters:
    • Gen0: ~21K collections,
    • Gen1: ~5.6K,
    • Gen2: ~1K — all expected and manageable.
  • ⚡ Warm startup is fast, ideal for both CLI batch conversion and interactive GUI usage.
  • ✨ OpenccNetLib 1.0.3 is now production-ready for large-scale Chinese text conversion.

API Reference

Opencc Class

🔧 Constructor
  • Opencc(string config = null)
    Create a new converter with the specified configuration.
🔁 Conversion Methods
  • string Convert(string inputText, bool punctuation = false)
    Convert text according to the current config and punctuation mode.

  • string S2T(string inputText, bool punctuation = false)

  • string T2S(string inputText, bool punctuation = false)

  • string S2Tw(string inputText, bool punctuation = false)

  • string Tw2S(string inputText, bool punctuation = false)

  • string S2Twp(string inputText, bool punctuation = false)

  • string Tw2Sp(string inputText, bool punctuation = false)

  • string S2Hk(string inputText, bool punctuation = false)

  • string Hk2S(string inputText, bool punctuation = false)

  • string T2Tw(string inputText)

  • string T2Twp(string inputText)

  • string Tw2T(string inputText)

  • string Tw2Tp(string inputText)

  • string T2Hk(string inputText)

  • string Hk2T(string inputText)

  • string T2Jp(string inputText)

  • string Jp2T(string inputText)

⚙️ Configuration
  • string Config { get; set; }
    Gets or sets the current config string. Invalid configs fallback to "s2t" and update error status.
  • void SetConfig(string config)
    Set the config using a string (e.g., "tw2sp"). Falls back to "s2t" if invalid.
  • void SetConfig(OpenccConfig configEnum)
    Set the config using a strongly typed OpenccConfig enum. Recommended for safety and IDE support.
  • string GetConfig()
    Returns the current config string (e.g., "s2tw").
  • string GetLastError()
    Returns the most recent error message, if any, from config setting.
📋 Validation and Helpers
  • static bool IsValidConfig(string config)
    Checks whether the given string is a valid config name.
  • static IReadOnlyCollection<string> GetSupportedConfigs()
    Returns the list of all supported config names as strings.
  • static bool TryParseConfig(string config, out OpenccConfig result)
    Converts a valid config string to the corresponding OpenccConfig enum. Returns false if invalid.
  • static int ZhoCheck(string inputText)
    Detects whether the input is likely Simplified Chinese (2), Traditional Chinese (1), or neither (0).

Dictionary Data

  • Dictionaries are loaded and cached on first use.
  • Data files are expected in the dicts/ directory (see DictionaryLib for details).

Add-On CLI Tools (Separated from OpenccNetLib)

OpenccNet dictgen

Description:
  Generate OpenccNetLib dictionary files.

Usage:
  OpenccNet dictgen [options]

Options:
  -f, --format <cbor|json|zstd>  Dictionary format: [zstd|cbor|json] [default: zstd]
  -o, --output <output>          Output filename. Default: dictionary_maxlength.<ext>
  -b, --base-dir <base-dir>      Base directory containing source dictionary files [default: dicts]
  -?, -h, --help                 Show help and usage information

OpenccNet convert

Description:
  Convert text using OpenccNetLib configurations.

Usage:
  OpenccNet convert [options]

Options:
  -i, --input              Read original text from file <input>
  -o, --output             Write original text to file <output>
  -c, --config (REQUIRED)  Conversion configuration: s2t|s2tw|s2twp|s2hk|t2s|tw2s|tw2sp|hk2s|jp2t|t2jp
  -p, --punct              Punctuation conversion. [default: False]
  --in-enc                 Encoding for input: UTF-8|UNICODE|GBK|GB2312|BIG5|Shift-JIS [default: UTF-8]
  --out-enc                Encoding for output: UTF-8|UNICODE|GBK|GB2312|BIG5|Shift-JIS [default: UTF-8]
  -?, -h, --help           Show help and usage information

OpenccNet office

Description:
  Convert Office documents or Epub using OpenccNetLib.

Usage:
  OpenccNet office [options]

Options:
  -i, --input              Input Office document <input>
  -o, --output             Output Office document <output>
  -c, --config (REQUIRED)  Conversion configuration: s2t|s2tw|s2twp|s2hk|t2s|tw2s|tw2sp|hk2s|jp2t|t2jp
  -p, --punct              Enable punctuation conversion. [default: False]
  -f, --format             Force Office document format: docx | xlsx | pptx | odt | ods | odp | epub
  --keep-font              Preserve font names in Office documents [default: true]. Use --keep-font:false to disable. [default: True]
  --auto-ext               Auto append correct extension to Office output files [default: true]. Use --auto-ext:false to disable. [default: True]
  -?, -h, --help           Show help and usage information

Project That Use OpenccNetLib

  • OpenccNetLibGui : A GUI application for OpenccNetLib, providing a user-friendly interface for Chinese text conversion.

License

This project is licensed under the MIT License. See the LICENSE file for details.


OpenccNet is not affiliated with the original OpenCC project, but aims to provide a compatible and high-performance solution for .NET developers.

Product Compatible and additional computed target framework versions.
.NET net5.0 was computed.  net5.0-windows was computed.  net6.0 was computed.  net6.0-android was computed.  net6.0-ios was computed.  net6.0-maccatalyst was computed.  net6.0-macos was computed.  net6.0-tvos was computed.  net6.0-windows was computed.  net7.0 was computed.  net7.0-android was computed.  net7.0-ios was computed.  net7.0-maccatalyst was computed.  net7.0-macos was computed.  net7.0-tvos was computed.  net7.0-windows was computed.  net8.0 was computed.  net8.0-android was computed.  net8.0-browser was computed.  net8.0-ios was computed.  net8.0-maccatalyst was computed.  net8.0-macos was computed.  net8.0-tvos was computed.  net8.0-windows was computed.  net9.0 was computed.  net9.0-android was computed.  net9.0-browser was computed.  net9.0-ios was computed.  net9.0-maccatalyst was computed.  net9.0-macos was computed.  net9.0-tvos was computed.  net9.0-windows was computed.  net10.0 was computed.  net10.0-android was computed.  net10.0-browser was computed.  net10.0-ios was computed.  net10.0-maccatalyst was computed.  net10.0-macos was computed.  net10.0-tvos was computed.  net10.0-windows was computed. 
.NET Core netcoreapp2.0 was computed.  netcoreapp2.1 was computed.  netcoreapp2.2 was computed.  netcoreapp3.0 was computed.  netcoreapp3.1 was computed. 
.NET Standard netstandard2.0 is compatible.  netstandard2.1 was computed. 
.NET Framework net461 was computed.  net462 was computed.  net463 was computed.  net47 was computed.  net471 was computed.  net472 was computed.  net48 was computed.  net481 was computed. 
MonoAndroid monoandroid was computed. 
MonoMac monomac was computed. 
MonoTouch monotouch was computed. 
Tizen tizen40 was computed.  tizen60 was computed. 
Xamarin.iOS xamarinios was computed. 
Xamarin.Mac xamarinmac was computed. 
Xamarin.TVOS xamarintvos was computed. 
Xamarin.WatchOS xamarinwatchos was computed. 
Compatible target framework(s)
Included target framework(s) (in package)
Learn more about Target Frameworks and .NET Standard.

NuGet packages

This package is not used by any NuGet packages.

GitHub repositories

This package is not used by any popular GitHub repositories.

Version Downloads Last Updated
1.0.3 84 7/29/2025
1.0.2 143 7/9/2025
1.0.1 149 6/19/2025
1.0.0 213 5/31/2025

OpenccNetLib v1.0.3

A major performance-focused update for the OpenCC-based Chinese text conversion library.

✅ Highlights:
- New parallel segment processing engine for large-scale text conversion
- Optimized `StringBuilder` usage with smart preallocation (up to 20% faster in benchmarks)
- Inclusive splitting improves dictionary lookup performance with fewer overhead calls
- Memory allocation and GC pressure now scale linearly and predictably
- Unified join logic for small and large inputs (no more string.Concat bottlenecks)
- Consistently fast warm and cold starts in both CLI and GUI environments

🔧 Compatible with .NET Standard 2.0+

Benchmark Results:
- 1M character input processed in ~88 ms with 225 MB allocated — a 3%+ gain over v1.0.2

Project:
https://github.com/laisuk/OpenccNet