RCParsing 2.4.0
See the version list below for details.
dotnet add package RCParsing --version 2.4.0
NuGet\Install-Package RCParsing -Version 2.4.0
<PackageReference Include="RCParsing" Version="2.4.0" />
<PackageVersion Include="RCParsing" Version="2.4.0" />
<PackageReference Include="RCParsing" />
paket add RCParsing --version 2.4.0
#r "nuget: RCParsing, 2.4.0"
#:package RCParsing@2.4.0
#addin nuget:?package=RCParsing&version=2.4.0
#tool nuget:?package=RCParsing&version=2.4.0
RCParsing
A Fluent, Lexerless Parser Builder for .NET — Define ANY grammars with the elegance of BNF and the power of C#.
This library focuses on Developer-experience (DX) first, providing best toolkit for creating your programming languages, file formats or even data extraction tools with declarative API, debugging tools, and more. This allows you to design your parser directly in code and easily fix it using rule stack traces and detailed error messages.
Why RCParsing?
- 🐍 Hybrid Power: Unique support for barrier tokens to parse indent-sensitive languages like Python and YAML flawlessly.
- 💪 Regex on Steroids: You can find all matches for target structure in the input text with detailed AST information and transformed value.
- 🚫 Lexerless Freedom: No token priority headaches. Parse directly from raw text, even with keywords embedded in strings. Tokens are used just as lightweight matching primitives.
- 🎨 Fluent API: Write parsers in C# that read like clean BNF grammars, boosting readability and maintainability compared to imperative or functional approaches.
- 🐛 Debug-Friendly: Get detailed, actionable error messages with stack traces and precise source locations. Richest API for manual error information included.
- ⚡ Fast: Performance is now on par with the fastest .NET parsing libraries (see benchmarks below).
- 🌳 Rich AST: Parser makes an AST (Abstract Syntax Tree) from raw text, with ability to optimize, fully analyze and calculate the result value entirely lazy, reducing unnecessary allocations.
- 🔧 Configurable Skipping: Advanced strategies for whitespace and comments, allowing you to use conflicting tokens in your main rules.
- 📦 Batteries Included: Useful built-in tokens and rules (regex, identifiers, numbers, escaped strings, separated lists, custom tokens, and more...).
- 🖥️ Broad Compatibility: Targets
.NET Standard 2.0(runs on.NET Framework 4.6.1+),.NET 6.0, and.NET 8.0.
Table of contents
- Installation
- Tutorials, docs and examples
- Simple examples - The examples that you can copy, paste and run!
- A + B - Basic arithmetic expression parser with result calculation.
- JSON - A complete JSON parser with comments and skipping.
- Python-like - Demonstrating barrier tokens for indentation.
- Finding patterns - How to find all occurrences of a rule in a string.
- Comparison with other parsing libraries
- Benchmarks
- Projects using RCParsing
- Roadmap
- Contributing
Installation
You can install the package via NuGet Package Manager or console window, using one of these commands:
dotnet add package RCParsing
Install-Package RCParsing
Or do it manually by cloning this repository.
Tutorials, docs and examples
- Tutorials - detailed tutorials, explaining features and mechanics of this library, highly recommended to read!
Simple examples
A + B
Here is simple example how to make simple parser that parses "a + b" string with numbers and transforms the result:
using RCParsing;
using RCParsing.Building;
// First, you need to create a builder
var builder = new ParserBuilder();
// Enable and configure the auto-skip for 'Whitespaces' (you can replace it with any other rule)
builder.Settings.SkipWhitespaces();
// Create a main sequential expression rule
builder.CreateMainRule("expression")
.Number<double>()
.LiteralChoice("+", "-")
.Number<double>()
.Transform(v => {
var value1 = v.GetValue<double>(0);
var op = v.Children[1].Text;
var value2 = v.GetValue<double>(2);
return op == "+" ? value1 + value2 : value1 - value2;
});
// Build the parser
var parser = builder.Build();
// Parse a string using 'expression' rule and get the raw AST (value will be calculated lazily)
var parsedRule = parser.Parse("10 + 15");
// We can now get the value from our 'Transform' functions (value calculates now)
var transformedValue = parsedRule.GetValue<double>();
Console.WriteLine(transformedValue); // 25
JSON
And here is JSON example:
using RCParsing;
using RCParsing.Building;
var builder = new ParserBuilder();
// Configure whitespace and comment skip-rule
builder.Settings
.Skip(r => r.Rule("skip"), ParserSkippingStrategy.SkipBeforeParsingGreedy);
builder.CreateRule("skip")
.Choice(
b => b.Whitespaces(),
b => b.Literal("//").TextUntil('\n', '\r'))
.ConfigureForSkip(); // Prevents from error recording
builder.CreateToken("string")
.Literal("\"")
.EscapedTextPrefix(prefix: '\\', '\\', '\"') // This sub-token automaticaly escapes the source string and puts it into intermediate value
.Literal("\"")
.Pass(index: 1); // Pass the EscapedTextPrefix's intermediate value up (it will be used as token's result value)
builder.CreateToken("number")
.Number<double>();
builder.CreateToken("boolean")
.LiteralChoice(["true", "false"], v => v.Text == "true");
builder.CreateToken("null")
.Literal("null", _ => null);
builder.CreateRule("value")
.Choice(
c => c.Token("string"),
c => c.Token("number"),
c => c.Token("boolean"),
c => c.Token("null"),
c => c.Rule("array"),
c => c.Rule("object")
); // Choice rule propogates child's value by default
builder.CreateRule("array")
.Literal("[")
.ZeroOrMoreSeparated(v => v.Rule("value"), s => s.Literal(","),
allowTrailingSeparator: true, includeSeparatorsInResult: false,
factory: v => v.SelectArray())
.Literal("]")
.TransformSelect(1); // Selects the Children[1]'s value
builder.CreateRule("object")
.Literal("{")
.ZeroOrMoreSeparated(v => v.Rule("pair"), s => s.Literal(","),
allowTrailingSeparator: true, includeSeparatorsInResult: false,
factory: v => v.SelectValues<KeyValuePair<string, object>>().ToDictionary(k => k.Key, v => v.Value))
.Literal("}")
.TransformSelect(1);
builder.CreateRule("pair")
.Token("string")
.Literal(":")
.Rule("value")
.Transform(v => KeyValuePair.Create(v.GetValue<string>(0), v.GetValue(2)));
builder.CreateMainRule("content")
.Rule("value")
.EOF() // Sure that we captured all the input
.TransformSelect(0);
var jsonParser = builder.Build();
var json =
"""
{
"id": 1,
"name": "Sample Data",
"created": "2023-01-01T00:00:00", // This is a comment
"tags": ["tag1", "tag2", "tag3"],
"isActive": true,
"nested": {
"value": 123.456,
"description": "Nested description"
}
}
""";
// Get the result!
var result = jsonParser.Parse<Dictionary<string, object>>(json);
Console.WriteLine(result["name"]); // Output: Sample Data
Python-like
This example involves our killer-feature, barrier tokens that allows to parse indentations without missing them:
using RCParsing;
using RCParsing.Building;
var builder = new ParserBuilder();
builder.Settings.SkipWhitespaces();
// Add the 'INDENT' and 'DEDENT' barrier tokenizer
// 'INDENT' is emitted when indentation grows
// And 'DEDENT' is emitted when indentation cuts
// They are indentation delta tokens
builder.BarrierTokenizers
.AddIndent(indentSize: 4, "INDENT", "DEDENT");
// Create the statement rule
builder.CreateRule("statement")
.Choice(
b => b
.Literal("def")
.Identifier()
.Literal("():")
.Rule("block"),
b => b
.Literal("if")
.Identifier()
.Literal(":")
.Rule("block"),
b => b
.Identifier()
.Literal("=")
.Identifier()
.Literal(";"));
// Create the 'block' rule that matches our 'INDENT' and 'DEDENT' barrier tokens
builder.CreateRule("block")
.Token("INDENT")
.OneOrMore(b => b.Rule("statement"))
.Token("DEDENT");
builder.CreateMainRule("program")
.ZeroOrMore(b => b.Rule("statement"))
.EOF();
var parser = builder.Build();
string inputStr =
"""
def a():
b = c;
c = a;
a = p;
if c:
h = i;
if b:
a = aa;
""";
// Get the optimized AST...
var ast = parser.Parse(inputStr).Optimized();
// And print it!
foreach (var statement in ast.Children)
{
Console.WriteLine(statement.Text);
Console.Write("\n\n");
}
// Outputs:
/*
def a():
b = c;
c = a;
a = p;
if c:
h = i;
if b:
a = aa;
*/
Finding patterns
The FindAllMatches method allows you to extract all occurrences of a pattern from a string, even in complex inputs, while handling optional transformations. Here's an example where will will find the Price: *PRICE* (USD|EUR) pattern:
var builder = new ParserBuilder();
// Skip unnecessary whitespace (you can configure comments here and they will be ignored when matching)
builder.Settings.SkipWhitespaces();
// Create the rule that we will find in text
builder.CreateMainRule()
.Literal("Price:")
.Number<double>() // 1
.LiteralChoice(["USD", "EUR"]) // 2
.Transform(v =>
{
var number = v[1].Value; // Get the number value
var currency = v[2].Text; // Get the 'USD' or 'EUR' text
return new { Amount = number, Currency = currency };
});
var input =
"""
Some log entries.
Price: 42.99 USD
Error: something happened.
Price: 99.50 EUR
Another line.
Price: 2.50 USD
""";
// Find all transformed matches
var prices = builder.Build().FindAllMatches<dynamic>(input).ToList();
foreach (var price in prices)
{
Console.WriteLine($"Price: {price.Amount}; Currency: {price.Currency}");
}
Comparison with Other Parsing Libraries
RCParsing is designed to outstand with unique features, and easy developer experience, speed is not the target, but it is good enough to compete with other fastest parsers. The benchmarks show that it competes directly with the fastest libraries, while the feature comparison reveals why it stands apart.
Performance at a Glance (based on benchmarks)
| Library | Speed (Relative to RCParsing) | Memory Efficiency |
|---|---|---|
| RCParsing | 1.00x (baseline) | High |
| Parlot | ~4.10-4.90x faster | Excellent |
| Pidgin | ~1.00x-2.70x slower | Excellent |
| Superpower | ~5.90x-6.10x slower | Medium |
| Sprache | ~5.50x-6.50x slower | Very low |
Feature Comparison
This table highlights the unique architectural and usability features of each library.
| Feature | RCParsing | Pidgin | Parlot | Superpower | ANTLR4 |
|---|---|---|---|---|---|
| Architecture | Scannerless hybrid | Scannerless | Scannerless | Lexer-based | Lexer-based |
| API | Fluent | Functional | Fluent/functional | Fluent/functional | Grammar Files |
| Barrier/complex Tokens | Yes, built-in or manual | None | None | Yes, manual | Yes, manual |
| Skipping | 6 strategies, globally | Manual | Global or manual | Tokenizer-based | Tokenizer-based |
| Error Messages | Extremely Detailed | Position/expected | Manual messages | Position/expected | Position/expected |
| Minimum .NET Target | .NET Standard 2.0 | .NET 7.0 | .NET Standard 2.0 | .NET Standard 2.0 | .NET Framework 4.5 |
The Verdict: Why RCParsing?
Choose
RCParsingwhen you need:- Rapid Development: A fluent API that reads like a grammar definition
- Maximum Flexibility: To parse complex syntax (Python-like indentation, mixed data/code formats) with barrier tokens
- Superior Debugging: Detailed errors with stack traces to quickly pinpoint problems
- Modern Features: Built-in ruleset (
EscapedText,SeparatedRepeat,Number) for common patterns
Consider other libraries only for:
- Specialized ultra-low-memory scenarios where every byte counts (Pidgin, Parlot)
- When already invested in a different ecosystem (ANTLR)
The performance is now near-optimal, but the developer experience advantage is significant and enduring.
Benchmarks
All benchmarks are done via BenchmarkDotNet.
Here is machine and runtime information:
BenchmarkDotNet v0.15.2, Windows 10 (10.0.19045.3448/22H2/2022Update)
AMD Ryzen 5 5600 3.60GHz, 1 CPU, 12 logical and 6 physical cores
.NET SDK 9.0.302
[Host] : .NET 8.0.18 (8.0.1825.31117), X64 RyuJIT AVX2
Job-KTXINV : .NET 8.0.18 (8.0.1825.31117), X64 RyuJIT AVX2
Runtime=.NET 8.0 IterationCount=3 WarmupCount=2
JSON
The JSON value calculation with the typeset Dictionary<string, object>, object[], string, int and null.
| Method | Mean | Error | StdDev | Ratio | RatioSD | Gen0 | Gen1 | Allocated | Alloc Ratio |
|---|---|---|---|---|---|---|---|---|---|
| JsonBig_RCParsing | 181.800 us | 3.4884 us | 0.1912 us | 1.00 | 0.00 | 14.4043 | 3.6621 | 237.67 KB | 1.00 |
| JsonBig_Parlot | 41.269 us | 1.6653 us | 0.0913 us | 0.23 | 0.00 | 1.9531 | 0.1221 | 32.08 KB | 0.13 |
| JsonBig_Pidgin | 200.936 us | 5.0342 us | 0.2759 us | 1.11 | 0.00 | 3.9063 | 0.2441 | 65.25 KB | 0.27 |
| JsonBig_Superpower | 1,180.159 us | 46.5028 us | 2.5490 us | 6.49 | 0.01 | 39.0625 | 5.8594 | 638.31 KB | 2.69 |
| JsonBig_Sprache | 1,168.350 us | 149.0224 us | 8.1684 us | 6.43 | 0.04 | 232.4219 | 27.3438 | 3808.34 KB | 16.02 |
| JsonShort_RCParsing | 10.366 us | 0.5918 us | 0.0324 us | 1.00 | 0.00 | 0.8545 | 0.0153 | 14.13 KB | 1.00 |
| JsonShort_Parlot | 2.368 us | 0.0648 us | 0.0036 us | 0.23 | 0.00 | 0.1144 | - | 1.91 KB | 0.14 |
| JsonShort_Pidgin | 11.052 us | 0.6637 us | 0.0364 us | 1.07 | 0.00 | 0.2136 | - | 3.58 KB | 0.25 |
| JsonShort_Superpower | 64.223 us | 1.6115 us | 0.0883 us | 6.20 | 0.02 | 1.9531 | - | 33.32 KB | 2.36 |
| JsonShort_Sprache | 62.399 us | 3.8648 us | 0.2118 us | 6.02 | 0.02 | 12.6953 | 0.2441 | 208.17 KB | 14.74 |
Notes:
RCParsingusesUseInlining(),UseFirstCharacterMatch()andIgnoreErrors()settings.ParlotusesCompiled()version of parser.JsonShortmethods uses ~20 lines of hardcoded (not generated) JSON with simple content.JsonBigmethods uses ~180 lines of hardcoded (not generated) JSON with various content (deep, long objects/arrays).
Expressions
The int value calculation from expression with parentheses (), spaces and operators +-/* with priorities.
| Method | Mean | Error | StdDev | Ratio | RatioSD | Gen0 | Gen1 | Allocated | Alloc Ratio |
|---|---|---|---|---|---|---|---|---|---|
| ExpressionBig_RCParsing | 287,184.2 ns | 7,644.17 ns | 419.00 ns | 1.00 | 0.00 | 22.9492 | 10.7422 | 385720 B | 1.00 |
| ExpressionBig_Parlot | 60,676.0 ns | 7,001.80 ns | 383.79 ns | 0.21 | 0.00 | 3.3569 | - | 56608 B | 0.15 |
| ExpressionBig_Pidgin | 675,575.8 ns | 35,734.99 ns | 1,958.76 ns | 2.35 | 0.01 | 0.9766 | - | 23536 B | 0.06 |
| ExpressionShort_RCParsing | 2,477.7 ns | 284.49 ns | 15.59 ns | 1.00 | 0.01 | 0.2327 | - | 3904 B | 1.00 |
| ExpressionShort_Parlot | 589.3 ns | 43.17 ns | 2.37 ns | 0.24 | 0.00 | 0.0534 | - | 896 B | 0.23 |
| ExpressionShort_Pidgin | 7,071.3 ns | 50.96 ns | 2.79 ns | 2.85 | 0.02 | 0.0153 | - | 344 B | 0.09 |
Notes:
RCParsingusesUseInlining()andIgnoreErrors()settings.ParlotusesCompiled()version of parser.ExpressionShortmethods uses single line with 4 operators of hardcoded (not generated) expression.ExpressionBigmethods uses single line with ~400 operators of hardcoded (not generated) expression.
More benchmarks will be later here...
Projects using RCParsing
- RCLargeLangugeModels: My project, used for
LLT, the template Razor-like language with VERY specific syntax.
Using RCParsing in your project? We'd love to feature it here! Submit a pull request to add your project to the list.
Roadmap
The future development of RCParsing is focused on:
- Performance: Continued profiling and optimization, especially for large files with deep structures.
- API Ergonomics: Introducing even more expressive and concise fluent methods (such as expression builder).
- New Built-in Rules: Adding common patterns (e.g., number with wide range of notations) out of the box.
- Visualization Tooling: Exploring tools for debugging and visualizing grammar rules.
- API for analyzing errors: The API that will allow users to analyze errors more effectively.
- Error recovery: Ability to re-parse the content when encountering an error using the anchor token. Applicable to
RepeatandSeparatedRepeatrules. - Transformation sugars: Ignorance flags of AST childs, more automatic transformation factories.
- Incremental parsing: Parsing only changed parts in the middle of text that will be good for IDE and LSP (Language Server Protocol).
- Streaming incremental parsing: The stateful approach for parsing chunked streaming content. For example, Markdown or structured JSON output from LLM.
- Cosmic levels of debug: Very detailed parse walk traces, showing the order of what was parsed with success/fail status.
Contributing
Contributions are welcome!
If you have an idea about this project, you can report it to Issues.
For contributing code, please fork the repository and make your changes in a new branch. Once you're ready, create a pull request to merge your changes into the main branch. Pull requests should include a clear description of what was changed and why.
| Product | Versions Compatible and additional computed target framework versions. |
|---|---|
| .NET | net5.0 was computed. net5.0-windows was computed. net6.0 is compatible. net6.0-android was computed. net6.0-ios was computed. net6.0-maccatalyst was computed. net6.0-macos was computed. net6.0-tvos was computed. net6.0-windows was computed. net7.0 was computed. net7.0-android was computed. net7.0-ios was computed. net7.0-maccatalyst was computed. net7.0-macos was computed. net7.0-tvos was computed. net7.0-windows was computed. net8.0 is compatible. net8.0-android was computed. net8.0-browser was computed. net8.0-ios was computed. net8.0-maccatalyst was computed. net8.0-macos was computed. net8.0-tvos was computed. net8.0-windows was computed. net9.0 was computed. net9.0-android was computed. net9.0-browser was computed. net9.0-ios was computed. net9.0-maccatalyst was computed. net9.0-macos was computed. net9.0-tvos was computed. net9.0-windows was computed. net10.0 was computed. net10.0-android was computed. net10.0-browser was computed. net10.0-ios was computed. net10.0-maccatalyst was computed. net10.0-macos was computed. net10.0-tvos was computed. net10.0-windows was computed. |
| .NET Core | netcoreapp2.0 was computed. netcoreapp2.1 was computed. netcoreapp2.2 was computed. netcoreapp3.0 was computed. netcoreapp3.1 was computed. |
| .NET Standard | netstandard2.0 is compatible. netstandard2.1 was computed. |
| .NET Framework | net461 was computed. net462 was computed. net463 was computed. net47 was computed. net471 was computed. net472 was computed. net48 was computed. net481 was computed. |
| MonoAndroid | monoandroid was computed. |
| MonoMac | monomac was computed. |
| MonoTouch | monotouch was computed. |
| Tizen | tizen40 was computed. tizen60 was computed. |
| Xamarin.iOS | xamarinios was computed. |
| Xamarin.Mac | xamarinmac was computed. |
| Xamarin.TVOS | xamarintvos was computed. |
| Xamarin.WatchOS | xamarinwatchos was computed. |
-
.NETStandard 2.0
- System.Collections.Immutable (>= 8.0.0)
-
net6.0
- System.Collections.Immutable (>= 8.0.0)
-
net8.0
- System.Collections.Immutable (>= 8.0.0)
NuGet packages (1)
Showing the top 1 NuGet packages that depend on RCParsing:
| Package | Downloads |
|---|---|
|
LLTSharp
A lightweight .NET template engine for LLM with messages and prompt templates support. |
GitHub repositories
This package is not used by any popular GitHub repositories.
| Version | Downloads | Last Updated |
|---|---|---|
| 5.0.0 | 215 | 11/14/2025 |
| 4.7.1 | 151 | 10/17/2025 |
| 4.7.0 | 97 | 10/17/2025 |
| 4.6.1 | 212 | 10/15/2025 |
| 4.6.0 | 167 | 10/13/2025 |
| 4.5.0 | 174 | 10/13/2025 |
| 4.4.0 | 117 | 10/11/2025 |
| 4.3.1 | 172 | 10/7/2025 |
| 4.3.0 | 170 | 10/7/2025 |
| 4.2.1 | 160 | 10/5/2025 |
| 4.2.0 | 190 | 9/30/2025 |
| 4.1.0 | 176 | 9/23/2025 |
| 4.0.0 | 180 | 9/22/2025 |
| 3.2.0 | 228 | 9/21/2025 |
| 3.1.0 | 161 | 9/21/2025 |
| 3.0.0 | 299 | 9/18/2025 |
| 2.4.0 | 225 | 9/14/2025 |
| 2.3.0 | 163 | 9/7/2025 |
| 2.2.0 | 170 | 9/1/2025 |
| 2.1.0 | 176 | 8/30/2025 |
| 2.0.2 | 205 | 8/28/2025 |
| 2.0.1 | 206 | 8/28/2025 |
| 2.0.0 | 213 | 8/28/2025 |
| 1.1.0 | 195 | 8/24/2025 |
| 1.0.0 | 159 | 8/21/2025 |