Microlithix.AnsiParser 1.0.0

Prefix Reserved
dotnet add package Microlithix.AnsiParser --version 1.0.0                
NuGet\Install-Package Microlithix.AnsiParser -Version 1.0.0                
This command is intended to be used within the Package Manager Console in Visual Studio, as it uses the NuGet module's version of Install-Package.
<PackageReference Include="Microlithix.AnsiParser" Version="1.0.0" />                
For projects that support PackageReference, copy this XML node into the project file to reference the package.
paket add Microlithix.AnsiParser --version 1.0.0                
#r "nuget: Microlithix.AnsiParser, 1.0.0"                
#r directive can be used in F# Interactive and Polyglot Notebooks. Copy this into the interactive tool or source code of the script to reference the package.
// Install Microlithix.AnsiParser as a Cake Addin
#addin nuget:?package=Microlithix.AnsiParser&version=1.0.0

// Install Microlithix.AnsiParser as a Cake Tool
#tool nuget:?package=Microlithix.AnsiParser&version=1.0.0                

Logo Microlithix AnsiParser

This package contains modules for parsing character streams and strings with embedded control sequences that conform to specifications published in ECMA-48 and ANSI X3.64, later adopted as ISO/IEC 6429. Such coded control sequences are commonly referred to as ANSI escape sequences or ANSI escape codes. These control sequences are often used by terminals, terminal emulators, command shells, and other types of console applications.

Note that this package implements a parser only, and its output is a sequence of parsed elements containing a structured representation of the sequences in the input stream. It doesn't perform any actions based on the parsed elements other than providing them to the consuming application for subsequent processing. It is the responsibility of the consuming application to act upon the parsed elements, and the AnsiParser Application Programming Interface (API) defines many named constants and other values to assist with the interpretation of the parsed elements in accordance with the higher-level functionality described in ECMA-48.

This work was inspired by Paul Flo Williams' state machine description of the DEC VT500-Series parser, and by Joshua Haberman's VTParse implementation of that state machine. See the citations for details.

Full Documentation

Release Notes

License

Getting Started

Most of the functionality of the package is available through two classes named AnsiStreamParser and AnsiStringParser. The former parses a single character on each invocation and returns parsed elements on the fly via a callback function. The latter parses an entire string on each invocation and returns the result as a list of parsed elements found in the string. Both ultimately provide the same functionality, so it is up to you to decide which is best suited for your application.

AnsiStreamParser

AnsiStreamParser parses one character at a time from a stream of character data. It expects the characters to be encoded in UTF-16 format, which is the standard encoding format for the .NET char type. It should generally also work with 7 and 8-bit codes that conform to ECMA-35.

Here is a simple example of its usage:

using Microlithix.Text.Ansi;
using Microlithix.Text.Ansi.Element;

AnsiStreamParser parser = new(ParsedElementCallback);

string inputStream = $"\x1b[34mHello\x1b[0m";

foreach (char ch in inputStream) {
    parser.Parse(ch);
}

void ParsedElementCallback(IAnsiStreamParserElement element) {
    // Handle the parsed element.
    Debug.Print($"{element}");
}

This example creates a new AnsiStreamParser instance and provides a callback function that will be executed once for every parsed element produced by the parser. Then it simply calls the parser's Parse(char) method once for each character in the input stream. Executing this code will produce the following debug output:

AnsiControlSequence { Parameters = [ 34 ], ControlFunction = m }
AnsiPrintableChar { Text = H }
AnsiPrintableChar { Text = e }
AnsiPrintableChar { Text = l }
AnsiPrintableChar { Text = l }
AnsiPrintableChar { Text = o }
AnsiControlSequence { Parameters = [ 0 ], ControlFunction = m }

The parsed elements will each be one of the element types implementing the IAnsiStreamParserElement interface. Strings are returned one character at a time in elements of type AnsiPrintableChar or AnsiControlStringChar.

Note that the parser preserves its state from one invocation to the next and produces a parsed element only when it has received the complete sequence of characters for that element. Therefore, there isn't a 1-to-1 mapping between invocations of the Parse() method and the production of elements. Rather, a single invocation of the Parse() method might result in the production zero, one, or two elements depending on the sequence of characters in the input stream.

AnsiStringParser

AnsiStringParser parses an entire string on each invocation of its Parse(string) method. It expects the strings to be encoded in UTF-16 format, which is the standard string encoding format for the .NET string type. It should generally also work with 7 and 8-bit codes that conform to ECMA-35.

AnsiStringParser uses an internal instance of AnsiStreamParser for its low-level parsing functionality. Unlike AnsiStreamParser, AnsiStringParser returns a list of elements rather than invoking a callback function for each element. Furthermore, the printable characters and the characters in control strings are consolidated into strings of characters returned in elements of type AnsiPrintableString and AnsiControlString, rather than being returned one character at a time.

Here is a simple example showing how AnsiStringParser can be used:

using Microlithix.Text.Ansi;
using Microlithix.Text.Ansi.Element;

AnsiStringParser parser = new();

List<IAnsiStringParserElement> elements = parser.Parse($"\x1b[34mHello\x1b[0m");

foreach (IAnsiStringParserElement element in elements) {
    Debug.Print($"{element}");
}

This example creates a new AnsiStringParser instance and then simply calls its Parse(string) method once for each string to be parsed (in this case there is only a single string). The example above will produce the following debug output:

AnsiControlSequence { Parameters = [ 34 ], ControlFunction = m }
AnsiPrintableString { Text = Hello }
AnsiControlSequence { Parameters = [ 0 ], ControlFunction = m }

The parsed elements will each be one of the element types implementing the IAnsiStringParserElement interface.

Note that the parser preserves its state from one invocation to the next, so an unterminated control sequence in one string may be continued or terminated by a character sequence in another string during a subsequent invocation.

Product Compatible and additional computed target framework versions.
.NET net7.0 is compatible.  net7.0-android was computed.  net7.0-ios was computed.  net7.0-maccatalyst was computed.  net7.0-macos was computed.  net7.0-tvos was computed.  net7.0-windows was computed.  net8.0 was computed.  net8.0-android was computed.  net8.0-browser was computed.  net8.0-ios was computed.  net8.0-maccatalyst was computed.  net8.0-macos was computed.  net8.0-tvos was computed.  net8.0-windows was computed. 
Compatible target framework(s)
Included target framework(s) (in package)
Learn more about Target Frameworks and .NET Standard.
  • net7.0

    • No dependencies.

NuGet packages

This package is not used by any NuGet packages.

GitHub repositories

This package is not used by any popular GitHub repositories.

Version Downloads Last updated
1.0.0 182 4/25/2024