IronOcr 4.2.2

IronOcr - The C# Ocr Library

IronOCR is an advanced OCR (Optical Character Recognition) & Barcode library for C# and VB.Net.

The engine adds OCR functionality to Desktop, Console and Web applications in minutes. IronOCR reads Text, Barcodes & QR from all major image and PDF formats.

Language packs available for Arabic, Simplified Chinese, Traditional Chinese, Danish, English, Finnish, French, German, Hebrew, Italian, Japanese, Korean, Portuguese, Russian, Spanish, and Swedish at http://ironsoftware.com/csharp/ocr/
 or as individual NuGet packages.

What makes IronOCR different is how well it deals with poor quality scans. IronOCR can read scans with low resolution, paper distortion and background noise.  It takes the time to analyze the input file and prepare it for OCR to achieve consistent results every time.

Image pre-processing supports: Rotation & De-Skewing, Background Noise Cleaning, Resolution Enhancement, Increasing Contrast, Colour Simplification, and Setting Crop Regions.

IronOCR automatically detects the quality of an image before reading. Developers can adjust advanced image processing settings if needed.

Data output options include: Plain Text, Barcode Data, Object model structured data (headings, paragraphs, lines, words, and characters for each page).

Requires: .Net framework 4.0 or above on the Windows Platform including Cloud hosting such as Azure.  Supports multithreading to improve OCR performance on multi-core servers.

Licensing & Support available for commercial deployments. For code examples, documentation & more visit http://ironsoftware.com/csharp/ocr/ For support please email us at developers@ironsoftware.com

There is a newer version of this package available.
See the version list below for details.
Install-Package IronOcr -Version 4.2.2
dotnet add package IronOcr --version 4.2.2
<PackageReference Include="IronOcr" Version="4.2.2" />
For projects that support PackageReference, copy this XML node into the project file to reference the package.
paket add IronOcr --version 4.2.2
The NuGet Team does not provide support for this client. Please contact its maintainers for support.

IronOCR

An advanced OCR library for .Net. The OCR engine reads and extracts text from images and scanned documents. “Image to text” functionality is added to Desktop, Console and Web applications in about 5 minuites.


C# Example 1 - Automated Ocr:
==============================
using IronOcr;
//..
var Ocr = new AutoOcr();
var Result = Ocr.Read(@"C:\path\to\image.png");
Console.WriteLine(Result.Text);

C# Example 2 - Advanced Ocr:
==============================
using IronOcr;
//..
var Ocr = new AdvancedOcr()
{
CleanBackgroundNoise = true,
EnhanceContrast = true,
EnhanceResolution = true,
Language = IronOcr.Languages.English.OcrLanguagePack,
Strategy = IronOcr.AdvancedOcr.OcrStrategy.Advanced,
ColorSpace = AdvancedOcr.OcrColorSpace.Color,
DetectWhiteTextOnDarkBackgrounds = true,
InputImageType = AdvancedOcr.InputTypes.AutoDetect,
RotateAndStraighten = true,
ReadBarCodes = true,
ColorDepth =4
};

var testImage = @"C:\path\to\scan.tiff";

var Results = Ocr.Read(testImage);

var Barcodes = Results.Barcodes.Select(b => b.Value);

Console.WriteLine(Results.Text);
Console.WriteLine("Barcodes:" + String.Join(",", Barcodes));

C# Example 3 - PDF Ocr:
==============================
using IronOcr;
//..
var Ocr = new AdvancedOcr()
{
CleanBackgroundNoise = false,
ColorDepth = 4,
ColorSpace = AdvancedOcr.OcrColorSpace.Color,
EnhanceContrast = false,
DetectWhiteTextOnDarkBackgrounds = false,
RotateAndStraighten = false,
Language = IronOcr.Languages.English.OcrLanguagePack,
EnhanceResolution = false,
InputImageType = AdvancedOcr.InputTypes.Document,
ReadBarCodes = true,
Strategy = AdvancedOcr.OcrStrategy.Fast
};

var Results = Ocr.ReadPdf(@"C:\Users\Me\Desktop\Invoice.pdf");
var Barcodes = Results.Barcodes.Select(b => b.Value);

Console.WriteLine(Results.Text);
Console.WriteLine("Barcodes:" + String.Join(",", Barcodes));

Supports: Console Applications, ASP.NET Web Applications, MVC, Web APIs, Desktop Applications written in any .Net language such as C# or VB.Net

Requires: .Net framework 4.0 or above on the Windows Platform or Azure Cloud.

For more information visit http://ironsoftware.com/csharp/ocr/

IronOCR

An advanced OCR library for .Net. The OCR engine reads and extracts text from images and scanned documents. “Image to text” functionality is added to Desktop, Console and Web applications in about 5 minuites.


C# Example 1 - Automated Ocr:
==============================
using IronOcr;
//..
var Ocr = new AutoOcr();
var Result = Ocr.Read(@"C:\path\to\image.png");
Console.WriteLine(Result.Text);

C# Example 2 - Advanced Ocr:
==============================
using IronOcr;
//..
var Ocr = new AdvancedOcr()
{
CleanBackgroundNoise = true,
EnhanceContrast = true,
EnhanceResolution = true,
Language = IronOcr.Languages.English.OcrLanguagePack,
Strategy = IronOcr.AdvancedOcr.OcrStrategy.Advanced,
ColorSpace = AdvancedOcr.OcrColorSpace.Color,
DetectWhiteTextOnDarkBackgrounds = true,
InputImageType = AdvancedOcr.InputTypes.AutoDetect,
RotateAndStraighten = true,
ReadBarCodes = true,
ColorDepth =4
};

var testImage = @"C:\path\to\scan.tiff";

var Results = Ocr.Read(testImage);

var Barcodes = Results.Barcodes.Select(b => b.Value);

Console.WriteLine(Results.Text);
Console.WriteLine("Barcodes:" + String.Join(",", Barcodes));

C# Example 3 - PDF Ocr:
==============================
using IronOcr;
//..
var Ocr = new AdvancedOcr()
{
CleanBackgroundNoise = false,
ColorDepth = 4,
ColorSpace = AdvancedOcr.OcrColorSpace.Color,
EnhanceContrast = false,
DetectWhiteTextOnDarkBackgrounds = false,
RotateAndStraighten = false,
Language = IronOcr.Languages.English.OcrLanguagePack,
EnhanceResolution = false,
InputImageType = AdvancedOcr.InputTypes.Document,
ReadBarCodes = true,
Strategy = AdvancedOcr.OcrStrategy.Fast
};

var Results = Ocr.ReadPdf(@"C:\Users\Me\Desktop\Invoice.pdf");
var Barcodes = Results.Barcodes.Select(b => b.Value);

Console.WriteLine(Results.Text);
Console.WriteLine("Barcodes:" + String.Join(",", Barcodes));

Supports: Console Applications, ASP.NET Web Applications, MVC, Web APIs, Desktop Applications written in any .Net language such as C# or VB.Net

Requires: .Net framework 4.0 or above on the Windows Platform or Azure Cloud.

For more information visit http://ironsoftware.com/csharp/ocr/

Release Notes

This build adds:

- PDF OCR support stability issues resolved
- Async Multithread PDF Ocr

Dependencies

This package has no dependencies.

This package is not used by any popular GitHub repositories.

Version History

Version Downloads Last updated
4.4.0 32,990 6/21/2018
4.3.0.1 8,666 4/9/2018
4.2.2.51 2,335 1/22/2018
4.2.2.5 1,499 1/19/2018
4.2.2.3 546 1/15/2018
4.2.2.1 1,074 12/1/2017
4.2.2 367 12/1/2017
4.2.1.5 1,993 9/9/2017
4.2.1.2 248 9/8/2017
4.2.1.1 252 9/6/2017
4.2.0 310 9/5/2017
4.1.1 1,188 8/4/2017
4.1.0 242 8/2/2017
4.0.10 1,124 1/12/2017
4.0.9 401 12/20/2016
Show less