IronOcr 2021.12.0

The ID prefix of this package has been reserved for one of the owners of this package by NuGet.org. Prefix Reserved
There is a newer version of this package available.
See the version list below for details.
dotnet add package IronOcr --version 2021.12.0
NuGet\Install-Package IronOcr -Version 2021.12.0
This command is intended to be used within the Package Manager Console in Visual Studio, as it uses the NuGet module's version of Install-Package.
<PackageReference Include="IronOcr" Version="2021.12.0" />
For projects that support PackageReference, copy this XML node into the project file to reference the package.
paket add IronOcr --version 2021.12.0
#r "nuget: IronOcr, 2021.12.0"
#r directive can be used in F# Interactive and Polyglot Notebooks. Copy this into the interactive tool or source code of the script to reference the package.
// Install IronOcr as a Cake Addin
#addin nuget:?package=IronOcr&version=2021.12.0

// Install IronOcr as a Cake Tool
#tool nuget:?package=IronOcr&version=2021.12.0

Passing Nuget Downloads Support

IronOCR is an advanced OCR (Optical Character Recognition) library for C# and .NET

IronOCR provides Tesseract OCR on Mac, Windows, Linux, Azure and Docker for:

  • .Net Framework 4.0 +
  • .Net Standard 2.0 +
  • .Net Core 2.0 +
  • .Net 5
  • Mono for MacOS and Linux
  • Xamarin for MacOS

IronOCR reads Text, Barcodes & QR from all major image and PDF formats using the latest Tesseract 5 engine. This library adds OCR functionality to Desktop, Console and Web applications in minutes.

IronOCR's Unique Features

  • Pure .Net OCR API
  • All OCR tasks run locally (no SAAS)
  • 125 languages
  • Barcode & QR Code reading
  • Corrects low quality, noisy and distorted scans
  • Performance tuned above and beyond any other known build of Tesseract OCR.
  • Reads PDFs and multi-page TIFFs
  • Can save any OCR Scan to a searchable PDF document or XHTML

Data Output Options Include

Output Plain Text, Barcode Data and an OCR Result class containing paragraphs, lines, words, and characters.

International Language Support

125 Languages supported including Arabic, Chinese, English, Finnish, French, German, Hebrew, Italian, Japanese, Korean, Portuguese, Russian, Spanish... Custom language packs can also be created.

Licensing & Support available

For code examples, documentation & more visit http://ironsoftware.com/csharp/ocr/ Email: developers@ironsoftware.com

Get Started Code Example

string Result = new IronOcr.IronTesseract().Read("scan.pdf").Text;

Why C# developers choose IronOCR over Vanilla Tesseract:

  • Achieve 99.8%+ OCR accuracy without using external web services.
  • Includes for Tesseract 5 , 4 and 3 Engines out of the box.
  • Blazing Speed and MultiThreading
  • MVC, WebApp, Desktop, Console & Server Application compatible
  • No Exes or C++ code to work with
  • Full PDF OCR support
  • To perform OCR an almost any Image file or PDF
  • Full .Net Core, Standard and FrameWork support
  • Deploy on Windows, Mac, Linux, Azure, Docker, Lambda, AWS
  • Read barcodes and QR codes
  • Export OCR as to XHTML
  • Export OCR to searchable PDF documents
  • Multithreading support
  • 125 international languages all managed via Nuget or OcrData files
  • Extract Images, Coordinates, Statistics and Fonts. Not just text.
  • Can be used to redistribute Tesseract OCR inside commercial & proprietary applications.
  • Supports: windows Linux Mac Azure AWS Docker

IronOCR shines when working with real world images and imperfect documents such as photographs, or scans of low resolution which may have digital noise or imperfections. Other free OCR libraries for the .NET platform such other .Net Tesseract APIs and web services do not perform so well on these real world use cases.  

OCR with Tesseract 5 - Start Coding in C#

These code examples below shows how easy it is to read text from an image using C# or VB .NET.

Configurable Hello World

using IronOcr;

var Ocr = new IronTesseract();
using (var Input = new OcrInput()){
    Input.AddImage("images/sample.jpeg")
    //... you can add any number of images
    var Result = Ocr.Read(Input);
    Console.WriteLine(Result.Text);
}

 

C# PDF OCR

The same approach can similarly be used to extract text from any PDF document.

var Ocr = new IronTesseract();
using (var input = new OcrInput())
{

    input.AddPdf("example.pdf","password");
    // We can also select specific PDF page numnbers to OCR
        
    var Result = Ocr.Read(input);

    Console.WriteLine(Result.Text);
    Console.WriteLine($"{Result.Pages.Count()} Pages");
    // 1 page for every page of the PDF
}

 

C# OCR Working Code Examples

Product Compatible and additional computed target framework versions.
.NET net5.0 was computed.  net5.0-windows was computed.  net6.0 was computed.  net6.0-android was computed.  net6.0-ios was computed.  net6.0-maccatalyst was computed.  net6.0-macos was computed.  net6.0-tvos was computed.  net6.0-windows was computed.  net7.0 was computed.  net7.0-android was computed.  net7.0-ios was computed.  net7.0-maccatalyst was computed.  net7.0-macos was computed.  net7.0-tvos was computed.  net7.0-windows was computed.  net8.0 was computed.  net8.0-android was computed.  net8.0-browser was computed.  net8.0-ios was computed.  net8.0-maccatalyst was computed.  net8.0-macos was computed.  net8.0-tvos was computed.  net8.0-windows was computed. 
.NET Core netcoreapp2.0 was computed.  netcoreapp2.1 was computed.  netcoreapp2.2 was computed.  netcoreapp3.0 was computed.  netcoreapp3.1 was computed. 
.NET Standard netstandard2.0 is compatible.  netstandard2.1 was computed. 
.NET Framework net40 is compatible.  net403 was computed.  net45 was computed.  net451 was computed.  net452 was computed.  net46 was computed.  net461 was computed.  net462 was computed.  net463 was computed.  net47 was computed.  net471 was computed.  net472 was computed.  net48 was computed.  net481 was computed. 
MonoAndroid monoandroid was computed. 
MonoMac monomac was computed. 
MonoTouch monotouch was computed. 
Tizen tizen40 was computed.  tizen60 was computed. 
Xamarin.iOS xamarinios was computed. 
Xamarin.Mac xamarinmac was computed. 
Xamarin.TVOS xamarintvos was computed. 
Xamarin.WatchOS xamarinwatchos was computed. 
Compatible target framework(s)
Included target framework(s) (in package)
Learn more about Target Frameworks and .NET Standard.

NuGet packages (133)

Showing the top 5 NuGet packages that depend on IronOcr:

Package Downloads
IronOcr.Languages.German The ID prefix of this package has been reserved for one of the owners of this package by NuGet.org.

The IronOCR engine adds OCR (Optical Character Recognition) functionality to Web, Desktop, and Console applications. IronOCR reads Barcode and QR codes. Ocr Dictionaries in this package: * German * GermanBest * GermanFast * GermanFraktur ==================================== Deutschsprachige OCR in C# & .NET. Optimierte C# Tesseract 5 OCR in einer eigenständigen .NET OCR-API. Konvertiert Scannerdokumente, Bilder und PDF in Text. C# & VB Beispiele: https://ironsoftware.com/csharp/ocr/languages/ ==================================== This package installs IronOCR and also German support including: * German (also known as Deutsch) OCR for screenshots, cameras, images files, tiffs and PDFs in .NET * Custom OCR that can significantly out-perform Tesseract CLI on real world documents * Can read scans with distortion, skewing, low resolution & contrast, and digital noise * Also supports Tesseract 3, 4 and 5 in German * Support for 125 total international languages available Additional Features Include: * Barcode & QR Reading * Output of searchable, search-engine indexable PDF documents * Inspect fonts, headings, paragraphs, lines, words, and characters as structured data Supports: * .NET Framework (4.5+) * .NET Core (2.0+) * .NET Standard (2.0+) Works on: * Windows * MacOS * Linux * Docker * Azure and other Cloud hosting platforms * Web, Console, WinForms, WPF and Services Reads: - Images - TIFFS - PDFs - Screenshots - Scans - Barcodes - QR codes Commercial support available. Email: developers@ironsoftware.com C# & VB Examples: https://ironsoftware.com/csharp/ocr/languages/

IronOcr.Languages.Japanese The ID prefix of this package has been reserved for one of the owners of this package by NuGet.org.

The IronOCR engine adds OCR (Optical Character Recognition) functionality to Web, Desktop, and Console applications. IronOCR reads Barcode and QR codes. Ocr Dictionaries in this package: * JapaneseAlphabet * JapaneseAlphabetBest * JapaneseAlphabetFast * JapaneseVerticalAlphabet * JapaneseVerticalAlphabetBest * JapaneseVerticalAlphabetFast * Japanese * JapaneseBest * JapaneseFast * JapaneseVertical * JapaneseVerticalBest * JapaneseVerticalFast ==================================== C#および.NETの日本語OCR。スタンドアロンの.NETOCR APIで最適化されたC#Tesseract 5OCR。スキャナーのドキュメント、画像、PDFをテキストに変換します。 C#とVBの例:https://ironsoftware.com/csharp/ocr/languages/ ==================================== This package installs IronOCR and also Japanese support including: * Japanese (also known as 日本語 (にほんご)) OCR for screenshots, cameras, images files, tiffs and PDFs in .NET * Custom OCR that can significantly out-perform Tesseract CLI on real world documents * Can read scans with distortion, skewing, low resolution & contrast, and digital noise * Also supports Tesseract 3, 4 and 5 in Japanese * Support for 125 total international languages available Additional Features Include: * Barcode & QR Reading * Output of searchable, search-engine indexable PDF documents * Inspect fonts, headings, paragraphs, lines, words, and characters as structured data Supports: * .NET Framework (4.5+) * .NET Core (2.0+) * .NET Standard (2.0+) Works on: * Windows * MacOS * Linux * Docker * Azure and other Cloud hosting platforms * Web, Console, WinForms, WPF and Services Reads: - Images - TIFFS - PDFs - Screenshots - Scans - Barcodes - QR codes Commercial support available. Email: developers@ironsoftware.com C# & VB Examples: https://ironsoftware.com/csharp/ocr/languages/

IronOcr.Languages.Spanish The ID prefix of this package has been reserved for one of the owners of this package by NuGet.org.

The IronOCR engine adds OCR (Optical Character Recognition) functionality to Web, Desktop, and Console applications. IronOCR reads Barcode and QR codes. Ocr Dictionaries in this package: * Spanish * SpanishBest * SpanishFast * SpanishOld * SpanishOldBest * SpanishOldFast ==================================== OCR en español en C# y .NET. C# Tesseract 5 OCR optimizado en una API de .NET OCR independiente. Convierte documentos, imágenes y PDF del escáner en texto. Ejemplos de C# y VB: https://ironsoftware.com/csharp/ocr/languages/ ==================================== This package installs IronOCR and also Spanish support including: * Spanish (also known as Español, Castellano) OCR for screenshots, cameras, images files, tiffs and PDFs in .NET * Custom OCR that can significantly out-perform Tesseract CLI on real world documents * Can read scans with distortion, skewing, low resolution & contrast, and digital noise * Also supports Tesseract 3, 4 and 5 in Spanish * Support for 125 total international languages available Additional Features Include: * Barcode & QR Reading * Output of searchable, search-engine indexable PDF documents * Inspect fonts, headings, paragraphs, lines, words, and characters as structured data Supports: * .NET Framework (4.5+) * .NET Core (2.0+) * .NET Standard (2.0+) Works on: * Windows * MacOS * Linux * Docker * Azure and other Cloud hosting platforms * Web, Console, WinForms, WPF and Services Reads: - Images - TIFFS - PDFs - Screenshots - Scans - Barcodes - QR codes Commercial support available. Email: developers@ironsoftware.com C# & VB Examples: https://ironsoftware.com/csharp/ocr/languages/

IronOcr.Languages.Arabic The ID prefix of this package has been reserved for one of the owners of this package by NuGet.org.

The IronOCR engine adds OCR (Optical Character Recognition) functionality to Web, Desktop, and Console applications. IronOCR reads Barcode and QR codes. Ocr Dictionaries in this package: * Arabic * ArabicBest * ArabicFast * ArabicAlphabet * ArabicAlphabetBest * ArabicAlphabetFast ==================================== OCR للغة العربية في C# & .NET. محسن C# Tesseract 5 OCR في .NET OCR API مستقل. يحول مستندات الماسح الضوئي والصور و PDF إلى نص. أمثلة على C# و VB: https://ironsoftware.com/csharp/ocr/languages/ ==================================== This package installs IronOCR and also Arabic support including: * Arabic (also known as العربية) OCR for screenshots, cameras, images files, tiffs and PDFs in .NET * Custom OCR that can significantly out-perform Tesseract CLI on real world documents * Can read scans with distortion, skewing, low resolution & contrast, and digital noise * Also supports Tesseract 3, 4 and 5 in Arabic * Support for 125 total international languages available Additional Features Include: * Barcode & QR Reading * Output of searchable, search-engine indexable PDF documents * Inspect fonts, headings, paragraphs, lines, words, and characters as structured data Supports: * .NET Framework (4.5+) * .NET Core (2.0+) * .NET Standard (2.0+) Works on: * Windows * MacOS * Linux * Docker * Azure and other Cloud hosting platforms * Web, Console, WinForms, WPF and Services Reads: - Images - TIFFS - PDFs - Screenshots - Scans - Barcodes - QR codes Commercial support available. Email: developers@ironsoftware.com C# & VB Examples: https://ironsoftware.com/csharp/ocr/languages/

IronOcr.Languages.French The ID prefix of this package has been reserved for one of the owners of this package by NuGet.org.

The IronOCR engine adds OCR (Optical Character Recognition) functionality to Web, Desktop, and Console applications. IronOCR reads Barcode and QR codes. Ocr Dictionaries in this package: * French * FrenchBest * FrenchFast ==================================== OCR de langue française en C# & .NET. OCR C# Tesseract 5 optimisé dans une API OCR .NET autonome. Convertit les documents du scanner, les images et les PDF en texte. Exemples C# et VB: https://ironsoftware.com/csharp/ocr/languages/ ==================================== This package installs IronOCR and also French support including: * French (also known as Français, Langue Française) OCR for screenshots, cameras, images files, tiffs and PDFs in .NET * Custom OCR that can significantly out-perform Tesseract CLI on real world documents * Can read scans with distortion, skewing, low resolution & contrast, and digital noise * Also supports Tesseract 3, 4 and 5 in French * Support for 125 total international languages available Additional Features Include: * Barcode & QR Reading * Output of searchable, search-engine indexable PDF documents * Inspect fonts, headings, paragraphs, lines, words, and characters as structured data Supports: * .NET Framework (4.5+) * .NET Core (2.0+) * .NET Standard (2.0+) Works on: * Windows * MacOS * Linux * Docker * Azure and other Cloud hosting platforms * Web, Console, WinForms, WPF and Services Reads: - Images - TIFFS - PDFs - Screenshots - Scans - Barcodes - QR codes Commercial support available. Email: developers@ironsoftware.com C# & VB Examples: https://ironsoftware.com/csharp/ocr/languages/

GitHub repositories

This package is not used by any popular GitHub repositories.

Version Downloads Last updated
2024.3.4 5,995 3/8/2024
2024.2.41 9,428 1/29/2024
2024.1.17 9,312 12/29/2023
2023.12.34 44,321 11/27/2023
2023.11.35 9,877 10/27/2023
2023.10.9 11,753 9/26/2023
2023.9.4 6,494 9/8/2023
2023.8.34 32,222 8/1/2023
2023.7.28 48,857 7/3/2023
2023.6.6 34,834 5/29/2023
2023.5.35 14,950 5/1/2023
2023.4.13 17,193 4/6/2023
2023.3.2 31,117 3/1/2023
2023.1.11644 21,798 1/18/2023
2022.12.10830 31,068 12/5/2022
2022.11.10109 56,042 10/26/2022
2022.10.9390 13,680 9/27/2022
2022.8.8198 40,325 8/18/2022
2022.8.7804 65,608 7/26/2022
2022.3.0 153,620 3/10/2022
2022.1.0 45,392 1/17/2022
2021.12.0 19,976 12/21/2021
2021.11.0 166,472 10/29/2021
2021.9.0 25,736 8/24/2021
2021.6.0 28,985 6/24/2021
2021.2.1 38,643 2/24/2021
2020.12.2 26,098 12/14/2020
2020.11.2 109,224 11/13/2020
4.4.0 290,670 6/21/2018
4.3.0.1 35,842 4/9/2018
4.2.2.51 5,532 1/22/2018
4.2.2.1 4,300 12/1/2017
4.2.1.5 5,545 9/9/2017
4.1.1 9,200 8/4/2017
4.0.10 3,040 1/12/2017
4.0.9 2,651 12/20/2016

* Improved: Speed and Stability
* Improved: .Net 5 and .NET 6 Beta support
* Improved: Logging