GroupDocs.Parser 26.6.1

Prefix Reserved
dotnet add package GroupDocs.Parser --version 26.6.1
                    
NuGet\Install-Package GroupDocs.Parser -Version 26.6.1
                    
This command is intended to be used within the Package Manager Console in Visual Studio, as it uses the NuGet module's version of Install-Package.
<PackageReference Include="GroupDocs.Parser" Version="26.6.1" />
                    
For projects that support PackageReference, copy this XML node into the project file to reference the package.
<PackageVersion Include="GroupDocs.Parser" Version="26.6.1" />
                    
Directory.Packages.props
<PackageReference Include="GroupDocs.Parser" />
                    
Project file
For projects that support Central Package Management (CPM), copy this XML node into the solution Directory.Packages.props file to version the package.
paket add GroupDocs.Parser --version 26.6.1
                    
#r "nuget: GroupDocs.Parser, 26.6.1"
                    
#r directive can be used in F# Interactive and Polyglot Notebooks. Copy this into the interactive tool or source code of the script to reference the package.
#:package GroupDocs.Parser@26.6.1
                    
#:package directive can be used in C# file-based apps starting in .NET 10 preview 4. Copy this into a .cs file before any lines of code to reference the package.
#addin nuget:?package=GroupDocs.Parser&version=26.6.1
                    
Install as a Cake Addin
#tool nuget:?package=GroupDocs.Parser&version=26.6.1
                    
Install as a Cake Tool

GroupDocs.Parser for .NET

GroupDocs.Parser is a high-performance .NET API for extracting text, metadata, and images from PDFs, Office files, and emails across platforms. This package automatically selects the correct runtime assemblies for your target framework.

Quick Start

using (Parser parser = new Parser("sample.pdf"))
{
	string text = parser.GetText();
	Console.WriteLine(text);
}

Runtime Selection (TFM Split)

GroupDocs.Parser is a lightweight router (metapackage). When you install it, NuGet pulls exactly one runtime package matching your project’s target framework.

Available Runtime Packages

Package Target Framework
GroupDocs.Parser.Net462 .NET Framework 4.6.2
GroupDocs.Parser.NetStandard21 .NET Standard 2.1
GroupDocs.Parser.Net60 .NET 6.0
GroupDocs.Parser.Net80 .NET 8.0

You can also install a specific runtime package directly if you want full control.

Examples

Extracting text from a PDF document

This code loads a PDF file (sample.pdf) and extracts its text content using the GetText() method. The extracted text is then displayed in the console.

using GroupDocs.Parser;
using GroupDocs.Parser.Options;

// Load the PDF document
using (Parser parser = new Parser("sample.pdf"))
{
	// Extract text from the document
	string text = parser.GetText();
	
	// Output the extracted text
	Console.WriteLine(text);
}

Extracting images from a Word document

This code loads a Word document (sample.docx) and extracts all images found within the document. Each image is saved as a separate PNG file.

using GroupDocs.Parser;
using GroupDocs.Parser.Data;

// Load the Word document
using (Parser parser = new Parser("sample.docx"))
{
	// Get images from the document
	IEnumerable<PageImageArea> images = parser.GetImages();
	
	// Save each image to a file
	int imageNumber = 1;
	foreach (PageImageArea image in images)
	{
		image.Save($"image{imageNumber++}.png");
	}
}

Parsing metadata from an Excel spreadsheet

This code loads an Excel spreadsheet (sample.xlsx) and extracts its metadata, such as author, title, and creation date. The metadata is then displayed in the console.

using GroupDocs.Parser;
using GroupDocs.Parser.Data;

// Load the Excel spreadsheet
using (Parser parser = new Parser("sample.xlsx"))
{
	// Get document's metadata
	IEnumerable<MetadataItem> metadata = parser.GetMetadata();
	
	// Output the metadata
	foreach (var item in metadata)
	{
		Console.WriteLine($"{item.Name}: {item.Value}");
	}
}

Features

  • Extract text
  • Extract images
  • Extract tables
  • Extract hyperlinks
  • Extract data from email attachments and archives
  • Scan QR codes
  • Parse PDF forms
  • Parse data by templates
  • Search text in documents

Supported Formats (high level)

This API supports a broad set of document formats, including:

  • Word Processing: DOC, DOCX, DOCM, RTF, ODT
  • Spreadsheets: XLS, XLSX, XLSM, ODS, CSV
  • Presentations: PPT, PPTX, PPS, PPSX
  • PDF: PDF
  • Images: JPG, BMP, PNG, TIFF, GIF
  • Archives: ZIP, TAR, 7Z
  • Web: HTML, MHTML
  • e-Books: CHM, EPUB, FB2, MOBI

For the full list, see the product documentation.

Product Compatible and additional computed target framework versions.
.NET net5.0 was computed.  net5.0-windows was computed.  net6.0 was computed.  net6.0-android was computed.  net6.0-ios was computed.  net6.0-maccatalyst was computed.  net6.0-macos was computed.  net6.0-tvos was computed.  net6.0-windows was computed.  net7.0 was computed.  net7.0-android was computed.  net7.0-ios was computed.  net7.0-maccatalyst was computed.  net7.0-macos was computed.  net7.0-tvos was computed.  net7.0-windows was computed.  net8.0 was computed.  net8.0-android was computed.  net8.0-browser was computed.  net8.0-ios was computed.  net8.0-maccatalyst was computed.  net8.0-macos was computed.  net8.0-tvos was computed.  net8.0-windows was computed.  net9.0 was computed.  net9.0-android was computed.  net9.0-browser was computed.  net9.0-ios was computed.  net9.0-maccatalyst was computed.  net9.0-macos was computed.  net9.0-tvos was computed.  net9.0-windows was computed.  net10.0 was computed.  net10.0-android was computed.  net10.0-browser was computed.  net10.0-ios was computed.  net10.0-maccatalyst was computed.  net10.0-macos was computed.  net10.0-tvos was computed.  net10.0-windows was computed. 
.NET Core netcoreapp3.0 was computed.  netcoreapp3.1 was computed. 
.NET Standard netstandard2.1 is compatible. 
.NET Framework net462 is compatible.  net463 was computed.  net47 was computed.  net471 was computed.  net472 was computed.  net48 was computed.  net481 was computed. 
MonoAndroid monoandroid was computed. 
MonoMac monomac was computed. 
MonoTouch monotouch was computed. 
Tizen tizen60 was computed. 
Xamarin.iOS xamarinios was computed. 
Xamarin.Mac xamarinmac was computed. 
Xamarin.TVOS xamarintvos was computed. 
Xamarin.WatchOS xamarinwatchos was computed. 
Compatible target framework(s)
Included target framework(s) (in package)
Learn more about Target Frameworks and .NET Standard.

NuGet packages (1)

Showing the top 1 NuGet packages that depend on GroupDocs.Parser:

Package Downloads
Shoremem.Core

Package Description

GitHub repositories

This package is not used by any popular GitHub repositories.

Version Downloads Last Updated
26.6.1 90 7/1/2026
26.6.0 89 7/1/2026
26.4.0 357 4/29/2026
26.3.0 335 3/30/2026
26.2.0 339 2/28/2026
25.12.1 2,199 12/24/2025
25.12.0 948 12/10/2025
25.11.0 535 11/28/2025
25.10.1 1,395 11/12/2025
25.10.0 571 11/7/2025
25.8.0 2,257 8/28/2025
25.7.0 1,338 7/24/2025
25.2.0 7,183 2/26/2025
25.1.0 6,311 1/30/2025
24.12.0 447,861 12/24/2024
24.11.0 11,194 11/29/2024
24.10.0 3,432 11/1/2024
24.9.0 4,753 9/30/2024
24.8.0 77,605 8/30/2024
Loading failed