Parquet.Net 3.2.1

Pure .NET library to read and write Apache Parquet files, targeting .NET 4.5 and .NET Standand 1.4 and up. Linux, Windows and Mac are first class citizens, but also works everywhere .NET is running (Android, iOS, IOT). Has zero dependencies on thrid-party libraries or any native code. Provides both low-level access to Apache Parquet files, and high-level utilities for more traditional and humanly understandable row-based access. Includes automatic serializer/deserializer from C# classes into parquet files that works by generating MSIL (bytecode) on the fly and is therefore super fast.

There is a newer version of this package available.
See the version list below for details.
Install-Package Parquet.Net -Version 3.2.1
dotnet add package Parquet.Net --version 3.2.1
<PackageReference Include="Parquet.Net" Version="3.2.1" />
For projects that support PackageReference, copy this XML node into the project file to reference the package.
paket add Parquet.Net --version 3.2.1
The NuGet Team does not provide support for this client. Please contact its maintainers for support.
#r "nuget: Parquet.Net, 3.2.1"
For F# scripts that support #r syntax, copy this into the source code to reference the package.

Release Notes

3.2.0:
- new feature: POCO serialiser support for repeatable fields POCO (#358)
- bug fixed: --max-rows 10 not honored by PARQ Global Tool 📺 (#357)
- bug fixed: failure to read columns if data page is larger than it should be (supposably padded by Spark) 🐛
(#330)
- improvement: Limit number of rows printed by parq. By default only show the first 10 rows in PARQ Global Tool 📺
(#351)

3.1.3, 3.1.4:
- includes massive performance improvements in parquet reader, now we are faster than fastparquet (python lib)  

3.1.2:
- new feature: replaced default ToString() method in Table and Row object to produce json (#346)
- new feature: parquet CLI supports conversion from parquet to json (#341)

3.1.1:
parq cli improvements

3.1.0:
- re-introducing utilities for row-based access allowing you to access and create parquet files in more readable format.
- Field class now supports MaxRepetitionLevel and MaxDefinitionLevel
- fixed bug #334 preventing reading generated files in Impala
- parquet.net library supports SourceLink

3.0.5:
- #321 bug fixed: a nullable field should support all-non-nullable values
- performance improvement around packing definition levels

3.0.4:
- bug fixed: Cannot read schema where map elements are structures (#320)

3.0.3
- critical bug fixed: reading parquet files with multiple pages doesn't read beyond 1st page (#318)
   
3.0.2
- performance improvements (#317)

3.0.1
- improvement: better column validation in row group writer
- bug fixed: Snappy compression writer fails on certain encodings (#315)

3.0.0
the first release of a major rewrite

NuGet packages (12)

Showing the top 5 NuGet packages that depend on Parquet.Net:

Package Downloads
Microsoft.DataPrep
Microsoft Azure Machine Learning Data Preparation SDK.
DataFrame.Math
A suite of libraries to provide stats functionality in dotnet for parquet and other formats
DataFrame.Formats
Package Description
Microsoft.ML.Parquet
ML.NET components for Apache Parquet support.
DataPipe.Serializer.Parquet
Provides Parquet support to the DataPipe platform. Reference this package if you need to read or write Parquet file from a data pipe script

GitHub repositories (3)

Showing the top 3 popular GitHub repositories that depend on Parquet.Net:

Repository Stars
dotnet/machinelearning
ML.NET is an open source and cross-platform machine learning framework for .NET.
Cinchoo/ChoETL
ETL Framework for .NET / c# (Parser / Writer for CSV, Flat, Xml, JSON, Key-Value, Parquet, Yaml, Avro formatted files)
mukunku/ParquetViewer
Simple windows desktop application for viewing & querying Apache Parquet files

Version History

Version Downloads Last updated
3.8.5 574 2/23/2021
3.8.4 14,727 12/13/2020
3.8.3 366 12/10/2020
3.8.2 60 12/10/2020
3.8.1 6,299 11/6/2020
3.8.0 101 11/6/2020
3.7.7 104,003 6/25/2020
3.7.6 6,403 6/16/2020
3.7.5 4,796 6/8/2020
3.7.4 19,920 5/19/2020
3.7.2 731 5/18/2020
3.7.1 12,379 4/21/2020
3.7.0 5,067 4/19/2020
3.6.0 219,709 1/23/2020
3.5.3 10,272 1/8/2020
3.5.2 1,156 1/3/2020
3.5.1 312 12/31/2019
3.5.0 5,704 12/18/2019
3.4.3 2,377 12/16/2019
3.4.2 924 12/13/2019
3.4.1 264 12/11/2019
3.4.0 252 12/11/2019
3.3.11 2,990 12/1/2019
3.3.10 13,356 11/6/2019
3.3.9 101,856 8/15/2019
3.3.8 5,922 8/1/2019
3.3.7 235 8/1/2019
3.3.6 255 7/31/2019
3.3.5 11,327 7/5/2019
3.3.4 113,551 3/11/2019
3.3.3 15,881 2/1/2019
3.3.2 20,466 1/21/2019
3.3.1 2,501 1/14/2019
3.3.0 985 1/11/2019
3.2.6 793 1/11/2019
3.2.5 2,597 1/3/2019
3.2.4 3,012 11/21/2018
3.2.3 13,334 11/7/2018
3.2.2 1,516 10/30/2018
3.2.1 408 10/30/2018
3.2.0 1,088 10/24/2018
3.1.4 586 10/15/2018
3.1.3 362 10/15/2018
3.1.2 5,091 10/11/2018
3.1.1 751 10/4/2018
3.1.0 599 10/3/2018
3.1.0-preview-390 357 10/3/2018
3.1.0-preview-373 539 10/2/2018
3.0.5 5,206 8/13/2018
3.0.4 589 7/25/2018
3.0.3 441 7/25/2018
3.0.2 927 7/24/2018
3.0.1 428 7/24/2018
3.0.0 1,057 7/19/2018
2.1.4 37,589 6/7/2018
2.1.3 233,620 3/30/2018
2.1.2 36,148 1/10/2018
2.1.1 32,205 12/1/2017
2.1.0 705 11/29/2017
2.0.1 481 11/27/2017
2.0.0 1,265 11/27/2017
1.5.1 1,060 11/14/2017
1.4.0 3,384 10/23/2017
1.3.0 2,643 9/12/2017
1.2.139 1,392 9/6/2017
1.1.128 689 8/15/2017
1.0.114 553 7/31/2017
Show less