Parquet.Net 3.2.0

Pure .NET library to read and write Apache Parquet files, targeting .NET 4.5 and .NET Standand 1.4 and up. Linux, Windows and Mac are first class citizens, but also works everywhere .NET is running (Android, iOS, IOT). Has zero dependencies on thrid-party libraries or any native code. Provides both low-level access to Apache Parquet files, and high-level utilities for more traditional and humanly understandable row-based access. Includes automatic serializer/deserializer from C# classes into parquet files that works by generating MSIL (bytecode) on the fly and is therefore super fast.

There is a newer version of this package available.
See the version list below for details.
Install-Package Parquet.Net -Version 3.2.0
dotnet add package Parquet.Net --version 3.2.0
<PackageReference Include="Parquet.Net" Version="3.2.0" />
For projects that support PackageReference, copy this XML node into the project file to reference the package.
paket add Parquet.Net --version 3.2.0
The NuGet Team does not provide support for this client. Please contact its maintainers for support.

Release Notes

3.2.0:
- new feature: POCO serialiser support for repeatable fields POCO (#358)
- bug fixed: --max-rows 10 not honored by PARQ Global Tool 📺 (#357)
- bug fixed: failure to read columns if data page is larger than it should be (supposably padded by Spark) 🐛
(#330)
- improvement: Limit number of rows printed by parq. By default only show the first 10 rows in PARQ Global Tool 📺
(#351)

3.1.3, 3.1.4:
- includes massive performance improvements in parquet reader, now we are faster than fastparquet (python lib)  

3.1.2:
- new feature: replaced default ToString() method in Table and Row object to produce json (#346)
- new feature: parquet CLI supports conversion from parquet to json (#341)

3.1.1:
parq cli improvements

3.1.0:
- re-introducing utilities for row-based access allowing you to access and create parquet files in more readable format.
- Field class now supports MaxRepetitionLevel and MaxDefinitionLevel
- fixed bug #334 preventing reading generated files in Impala
- parquet.net library supports SourceLink

3.0.5:
- #321 bug fixed: a nullable field should support all-non-nullable values
- performance improvement around packing definition levels

3.0.4:
- bug fixed: Cannot read schema where map elements are structures (#320)

3.0.3
- critical bug fixed: reading parquet files with multiple pages doesn't read beyond 1st page (#318)
   
3.0.2
- performance improvements (#317)

3.0.1
- improvement: better column validation in row group writer
- bug fixed: Snappy compression writer fails on certain encodings (#315)

3.0.0
the first release of a major rewrite

Showing the top 1 GitHub repositories that depend on Parquet.Net:

Repository Stars
dotnet/machinelearning
ML.NET is an open source and cross-platform machine learning framework for .NET.

Version History

Version Downloads Last updated
3.3.10 128 11/6/2019
3.3.9 30,600 8/15/2019
3.3.8 2,552 8/1/2019
3.3.7 76 8/1/2019
3.3.6 81 7/31/2019
3.3.5 2,928 7/5/2019
3.3.4 76,473 3/11/2019
3.3.3 7,482 2/1/2019
3.3.2 7,755 1/21/2019
3.3.1 524 1/14/2019
3.3.0 296 1/11/2019
3.2.6 162 1/11/2019
3.2.5 1,851 1/3/2019
3.2.4 1,938 11/21/2018
3.2.3 6,721 11/7/2018
3.2.2 696 10/30/2018
3.2.1 167 10/30/2018
3.2.0 565 10/24/2018
3.1.4 341 10/15/2018
3.1.3 165 10/15/2018
3.1.2 1,439 10/11/2018
3.1.1 505 10/4/2018
3.1.0 199 10/3/2018
3.1.0-preview-390 150 10/3/2018
3.1.0-preview-373 253 10/2/2018
3.0.5 3,877 8/13/2018
3.0.4 322 7/25/2018
3.0.3 183 7/25/2018
3.0.2 676 7/24/2018
3.0.1 181 7/24/2018
3.0.0 682 7/19/2018
2.1.4 25,384 6/7/2018
2.1.3 142,166 3/30/2018
2.1.2 9,530 1/10/2018
2.1.1 18,926 12/1/2017
2.1.0 477 11/29/2017
2.0.1 243 11/27/2017
2.0.0 298 11/27/2017
1.5.1 728 11/14/2017
1.4.0 2,778 10/23/2017
1.3.0 1,595 9/12/2017
1.2.139 343 9/6/2017
1.1.128 322 8/15/2017
1.0.114 293 7/31/2017
Show less