Parquet.Net
3.2.1
Pure .NET library to read and write Apache Parquet files, targeting .NET 4.5 and .NET Standand 1.4 and up. Linux, Windows and Mac are first class citizens, but also works everywhere .NET is running (Android, iOS, IOT). Has zero dependencies on thrid-party libraries or any native code. Provides both low-level access to Apache Parquet files, and high-level utilities for more traditional and humanly understandable row-based access. Includes automatic serializer/deserializer from C# classes into parquet files that works by generating MSIL (bytecode) on the fly and is therefore super fast.
See the version list below for details.
Install-Package Parquet.Net -Version 3.2.1
dotnet add package Parquet.Net --version 3.2.1
<PackageReference Include="Parquet.Net" Version="3.2.1" />
paket add Parquet.Net --version 3.2.1
#r "nuget: Parquet.Net, 3.2.1"
Release Notes
3.2.0:
- new feature: POCO serialiser support for repeatable fields POCO (#358)
- bug fixed: --max-rows 10 not honored by PARQ Global Tool 📺 (#357)
- bug fixed: failure to read columns if data page is larger than it should be (supposably padded by Spark) 🐛
(#330)
- improvement: Limit number of rows printed by parq. By default only show the first 10 rows in PARQ Global Tool 📺
(#351)
3.1.3, 3.1.4:
- includes massive performance improvements in parquet reader, now we are faster than fastparquet (python lib)
3.1.2:
- new feature: replaced default ToString() method in Table and Row object to produce json (#346)
- new feature: parquet CLI supports conversion from parquet to json (#341)
3.1.1:
parq cli improvements
3.1.0:
- re-introducing utilities for row-based access allowing you to access and create parquet files in more readable format.
- Field class now supports MaxRepetitionLevel and MaxDefinitionLevel
- fixed bug #334 preventing reading generated files in Impala
- parquet.net library supports SourceLink
3.0.5:
- #321 bug fixed: a nullable field should support all-non-nullable values
- performance improvement around packing definition levels
3.0.4:
- bug fixed: Cannot read schema where map elements are structures (#320)
3.0.3
- critical bug fixed: reading parquet files with multiple pages doesn't read beyond 1st page (#318)
3.0.2
- performance improvements (#317)
3.0.1
- improvement: better column validation in row group writer
- bug fixed: Snappy compression writer fails on certain encodings (#315)
3.0.0
the first release of a major rewrite
Dependencies
-
.NETFramework 4.5
- System.Buffers (>= 4.5.0)
- System.Memory (>= 4.5.1)
- System.Reflection.Emit.Lightweight (>= 4.3.0)
-
.NETStandard 1.4
- NETStandard.Library (>= 1.6.1)
- System.Buffers (>= 4.5.0)
- System.Memory (>= 4.5.1)
- System.Reflection.Emit.Lightweight (>= 4.3.0)
Used By
NuGet packages (12)
Showing the top 5 NuGet packages that depend on Parquet.Net:
Package | Downloads |
---|---|
Microsoft.DataPrep
Microsoft Azure Machine Learning Data Preparation SDK.
|
|
DataFrame.Math
A suite of libraries to provide stats functionality in dotnet for parquet and other formats
|
|
DataFrame.Formats
Package Description
|
|
Microsoft.ML.Parquet
ML.NET components for Apache Parquet support.
|
|
DataPipe.Serializer.Parquet
Provides Parquet support to the DataPipe platform. Reference this package if you need to read or write Parquet file from a data pipe script
|
GitHub repositories (3)
Showing the top 3 popular GitHub repositories that depend on Parquet.Net:
Repository | Stars |
---|---|
dotnet/machinelearning
ML.NET is an open source and cross-platform machine learning framework for .NET.
|
|
Cinchoo/ChoETL
ETL Framework for .NET / c# (Parser / Writer for CSV, Flat, Xml, JSON, Key-Value, Parquet, Yaml, Avro formatted files)
|
|
mukunku/ParquetViewer
Simple windows desktop application for viewing & querying Apache Parquet files
|
Version History
Version | Downloads | Last updated |
---|---|---|
3.8.5 | 574 | 2/23/2021 |
3.8.4 | 14,727 | 12/13/2020 |
3.8.3 | 366 | 12/10/2020 |
3.8.2 | 60 | 12/10/2020 |
3.8.1 | 6,299 | 11/6/2020 |
3.8.0 | 101 | 11/6/2020 |
3.7.7 | 104,003 | 6/25/2020 |
3.7.6 | 6,403 | 6/16/2020 |
3.7.5 | 4,796 | 6/8/2020 |
3.7.4 | 19,920 | 5/19/2020 |
3.7.2 | 731 | 5/18/2020 |
3.7.1 | 12,379 | 4/21/2020 |
3.7.0 | 5,067 | 4/19/2020 |
3.6.0 | 219,709 | 1/23/2020 |
3.5.3 | 10,272 | 1/8/2020 |
3.5.2 | 1,156 | 1/3/2020 |
3.5.1 | 312 | 12/31/2019 |
3.5.0 | 5,704 | 12/18/2019 |
3.4.3 | 2,377 | 12/16/2019 |
3.4.2 | 924 | 12/13/2019 |
3.4.1 | 264 | 12/11/2019 |
3.4.0 | 252 | 12/11/2019 |
3.3.11 | 2,990 | 12/1/2019 |
3.3.10 | 13,356 | 11/6/2019 |
3.3.9 | 101,856 | 8/15/2019 |
3.3.8 | 5,922 | 8/1/2019 |
3.3.7 | 235 | 8/1/2019 |
3.3.6 | 255 | 7/31/2019 |
3.3.5 | 11,327 | 7/5/2019 |
3.3.4 | 113,551 | 3/11/2019 |
3.3.3 | 15,881 | 2/1/2019 |
3.3.2 | 20,466 | 1/21/2019 |
3.3.1 | 2,501 | 1/14/2019 |
3.3.0 | 985 | 1/11/2019 |
3.2.6 | 793 | 1/11/2019 |
3.2.5 | 2,597 | 1/3/2019 |
3.2.4 | 3,012 | 11/21/2018 |
3.2.3 | 13,334 | 11/7/2018 |
3.2.2 | 1,516 | 10/30/2018 |
3.2.1 | 408 | 10/30/2018 |
3.2.0 | 1,088 | 10/24/2018 |
3.1.4 | 586 | 10/15/2018 |
3.1.3 | 362 | 10/15/2018 |
3.1.2 | 5,091 | 10/11/2018 |
3.1.1 | 751 | 10/4/2018 |
3.1.0 | 599 | 10/3/2018 |
3.1.0-preview-390 | 357 | 10/3/2018 |
3.1.0-preview-373 | 539 | 10/2/2018 |
3.0.5 | 5,206 | 8/13/2018 |
3.0.4 | 589 | 7/25/2018 |
3.0.3 | 441 | 7/25/2018 |
3.0.2 | 927 | 7/24/2018 |
3.0.1 | 428 | 7/24/2018 |
3.0.0 | 1,057 | 7/19/2018 |
2.1.4 | 37,589 | 6/7/2018 |
2.1.3 | 233,620 | 3/30/2018 |
2.1.2 | 36,148 | 1/10/2018 |
2.1.1 | 32,205 | 12/1/2017 |
2.1.0 | 705 | 11/29/2017 |
2.0.1 | 481 | 11/27/2017 |
2.0.0 | 1,265 | 11/27/2017 |
1.5.1 | 1,060 | 11/14/2017 |
1.4.0 | 3,384 | 10/23/2017 |
1.3.0 | 2,643 | 9/12/2017 |
1.2.139 | 1,392 | 9/6/2017 |
1.1.128 | 689 | 8/15/2017 |
1.0.114 | 553 | 7/31/2017 |