Parquet.Net 1.4.0

A .NET library to read and write Apache Parquet files.

There is a newer version of this package available.
See the version list below for details.
Package Manager:
Install-Package Parquet.Net -Version 1.4.0

.NET CLI:
dotnet add package Parquet.Net --version 1.4.0

PackageReference (for projects that support PackageReference, copy this XML node into the project file to reference the package):
<PackageReference Include="Parquet.Net" Version="1.4.0" />

Paket CLI (the NuGet Team does not provide support for this client; please contact its maintainers for support):
paket add Parquet.Net --version 1.4.0

Release Notes

v1.4

new features:
- data representation internally changed to a columnar format, resulting in a much smaller memory footprint and better performance (#238)
- added support for the short/ushort CLR types (#235)
- breaking change: nullable columns are easier to define. Parquet.Net no longer tries to guess nullability and gives full control to the user

improvements:
- BigDecimal type supports much bigger numbers (scale and precision) (#239)
- Writer failed on some forward-only streams, such as GZipStream, because they don't track stream position. Parquet.Net now supports all kinds of non-seekable streams for write operations (#252)
- Decimal number format is now compatible with Impala (#243)
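
To illustrate the non-seekable stream issue from #252: the sketch below shows that a GZipStream reports CanSeek as false and throws when its Position is read, and then shows one possible workaround, a wrapper that counts the bytes written through it. CountingWriteStream is a hypothetical helper written for this example. It is not part of Parquet.Net and is not claimed to be how the library solved the problem; only standard .NET APIs are used.

```csharp
using System;
using System.IO;
using System.IO.Compression;

// Hypothetical helper (not a Parquet.Net type): tracks how many bytes have
// been written so a caller can know its offset without asking the inner stream.
sealed class CountingWriteStream : Stream
{
    private readonly Stream _inner;
    public long BytesWritten { get; private set; }

    public CountingWriteStream(Stream inner) { _inner = inner; }

    public override bool CanRead => false;
    public override bool CanSeek => false;
    public override bool CanWrite => true;
    public override long Length => throw new NotSupportedException();

    // Position is served from our own counter, never from the inner stream.
    public override long Position
    {
        get => BytesWritten;
        set => throw new NotSupportedException();
    }

    public override void Flush() => _inner.Flush();
    public override int Read(byte[] buffer, int offset, int count) => throw new NotSupportedException();
    public override long Seek(long offset, SeekOrigin origin) => throw new NotSupportedException();
    public override void SetLength(long value) => throw new NotSupportedException();

    public override void Write(byte[] buffer, int offset, int count)
    {
        _inner.Write(buffer, offset, count);
        BytesWritten += count;
    }
}

static class Program
{
    static void Main()
    {
        using var sink = new MemoryStream();
        using var gzip = new GZipStream(sink, CompressionMode.Compress);

        // A forward-only stream: it cannot seek and does not expose a position.
        Console.WriteLine($"GZipStream.CanSeek: {gzip.CanSeek}");
        try
        {
            _ = gzip.Position;
        }
        catch (NotSupportedException)
        {
            Console.WriteLine("GZipStream.Position is not supported");
        }

        // Wrapping the stream lets the caller track the offset itself.
        using var counted = new CountingWriteStream(gzip);
        byte[] magic = { 0x50, 0x41, 0x52, 0x31 }; // ASCII "PAR1", the Parquet magic bytes
        counted.Write(magic, 0, magic.Length);
        Console.WriteLine($"Bytes written so far: {counted.Position}");
    }
}
```

A file format that records byte offsets in its metadata, as Parquet does, needs some way to know how far into the output it is, which is why a stream that cannot report its position is a problem for a writer.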

bug fixes:
- Writer was failing on columns consisting of only null values (#222)
- When appending to file and using DateTime for one of your columns, subsequent appends would fail on schema validation (#247)
- Failure when deserialising an Impala-generated file with all nulls in a column (#240)

v1.3
new features:
- the .NET byte and sbyte types are supported
- DataSet has a new .Merge method that merges two datasets, even if their rows and columns are incompatible

improvements:
- the dependency on Snappy.Sharp, which conflicted with projects targeting .NET 4.5, has been removed completely
- the dependency on System.ValueTuple has been removed
- the Apache Thrift dependency was replaced by a custom build which now has zero downstream dependencies. The original was causing problems for projects using ASP.NET Core and a specific Kestrel version because, stupidly enough, Apache Thrift was referencing a web hosting framework!

v1.2
new features:
- INT64 (C# long) type is supported (#194)
- Decimal datatype is fully supported (#209). This includes support for simple System.Decimal and for decimal types with different scales and precisions. Decimals are encoded using all three encodings from the Parquet specification; this can be switched off for compatibility with older systems. Decimals are fully compatible with Hive and Impala, which have some edge cases that do not comply with the Parquet specification. Thanks to @dmitryPavliv and @nzapolski for making this possible

bugs fixed:
- fixed a flaw in dictionary encoding implementation affecting files written for AWS Impala (#193)
- Parquet.Net crashed when a column contained only a single value and that value was null (#198)

v1.1
new features:
- Reader supports nested structures.
- Parquet output is now compatible with AWS Athena
- Writer can append data to an existing file

improvements:
- Parquet metadata sets page sizes according to the standard
- Schema and SchemaElement have a Show method that returns a human-readable representation

bug fixes:
- some files mix encodings between data pages, which resulted in wrong row counts and occasional crashes (#183)
- some string-encoded fields that were not annotated properly crashed the Parquet.Net reader (#138)

GitHub repositories that depend on Parquet.Net:

dotnet/machinelearning: ML.NET is an open source and cross-platform machine learning framework for .NET.

Version History

Version Downloads Last updated
3.6.0 2,848 1/23/2020
3.5.3 3,699 1/8/2020
3.5.2 278 1/3/2020
3.5.1 179 12/31/2019
3.5.0 1,667 12/18/2019
3.4.3 362 12/16/2019
3.4.2 158 12/13/2019
3.4.1 124 12/11/2019
3.4.0 70 12/11/2019
3.3.11 1,866 12/1/2019
3.3.10 4,459 11/6/2019
3.3.9 52,807 8/15/2019
3.3.8 3,715 8/1/2019
3.3.7 103 8/1/2019
3.3.6 108 7/31/2019
3.3.5 4,841 7/5/2019
3.3.4 90,022 3/11/2019
3.3.3 8,802 2/1/2019
3.3.2 12,098 1/21/2019
3.3.1 1,003 1/14/2019
3.3.0 413 1/11/2019
3.2.6 210 1/11/2019
3.2.5 1,987 1/3/2019
3.2.4 2,179 11/21/2018
3.2.3 8,370 11/7/2018
3.2.2 831 10/30/2018
3.2.1 195 10/30/2018
3.2.0 707 10/24/2018
3.1.4 367 10/15/2018
3.1.3 190 10/15/2018
3.1.2 2,477 10/11/2018
3.1.1 531 10/4/2018
3.1.0 272 10/3/2018
3.1.0-preview-390 175 10/3/2018
3.1.0-preview-373 291 10/2/2018
3.0.5 4,083 8/13/2018
3.0.4 356 7/25/2018
3.0.3 220 7/25/2018
3.0.2 714 7/24/2018
3.0.1 218 7/24/2018
3.0.0 756 7/19/2018
2.1.4 27,408 6/7/2018
2.1.3 173,190 3/30/2018
2.1.2 13,380 1/10/2018
2.1.1 21,995 12/1/2017
2.1.0 516 11/29/2017
2.0.1 281 11/27/2017
2.0.0 340 11/27/2017
1.5.1 863 11/14/2017
1.4.0 2,830 10/23/2017
1.3.0 1,756 9/12/2017
1.2.139 383 9/6/2017
1.1.128 364 8/15/2017
1.0.114 335 7/31/2017