From 6f3f909ef410852713cc3865aa72e65eb21f9323 Mon Sep 17 00:00:00 2001 From: Raphael Taylor-Davies <1781103+tustvold@users.noreply.github.com> Date: Tue, 14 Nov 2023 05:16:22 +0000 Subject: [PATCH] PARQUET-2369: Clarify GZIP members (#218) --- Compression.md | 4 ++++ 1 file changed, 4 insertions(+) diff --git a/Compression.md b/Compression.md index 43abe8c7..cfd4a3a1 100644 --- a/Compression.md +++ b/Compression.md @@ -58,6 +58,10 @@ formats) defined by [RFC 1952](https://tools.ietf.org/html/rfc1952). If any ambiguity arises when implementing this format, the implementation provided by the [zlib compression library](https://zlib.net/) is authoritative. +Readers should support reading pages containing multiple GZIP members, however, +as this has historically not been supported by all implementations, it is recommended +that writers refrain from creating such pages by default for better interoperability. + ### LZO A codec based on or interoperable with the