[Dropped] Adding encrypt_with_per_column_encryption.parquet.encrypted to test Parquet Modular Encryption (PME) per-column encryption#99
Conversation
…arquet Modular Encryption (PME) per-column encryption (apache#99)
dd79481 to
fb2c9d3
Compare
I think this is not compatible with the parquet spec, https://parquet.apache.org/docs/file-format/data-pages/encryption/ . |
Hi - our work will propose an extension to allow algorithms to be specified at column-level. We currently have a fork of apache/arrow where we're performing this work (as mentioned in the description of this PR). We would like to avoid also having to have a fork of parquet-testing also, thus this PR. |
|
Dropping this ticket. Once the extensions for per-column algorithm have been discussed with the community, we can bring the discussion of adding the specific tests to |
What
My team is developing an extension to the C++ Parquet Modular Encryption (PME). In this work we are developing two new features:
In this PR, we add a new encrypted test file to
data/that specifically exercises per-column encryption. This test validates that readers can handle files where different columns are encrypted independently (potentially with different algorithms/configurations), complementing existing PME test vectors.File Description
orderId,productId,price,vat,customer_name.orderId: (using test, XOR encryption)productId: AES_GCM_V1 via column_keys mapping to key id productid_key.