[arrow] Improve customization capabilities for data type conversion. #6695

jichen20210919 · 2025-11-27T14:43:49Z

[arrow] Improve customization capabilities for data type conversion between Paimon and Arrow formats (#6694).

Purpose

The paimon-arrow module has a good design for converting field types between Paimon and Arrow, but the top-level class hard-codes the type conversions. In some use cases, developers may want to customize the field type conversion process to fit their needs.
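
To illustrate the kind of customization described above, here is a minimal, self-contained sketch. It is not the code from this PR and does not use any Paimon or Arrow API: `TypeConverter`, `DefaultTypeConverter`, `FormatWriter`, and the type-name strings are all hypothetical names chosen for illustration. It only shows the general idea of letting a top-level class accept a pluggable converter instead of fixing the conversions internally.

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical sketch: none of these names are Paimon or Arrow APIs.
public class TypeConversionSketch {

    /** Maps a source type name to a target type name (illustrative strings only). */
    interface TypeConverter {
        String convert(String sourceType);
    }

    /** Default conversions kept in one place. */
    static class DefaultTypeConverter implements TypeConverter {
        private static final Map<String, String> DEFAULTS = new HashMap<>();

        static {
            DEFAULTS.put("INT", "Int32");
            DEFAULTS.put("STRING", "Utf8");
        }

        @Override
        public String convert(String sourceType) {
            String target = DEFAULTS.get(sourceType);
            if (target == null) {
                throw new IllegalArgumentException("Unsupported type: " + sourceType);
            }
            return target;
        }
    }

    /** Top-level class that accepts a converter instead of hard-coding one. */
    static class FormatWriter {
        private final TypeConverter converter;

        /** Uses the default conversions. */
        FormatWriter() {
            this(new DefaultTypeConverter());
        }

        /** Lets the caller supply a custom conversion. */
        FormatWriter(TypeConverter converter) {
            this.converter = converter;
        }

        String targetTypeOf(String sourceType) {
            return converter.convert(sourceType);
        }
    }

    public static void main(String[] args) {
        TypeConverter defaults = new DefaultTypeConverter();

        // Custom converter: override one mapping, delegate the rest to the defaults.
        TypeConverter custom =
                source -> "STRING".equals(source) ? "LargeUtf8" : defaults.convert(source);

        System.out.println(new FormatWriter().targetTypeOf("STRING"));
        System.out.println(new FormatWriter(custom).targetTypeOf("STRING"));
        System.out.println(new FormatWriter(custom).targetTypeOf("INT"));
    }
}
```

In this sketch the custom converter overrides a single mapping and delegates everything else, so callers who do not need customization keep the default behaviour through the no-argument constructor.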

Linked issue: open #6693

Tests

Add org.apache.paimon.arrow.vector.ArrowFormatWriterTest#testCustomArrowFormatWriter

API and Format

No

Documentation

No

JingsongLi

+1

* upstream/master:
  [test] Fix test SparkWriteITCase.testTruncatePartitionValueNull
  [orc] Limiting Memory Usage of OrcBulkWriter When Writing VectorizedRowBatch (apache#6590)
  [arrow] Improve customization capabilities for data type conversion. (apache#6695)
  [spark] Fix NPE in spark truncate null partitions
  [core] Exclude .class files from sources.jar (apache#6707)
  [core] DataEvolutionFileStoreScan should not filter files by read type when it contains no physical columns. (apache#6714)
  [spark] Update scalafmt version to 3.10.2 (apache#6709)
  [variant] Extract only required columns when reading shredded variants (apache#6720)
  [python] Fix read large volume of blob data (apache#6701)
  [flink] Support cdc source (apache#6606)
  [hive] fix splitting for bucket tables (apache#6594)
  [spark] Update spark build topology for global index (2) (apache#6703)
  [test][spark] Fix the flaky test setDefaultDatabase (apache#6696)
  [spark] Update global index build topology (apache#6700)
  [spark] Introduce global file index builder on spark (apache#6684)
  [python] Fix with_shard feature for blob data (apache#6691)
  [test][spark] Add alter with incompatible col type test case (apache#6689)
  [variant] Introduce withVariantAccess in ReadBuilder (apache#6685)
  [python] Fix file name prefix in postpone mode. (apache#6668)

jichen20210919 added 3 commits November 27, 2025 22:37

[arrow] Improve customization capabilities for data type conversion b…

70c8151

…etween Paimon and Arrow formats(apache#6694).

[arrow] Improve customization capabilities for data type conversion b…

8eaa2e3

…etween Paimon and Arrow formats(apache#6695).

[arrow] Improve customization capabilities for data type conversion b…

9735e60

…etween Paimon and Arrow formats(apache#6695).

JingsongLi approved these changes Dec 2, 2025

JingsongLi merged commit f8f7091 into apache:master Dec 2, 2025
22 checks passed
