Skip to content

Conversation

@jichen20210919
Copy link
Contributor

[arrow] Improve customization capabilities for data type conversion between Paimon and Arrow formats.(#6694).

Purpose

paimon-arrow module has a good design for paimon-arrow field type conversions, but the top level class has the fixed type conversions, in some use cases, developers may want to customize the field type conversion process to meet the use-case.

Linked issue: open #6693

Tests

add org.apache.paimon.arrow.vector.ArrowFormatWriterTest#testCustomArrowFormatWriter

API and Format

No

Documentation

No

Copy link
Contributor

@JingsongLi JingsongLi left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

+1

@JingsongLi JingsongLi merged commit f8f7091 into apache:master Dec 2, 2025
22 checks passed
jerry-024 added a commit to jerry-024/paimon that referenced this pull request Dec 2, 2025
* upstream/master:
  [test] Fix test SparkWriteITCase.testTruncatePartitionValueNull
  [orc] Limiting Memory Usage of OrcBulkWriter When Writing VectorizedRowBatch (apache#6590)
  [arrow] Improve customization capabilities for data type conversion. (apache#6695)
  [spark] Fix NPE in spark truncate null partitions
  [core] Exclude .class files from sources.jar (apache#6707)
  [core] DataEvolutionFileStoreScan should not filter files by read type when it contains no physical columns. (apache#6714)
  [spark] Update scalafmt version to 3.10.2 (apache#6709)
  [variant] Extract only required columns when reading shredded variants (apache#6720)
  [python] Fix read large volume of blob data (apache#6701)
  [flink] Support cdc source (apache#6606)
  [hive] fix splitting for bucket tables (apache#6594)
  [spark] Update spark build topology for global index (2) (apache#6703)
  [test][spark] Fix the flaky test setDefaultDatabase (apache#6696)
  [spark] Update global index build topology (apache#6700)
  [spark] Introduce global file index builder on spark (apache#6684)
  [python] Fix with_shard feature for blob data (apache#6691)
  [test][spark] Add alter with incompatible col type test case (apache#6689)
  [variant] Introduce withVariantAccess in ReadBuilder (apache#6685)
  [python] Fix file name prefix in postpone mode. (apache#6668)
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants