Support improved chunking_config in File Search Store for languages without word spacing (e.g., Japanese/Chinese) #1935

@motomura-s

Description

Problem

When importing text into a File Search Store, the current chunking_config only supports white_space_config, which assumes space-separated tokens.
This does not work well for languages that do not put spaces between words (e.g., Japanese, Chinese), and mixed-language content (e.g., Japanese + English) is also not chunked consistently.
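For reference, this is roughly how chunking is configured today with the google-genai Python SDK (method and field names follow the current File Search documentation; the store name, file path, and token limits below are illustrative and may differ by SDK version):

```python
from google import genai

client = genai.Client()

# Create a File Search Store and import a document into it.
store = client.file_search_stores.create(config={'display_name': 'docs-store'})

# white_space_config is currently the only chunking strategy. Its token
# counting assumes space-separated words, so the limits below are not
# applied reliably to Japanese or Chinese text.
client.file_search_stores.upload_to_file_search_store(
    file='manual_ja.txt',  # illustrative file name
    file_search_store_name=store.name,
    config={
        'chunking_config': {
            'white_space_config': {
                'max_tokens_per_chunk': 200,
                'max_overlap_tokens': 20,
            }
        }
    },
)
```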

Desired Behavior

  • Honor max_tokens_per_chunk and max_overlap_tokens correctly regardless of whether the text contains word boundaries.
  • Provide better control over chunk size and boundaries for mixed-language text (a client-side workaround is sketched below).
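Until the API supports this natively, one possible client-side workaround is to pre-chunk CJK text by character count with overlap before uploading each piece. This is only a sketch of the idea, not part of the API; the chunk_cjk_text helper and its size/overlap values are hypothetical:

```python
def chunk_cjk_text(text: str, max_chars: int = 400, overlap_chars: int = 40) -> list[str]:
    """Split text without relying on whitespace word boundaries.

    Character-based splitting is a rough proxy for token limits in
    Japanese/Chinese, where each character maps to one or more tokens.
    """
    if overlap_chars >= max_chars:
        raise ValueError('overlap_chars must be smaller than max_chars')
    chunks = []
    step = max_chars - overlap_chars
    for start in range(0, len(text), step):
        chunk = text[start:start + max_chars]
        if chunk:
            chunks.append(chunk)
        if start + max_chars >= len(text):
            break
    return chunks

# Example: 1000 characters of spaceless text still yields bounded,
# overlapping chunks, unlike whitespace-based chunking.
print([len(c) for c in chunk_cjk_text('あ' * 1000)])  # [400, 400, 280]
```

A natural refinement would be to prefer splitting at sentence-ending punctuation (。！？) near the chunk boundary, but the fixed-width version above already keeps chunk sizes bounded.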

Metadata

Labels

  • priority: p3 (Desirable enhancement or fix. May not be included in next release.)
  • type: feature request ('Nice-to-have' improvement, new feature or different behavior or design.)
