
Document bulk ingestion and write parallelism#193

Draft
wjones127 wants to merge 2 commits into main from docs/bulk-ingestion

Conversation

@wjones127

Summary

  • Rewrites "Use Iterators / Write Large Datasets" → "Loading Large Datasets" with subsections for file-based ingestion (pyarrow.dataset), iterator-based ingestion, and write parallelism behavior
  • Updates FAQ "How can I speed up data inserts?" with specific guidance on auto-parallelism and the create-empty-then-add pattern
  • Adds a test for passing a pyarrow.dataset to table.add()

Related: bulk-ingestion-7e70a4dab825, lancedb/lancedb#3173

Test plan

  • pytest tests/py/test_tables.py — all 48 tests pass
  • Snippets regenerated via python scripts/mdx_snippets_gen.py
  • Preview with npx mintlify dev

🤖 Generated with Claude Code

@mintlify
Contributor

mintlify bot commented Mar 20, 2026

Preview deployment for your docs. Learn more about Mintlify Previews.

Project | Status | Preview | Updated (UTC)
--- | --- | --- | ---
lancedb-bcbb4faf | 🟢 Ready | View Preview | Mar 20, 2026, 7:31 PM

Contributor

@prrao87 prrao87 left a comment


This requires lancedb==0.30.0 to work, right? Could you bump the versions for deps in pyproject.toml as needed? Thanks!

@prrao87
Contributor

prrao87 commented Mar 25, 2026

Hi @wjones127 , is this ready for review? Or has it not yet been officially released in the stable version?

@wjones127
Author

> Hi @wjones127 , is this ready for review? Or has it not yet been officially released in the stable version?

Hi, this is not ready for review. In general, I leave my PRs in draft until they are ready for review.

wjones127 and others added 2 commits April 2, 2026 15:34
`table.add()` now auto-parallelizes large writes, but the docs still showed
only the old iterator-based pattern. This rewrites the "Use Iterators" section
into "Loading Large Datasets" with guidance on `pyarrow.dataset` input, the
create-empty-then-add pattern, and auto-parallelism behavior. Updates the FAQ
to match.

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>