feat: DeepSeek V3.2 tool calling support #4822

vladnosiv · 2025-12-09T13:53:34Z

Overview:

Depends on #4797

Add DeepSeek V3.2 tool call parser support

Details:

Implemented DSML (DeepSeek Markup Language) parser for V3.2 tool calling format
Support for string="true|false" attribute to distinguish string vs JSON types (numbers, booleans, arrays, objects, null)

Where should the reviewer start?

lib/parsers/src/tool_calling/dsml/parser.rs main implementation

Related Issues: (use one of the action keywords Closes / Fixes / Resolves / Relates to)

Relates to #4796

Summary by CodeRabbit

Release Notes

New Features
- Added support for DeepSeek V3.2 model with DSML tool-calling format, enabling proper handling of function invocations in this format.
Tests
- Added sample test data demonstrating weather assistant workflow with multi-location queries and structured function responses.
Chores
- Updated build configurations to exclude test data files from trailing whitespace validation.

_{✏️ Tip: You can customize this high-level summary in your review settings.}

Signed-off-by: Vladislav Nosivskoy <[email protected]>

copy-pr-bot · 2025-12-09T13:53:38Z

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

github-actions · 2025-12-09T13:53:43Z

👋 Hi vladnosiv! Thank you for contributing to ai-dynamo/dynamo.

Just a reminder: The NVIDIA Test Github Validation CI runs an essential subset of the testing framework to quickly catch errors.Your PR reviewers may elect to test the changes comprehensively before approving your changes.

🚀

coderabbitai · 2025-12-09T13:58:20Z

Walkthrough

Support for DSML (DeepSeek-specific Markup Language) tool call parsing is introduced. A new DSML parser module is added with detection, positioning, and parsing functions. Configuration structures and integration into the tool calling system enable DeepSeek V3.2 format parsing. Test data and ignore rules are updated accordingly.

Changes

Cohort / File(s)	Summary
Build Configuration `.github/workflows/copyright-check.ps1`, `.pre-commit-config.yaml`	Adds `lib/llm/tests/data/deepseek-v3.2` to ignore lists to skip copyright checks and trailing-whitespace validation on DSML test data files.
Test Data `lib/llm/tests/data/deepseek-v3.2/test_output.txt`	New DSML format test data file demonstrating weather assistant workflow with sequential function calls, structured results blocks, and Chinese-language user interactions.
Parser Configuration `lib/parsers/src/tool_calling/config.rs`	Introduces `DsmlParserConfig` struct with DSML-specific tokens (function_calls, invoke, parameter blocks) and `ParserConfig::Dsml` variant. Adds `ToolCallConfig::deepseek_v3_2()` constructor.
DSML Parser Module `lib/parsers/src/tool_calling/dsml/mod.rs`, `lib/parsers/src/tool_calling/dsml/parser.rs`	New DSML parser implementation with public functions: `detect_tool_call_start_dsml`, `find_tool_call_end_position_dsml`, `try_tool_call_parse_dsml`. Extracts invokes and parameters, converts values to JSON, handles streaming partials. Includes extensive unit tests.
Module Exports `lib/parsers/src/tool_calling/mod.rs`	Exposes `pub mod dsml` and re-exports `pub use dsml::try_tool_call_parse_dsml` for external access.
Parser Integration `lib/parsers/src/tool_calling/parsers.rs`	Integrates DSML parser into dispatch logic via imports, tool parser map entry, and branching in `try_tool_call_parse`, `detect_tool_call_start`, and `find_tool_call_end_position`. Adds DSML-specific tests.

Estimated code review effort

🎯 4 (Complex) | ⏱️ ~60 minutes

Parser logic correctness: dsml/parser.rs contains regex-based parameter extraction, JSON type conversion, and streaming chunk detection requiring careful edge-case validation.
Integration points: Multiple branching paths added to existing dispatch functions (try_tool_call_parse, detect_tool_call_start, find_tool_call_end_position) that need consistency verification.
Configuration alignment: Ensure DsmlParserConfig defaults match actual DeepSeek V3.2 DSML format specifications and are correctly wired through the factory pattern.
Test coverage: Review DSML-specific test cases for comprehensive parameter type handling (strings, numbers, booleans, arrays, objects, null values) and mixed text/tool-call scenarios.

Poem

A rabbit hops through DSML's delightful maze,
With function calls and invoke blocks ablaze! 🐇
DeepSeek V3.2 now speaks in structured tongue,
Parameters parsed and JSON songs sung—
Tool-calling dreams in the parser's sweet cage! 🎯

Pre-merge checks

✅ Passed checks (3 passed)

Check name	Status	Explanation
Title check	✅ Passed	The title clearly describes the main feature addition - DeepSeek V3.2 tool calling support - which aligns with the PR's core objective of implementing DSML parser support for V3.2 tool calls.
Description check	✅ Passed	The description follows the template structure with all required sections (Overview, Details, Where should the reviewer start, Related Issues) properly filled and provides clear, actionable information.
Docstring Coverage	✅ Passed	Docstring coverage is 100.00% which is sufficient. The required threshold is 80.00%.

Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.

❤️ Share

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

coderabbitai

Actionable comments posted: 0

🧹 Nitpick comments (1)

lib/parsers/src/tool_calling/dsml/parser.rs (1)

102-107: Consider caching compiled regex patterns.

Regex compilation occurs on every function call (extract_tool_calls, extract_invokes, parse_parameters). For high-throughput parsing scenarios, this could become a performance bottleneck.

Since the config tokens are typically constant at runtime, consider caching the compiled regex patterns. One approach is to store pre-compiled Regex instances in the DsmlParserConfig struct (possibly using once_cell::sync::Lazy or computing them lazily on first use).

Also applies to: 131-136, 177-182

📜 Review details

Configuration used: Path: .coderabbit.yaml

Review profile: CHILL

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between 1e37c10 and 1040ada.

📒 Files selected for processing (8)

.github/workflows/copyright-check.ps1 (1 hunks)
.pre-commit-config.yaml (1 hunks)
lib/llm/tests/data/deepseek-v3.2/test_output.txt (1 hunks)
lib/parsers/src/tool_calling/config.rs (5 hunks)
lib/parsers/src/tool_calling/dsml/mod.rs (1 hunks)
lib/parsers/src/tool_calling/dsml/parser.rs (1 hunks)
lib/parsers/src/tool_calling/mod.rs (2 hunks)
lib/parsers/src/tool_calling/parsers.rs (7 hunks)

🧰 Additional context used

🧠 Learnings (2)

📚 Learning: 2025-09-10T22:32:12.978Z

Learnt from: zhongdaor-nv
Repo: ai-dynamo/dynamo PR: 2999
File: lib/parsers/src/tool_calling/harmony/harmony_parser.rs:250-256
Timestamp: 2025-09-10T22:32:12.978Z
Learning: In lib/parsers/src/tool_calling/harmony/harmony_parser.rs, the team prefers to maintain identical code patterns between parse_tool_calls_harmony and parse_tool_calls_harmony_complete functions, including message.content[0] indexing, to ensure consistency between streaming and complete parser implementations.

Applied to files:

lib/parsers/src/tool_calling/mod.rs
lib/parsers/src/tool_calling/dsml/mod.rs
lib/parsers/src/tool_calling/parsers.rs
lib/parsers/src/tool_calling/dsml/parser.rs

📚 Learning: 2025-09-08T21:18:43.478Z

Learnt from: nachiketb-nvidia
Repo: ai-dynamo/dynamo PR: 2936
File: lib/parsers/src/reasoning/granite_parser.rs:42-46
Timestamp: 2025-09-08T21:18:43.478Z
Learning: In GraniteReasoningParser in lib/parsers/src/reasoning/granite_parser.rs, the think_start_tokens and think_end_tokens are hardcoded in the constructor with fixed values, so unwrap() calls on these vectors are safe and won't panic.

Applied to files:

lib/parsers/src/tool_calling/config.rs

🧬 Code graph analysis (2)

lib/parsers/src/tool_calling/mod.rs (1)

lib/parsers/src/tool_calling/dsml/parser.rs (1)

try_tool_call_parse_dsml (55-92)

lib/parsers/src/tool_calling/dsml/parser.rs (2)

lib/parsers/src/tool_calling/parsers.rs (3)

extract_name_and_args (183-186)

extract_name_and_args (1680-1683)

extract_name_and_args (2597-2600)

lib/parsers/src/tool_calling/config.rs (4)

default (31-40)

default (60-69)

default (90-99)

default (150-154)

🪛 LanguageTool

lib/llm/tests/data/deepseek-v3.2/test_output.txt

[grammar] ~17-~17: Use a hyphen to join words.
Context: ...ibute should be set to "true" for string type parameters and "false" for other ty...

(QB_NEW_EN_HYPHEN)

⏰ Context from checks skipped due to timeout of 90000ms. You can increase the timeout in your CodeRabbit configuration to a maximum of 15 minutes (900000ms). (8)

GitHub Check: Build and Test - dynamo
GitHub Check: clippy (.)
GitHub Check: tests (launch/dynamo-run)
GitHub Check: clippy (launch/dynamo-run)
GitHub Check: tests (.)
GitHub Check: clippy (lib/bindings/python)
GitHub Check: tests (lib/runtime/examples)
GitHub Check: tests (lib/bindings/python)

🔇 Additional comments (20)

.github/workflows/copyright-check.ps1 (1)

87-87: LGTM!

The addition of the DeepSeek V3.2 test data path to the ignored paths list is appropriate and consistent with the existing sample-models exclusion pattern.

.pre-commit-config.yaml (1)

66-67: LGTM!

The exclusion pattern for trailing-whitespace on test data files is appropriately scoped to .txt files in the DeepSeek V3.2 test directory.

lib/parsers/src/tool_calling/config.rs (3)

72-100: LGTM!

The DsmlParserConfig struct and its Default implementation are well-designed. The configuration fields map directly to the DSML format tokens, and the default values align with the DeepSeek V3.2 specification referenced in the parser implementation.

111-111: LGTM!

The Dsml variant integration into ParserConfig and the accessor method implementations correctly map the DSML function_calls tokens to the parser's generic interface.

Also applies to: 124-124, 137-137

285-296: LGTM!

The deepseek_v3_2() constructor follows the established pattern of other model-specific constructors and includes helpful documentation of the DSML format.

lib/parsers/src/tool_calling/dsml/parser.rs (5)

22-40: LGTM!

The detection logic correctly handles both complete tokens and partial matches for streaming scenarios. The partial match loop is a thoughtful addition for handling incremental input.

42-51: LGTM!

The end position calculation correctly accounts for the token length and provides a sensible fallback for incomplete input.

53-92: LGTM!

The parsing entry point has clean control flow with appropriate early exits for edge cases. The separation of normal text extraction from tool call parsing is well-structured.

164-209: LGTM!

The parameter parsing logic correctly distinguishes between string and non-string types using the string attribute. The fallback to string on JSON parse failure (lines 198-201) provides safe degradation for malformed input.

211-489: LGTM!

Comprehensive test coverage including detection, positioning, single/multiple tool calls, mixed parameter types (strings, numbers, booleans, arrays, objects, null), and edge cases like empty strings. The tests validate the parser against realistic DSML payloads.

lib/parsers/src/tool_calling/mod.rs (1)

5-5: LGTM!

The dsml module declaration and the re-export of try_tool_call_parse_dsml follow the established patterns for other parser modules (harmony, json, xml, etc.), providing a clean public API.

Also applies to: 18-18

lib/parsers/src/tool_calling/dsml/mod.rs (1)

1-9: LGTM!

The module structure is clean and follows the established pattern of other parser modules (harmony, json, pythonic, xml). The public API exports align with the expected DSML parser interface.

lib/llm/tests/data/deepseek-v3.2/test_output.txt (1)

1-112: LGTM!

Comprehensive test data that demonstrates the DSML format well, including:

Single and multiple tool invocations

The string="true|false" attribute for type handling

Multi-turn conversation flow with function results

Mixed language content (Chinese) for realistic testing

lib/parsers/src/tool_calling/parsers.rs (7)

5-7: LGTM!

DSML imports follow the established pattern of other parser modules.

41-41: LGTM!

Parser map entry follows the existing pattern and naming convention.

79-82: LGTM!

The DSML parsing branch follows the exact pattern of other parser branches in the match statement.

131-131: LGTM!

Detection branch follows the established pattern.

168-168: LGTM!

End position detection follows the simpler pattern (like XML), appropriate for DSML's distinct end tokens.

204-204: LGTM!

Test list correctly includes the new parser.

1583-1657: LGTM!

Comprehensive tests covering:

Single tool call parsing

Multiple parallel tool calls

The critical string="true|false" attribute handling (line 1655 correctly validates topn is parsed as number 10, not string "10")

The tests follow the established pattern and validate the key differentiating feature of the V3.2 format.

rmccorm4 · 2025-12-09T17:19:34Z

/ok to test 1040ada

lib/parsers/src/tool_calling/dsml/parser.rs

lib/llm/tests/data/deepseek-v3.2/test_output.txt

ayushag-nv

Looks good to me. Thanks @vladnosiv for the clean implementation. Just address "adding comments" for few function and the test_ouput.txt file related comments.

Signed-off-by: Vladislav Nosivskoy <[email protected]>

ayushag-nv · 2025-12-10T03:37:39Z

/ok to test abc486e

Merged 238 commits from main branch to bring the feature branch up to date. Key conflicts resolved: - Removed lib/kvbm-kernels references (deleted in main) - Kept nova/nova-backend/kvbm workspace members from feature branch - Maintained v2 module API refactoring from feature branch - Updated Cargo.lock files to reflect new dependencies Major updates from main include: - LoRA support for vLLM (#4810) - Multimodal documentation (#4510) - Scaling adapter features (#4699, #4825) - Tool calling support (#4822, #4722) - NIXL connect improvements (#4433) Signed-off-by: Ryan Olson <[email protected]>

Signed-off-by: Vladislav Nosivskoy <[email protected]> Co-authored-by: Ayush Agarwal <[email protected]>

vladnosiv added 3 commits December 9, 2025 16:21

add tool calling parser for dsml

5317230

Signed-off-by: Vladislav Nosivskoy <[email protected]>

add necessary files from PR ai-dynamo#4797

82faa91

Signed-off-by: Vladislav Nosivskoy <[email protected]>

fix clippy

1040ada

Signed-off-by: Vladislav Nosivskoy <[email protected]>

vladnosiv requested review from a team as code owners December 9, 2025 13:53

pull-request-size bot added the size/XL label Dec 9, 2025

github-actions bot added external-contribution Pull request is from an external contributor feat labels Dec 9, 2025

coderabbitai bot reviewed Dec 9, 2025

View reviewed changes

copy-pr-bot bot had a problem deploying to GITLAB December 9, 2025 17:19 Failure

rmccorm4 requested review from GuanLuo, ayushag-nv and zhongdaor-nv December 9, 2025 17:19

ayushag-nv reviewed Dec 9, 2025

View reviewed changes

lib/parsers/src/tool_calling/dsml/parser.rs Show resolved Hide resolved

ayushag-nv reviewed Dec 9, 2025

View reviewed changes

lib/llm/tests/data/deepseek-v3.2/test_output.txt Outdated Show resolved Hide resolved

ayushag-nv approved these changes Dec 9, 2025

View reviewed changes

vladnosiv and others added 3 commits December 10, 2025 00:24

remove files

b58973d

Signed-off-by: Vladislav Nosivskoy <[email protected]>

add comment

900cdc9

Signed-off-by: Vladislav Nosivskoy <[email protected]>

Merge branch 'main' into dsv32-tool-calling-parser

abc486e

copy-pr-bot bot had a problem deploying to GITLAB December 10, 2025 03:37 Failure

ayushag-nv enabled auto-merge (squash) December 10, 2025 03:37

ayushag-nv merged commit 6d091bf into ai-dynamo:main Dec 10, 2025
37 of 38 checks passed

zxue2 pushed a commit to zxue2/dynamo that referenced this pull request Dec 11, 2025

feat: DeepSeek V3.2 tool calling support (ai-dynamo#4822)

951989d

Signed-off-by: Vladislav Nosivskoy <[email protected]> Co-authored-by: Ayush Agarwal <[email protected]>

rmccorm4 mentioned this pull request Dec 12, 2025

[FEATURE]: DeepSeek V3.2 Support #4796

Open

smatta-star pushed a commit to smatta-star/dynamo that referenced this pull request Dec 19, 2025

feat: DeepSeek V3.2 tool calling support (ai-dynamo#4822)

1e4d133

Signed-off-by: Vladislav Nosivskoy <[email protected]> Co-authored-by: Ayush Agarwal <[email protected]>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

feat: DeepSeek V3.2 tool calling support #4822

feat: DeepSeek V3.2 tool calling support #4822

Uh oh!

vladnosiv commented Dec 9, 2025 •

edited by coderabbitai bot

Loading

Uh oh!

copy-pr-bot bot commented Dec 9, 2025

Uh oh!

github-actions bot commented Dec 9, 2025

Uh oh!

coderabbitai bot commented Dec 9, 2025

Uh oh!

coderabbitai bot left a comment

Uh oh!

rmccorm4 commented Dec 9, 2025

Uh oh!

Uh oh!

Uh oh!

ayushag-nv left a comment

Uh oh!

ayushag-nv commented Dec 10, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

feat: DeepSeek V3.2 tool calling support #4822

feat: DeepSeek V3.2 tool calling support #4822

Uh oh!

Conversation

vladnosiv commented Dec 9, 2025 • edited by coderabbitai bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Overview:

Details:

Where should the reviewer start?

Related Issues: (use one of the action keywords Closes / Fixes / Resolves / Relates to)

Summary by CodeRabbit

Release Notes

Uh oh!

copy-pr-bot bot commented Dec 9, 2025

Uh oh!

github-actions bot commented Dec 9, 2025

Uh oh!

coderabbitai bot commented Dec 9, 2025

Walkthrough

Changes

Estimated code review effort

Poem

Pre-merge checks

Uh oh!

coderabbitai bot left a comment

Choose a reason for hiding this comment

Uh oh!

rmccorm4 commented Dec 9, 2025

Uh oh!

Uh oh!

Uh oh!

ayushag-nv left a comment

Choose a reason for hiding this comment

Uh oh!

ayushag-nv commented Dec 10, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

vladnosiv commented Dec 9, 2025 •

edited by coderabbitai bot

Loading