KG-545 Fix requestLLMOnlyCallingTools ignoring tool calls after rea…
#1198
+93
−9
Motivation and Context
Related to KG-545.
Currently, `requestLLMOnlyCallingTools` relies on `executeSingle`, which returns the first message received from the LLM. When using models that output Chain of Thought or "Thinking" blocks (e.g., Nova, Claude) prior to calling a tool, the response sequence is often `[Message.Assistant(Thinking), Message.Tool.Call]`. As a result, the caller receives the reasoning message and the tool call that follows it is ignored.
Changes:
- Added `requestLLMMultipleOnlyCallingTools()` in `AIAgentLLMSession` to allow retrieving the full list of messages while enforcing `ToolChoice.Required`.
- Updated `requestLLMOnlyCallingTools` to use this new method. It now persists all messages (preserving the reasoning context in the session history) but filters the return value so that the caller receives the `Message.Tool.Call`.
Breaking Changes
None. This is a behavioral fix to ensure the method contract (returning a tool call) is honored when the LLM is "chatty" or provides reasoning.
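The difference between the old and new behavior can be sketched as follows. This is only an illustration under stated assumptions: `Message`, `SketchSession`, `firstMessageOnly`, and `persistAllReturnToolCall` are hypothetical stand-ins written for this example, not the library's actual classes or the code in this PR.

```kotlin
// Simplified stand-ins that mirror the names used in the description.
// They are NOT the real library types; they only model the shape of a response.
sealed interface Message {
    data class Assistant(val content: String) : Message
    sealed interface Tool : Message {
        data class Call(val tool: String, val args: String) : Tool
    }
}

// Old behavior as described: only the first message of the response is returned.
fun firstMessageOnly(responses: List<Message>): Message = responses.first()

// Hypothetical session that keeps a history of messages.
class SketchSession {
    val history = mutableListOf<Message>()

    // New behavior as described: persist every message, then return the tool call.
    fun persistAllReturnToolCall(responses: List<Message>): Message.Tool.Call {
        history += responses // reasoning stays in the session history
        return responses.filterIsInstance<Message.Tool.Call>().firstOrNull()
            ?: error("Expected a tool call in the response, but found none")
    }
}

fun main() {
    val responses = listOf(
        Message.Assistant("I should look up the weather first."),
        Message.Tool.Call(tool = "get_weather", args = """{"city":"Paris"}"""),
    )

    // Taking the first message yields the reasoning block, not the tool call.
    println(firstMessageOnly(responses) is Message.Tool.Call) // prints "false"

    val session = SketchSession()
    val call = session.persistAllReturnToolCall(responses)
    println(call.tool)            // prints "get_weather"
    println(session.history.size) // prints "2"
}
```

How the real method reacts when the response contains no tool call at all is not stated in this description; the sketch simply throws in that case.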
Type of the changes
Checklist
`develop` as the base branch
Additional steps for pull requests adding a new feature