Tool Call Improvements #71

wendy-aw · 2025-02-17T09:47:54Z

Changes

1. tool_choice schema

Removed the pydantic schema for tool_choice. OpenAI and Anthropic expect very different formats for this parameter.

| Feature | OpenAI | Anthropic |
| --- | --- | --- |
| Zero or more tools called | `auto` | `{"type": "auto"}` |
| At least one tool must be called | `required` | `{"type": "any"}` |
| Specific tool called | `{"type": "function", "function": {"name": function_name}}` | `{"type": "tool", "name": function_name}` |

We therefore simplify the interface: the chat functions accept plain strings such as "auto", "required", or "function_name". The convert_tool_choice function then maps these to the provider-specific formats above. If tools are listed but tool_choice is not set, we default to "auto".
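The mapping in the table can be sketched roughly as follows. This is an illustration only, not the code in this PR: the function name `convert_tool_choice_sketch` and its `provider` argument are hypothetical, and the real convert_tool_choice may take different parameters.

```python
from typing import Any, Dict, Union


def convert_tool_choice_sketch(
    tool_choice: str, provider: str
) -> Union[str, Dict[str, Any]]:
    """Map a simple string to a provider-specific tool_choice value.

    Hypothetical helper: `provider` is assumed to be "openai" or "anthropic".
    Any string other than "auto" or "required" is treated as a function name.
    """
    if provider == "openai":
        if tool_choice in ("auto", "required"):
            # OpenAI accepts these two values as plain strings.
            return tool_choice
        return {"type": "function", "function": {"name": tool_choice}}
    if provider == "anthropic":
        if tool_choice == "auto":
            return {"type": "auto"}
        if tool_choice == "required":
            # Anthropic expresses "at least one tool" as type "any".
            return {"type": "any"}
        return {"type": "tool", "name": tool_choice}
    raise ValueError(f"Unsupported provider: {provider}")
```

For example, `convert_tool_choice_sketch("get_weather", "anthropic")` yields `{"type": "tool", "name": "get_weather"}`, matching the last row of the table.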

2. Avoid looping with `required` tool_choice

Previously, if tool_choice was set to required, tool chaining forced the model to use a tool in every round, so the chain continued indefinitely and never ended with a message. We fix this by setting tool_choice to auto after the first tool call.
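A minimal sketch of the fix, using a stand-in model instead of a real API call. All names here (`run_tool_chain`, `fake_model`, the reply dictionary shape, `max_rounds`) are assumptions made for illustration and do not reflect the actual chat functions.

```python
from typing import Any, Callable, Dict, List, Tuple


def run_tool_chain(
    call_model: Callable[[str, List[Dict[str, Any]]], Dict[str, Any]],
    tools: Dict[str, Callable[[], str]],
    tool_choice: str = "auto",
    max_rounds: int = 5,
) -> Tuple[str, List[Dict[str, Any]]]:
    """Run tool-call rounds until the model replies with a message."""
    tool_outputs: List[Dict[str, Any]] = []
    for _ in range(max_rounds):
        reply = call_model(tool_choice, tool_outputs)
        tool_name = reply.get("tool")
        if tool_name is None:
            # The model answered with a message, so the chain ends here.
            return reply["message"], tool_outputs
        tool_outputs.append({"name": tool_name, "result": tools[tool_name]()})
        if tool_choice == "required":
            # The fix: relax to "auto" after the first tool call so the
            # model is allowed to finish with a message.
            tool_choice = "auto"
    raise RuntimeError("tool chain did not finish within max_rounds")


def fake_model(tool_choice: str, tool_outputs: List[Dict[str, Any]]) -> Dict[str, Any]:
    """Stand-in model: calls a tool when forced to, or when it has no results yet."""
    if tool_choice == "required" or not tool_outputs:
        return {"tool": "get_weather"}
    return {"message": "done"}


message, outputs = run_tool_chain(
    fake_model, {"get_weather": lambda: "sunny"}, tool_choice="required"
)
```

Without the reset to "auto", `fake_model` would request a tool in every round and the loop would only stop at `max_rounds`.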

3. tool_outputs in LLMResponse

We output the intermediate tool outputs during tool chaining so that we can inspect the steps the model has taken.
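As a rough illustration of how the intermediate outputs might be inspected, here is a simplified stand-in for the response object. The class name `LLMResponseSketch` and the keys inside each tool output entry are assumptions; the real LLMResponse has its own fields and may structure tool_outputs differently.

```python
from dataclasses import dataclass, field
from typing import Any, Dict, List


@dataclass
class LLMResponseSketch:
    """Simplified stand-in for a response that records tool chaining steps."""

    content: str
    # One entry per tool call made during chaining, in call order.
    tool_outputs: List[Dict[str, Any]] = field(default_factory=list)


response = LLMResponseSketch(
    content="It is sunny in Singapore.",
    tool_outputs=[
        {"name": "get_weather", "args": {"city": "Singapore"}, "result": "sunny"}
    ],
)

# Walk through the steps the model took before producing its final message.
steps = [f"{step['name']}({step['args']}) -> {step['result']}" for step in response.tool_outputs]
```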

4. New tests

We port the tool-call tests from defog_utils, retaining the get_weather tests and dropping the web_search tests so that no defog API key is required. Tests have also been added for cases where tool_choice is required or forces a specific function. Anthropic tests now use sonnet instead of haiku, as it is more reliable.

- add detailed docstrings to chat_openai and chat_anthropic

- set default tool_choice as auto if tools are listed

- replace haiku with sonnet

rishsriv

Neat! Thank you for documenting this so thoroughly, and for the added tests. Returning results from tool use is super helpful!

wongjingping

very neat, thanks for adding the rigorous battery of tests!

wendy-aw added 12 commits February 14, 2025 12:05

move tool call tests from defog_utils

d07f67e

add tool outputs

a84f88c

print results

6cb724d

- allow tool_choice to be simple strings instead of pydantic classes

3e46fe9

- add detailed docstrings to chat_openai and chat_anthropic

add tests for required tool calls

0e3540b

add tests for forced functions

60949dd

convert tool choice for openai

31e6e1d

- import traceback

654c53b

- set default tool_choice as auto if tools are listed

linted

defcffe

edit test

6e8c9da

- replace search qn with weather qn

baf08a9

- replace haiku with sonnet

standardize test name

873cd51

wendy-aw requested a review from rishsriv February 17, 2025 09:47

rishsriv approved these changes Feb 17, 2025

rishsriv merged commit 3800e53 into main Feb 17, 2025
2 checks passed

rishsriv deleted the wendy/move_tools branch February 17, 2025 09:51

wongjingping reviewed Feb 17, 2025
