Improve function calling (auto selection, parallel functions, parameters grammar) #1347
Examples:

A single function call gives me:
```
Choice(finish_reason='tool_calls', index=0, logprobs=None, message=ChatCompletionMessage(content=None, role='assistant', function_call=FunctionCall(arguments='{ "term": "burger", "item_price_to": 10 }', name='search_item'), tool_calls=[ChatCompletionMessageToolCall(id='call__0_search_item_cmpl-768bb07b-6b14-47b1-8d91-3efa3c4de6d3', function=Function(arguments='{ "term": "burger", "item_price_to": 10 }', name='search_item'), type='function')]))
```
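For context, a request that could produce a response like this might look as follows. This is a hypothetical reconstruction: the parameter names (`term`, `item_price_to`) are taken from the arguments in the response above, while the descriptions, types, and `required` list are assumptions.

```python
import json

# Hypothetical tools definition inferred from the response's arguments;
# the schema details (types, descriptions, required fields) are assumed.
tools = [
    {
        "type": "function",
        "function": {
            "name": "search_item",
            "description": "Search for a food item, optionally capped by price.",
            "parameters": {
                "type": "object",
                "properties": {
                    "term": {"type": "string"},
                    "item_price_to": {"type": "number"},
                },
                "required": ["term"],
            },
        },
    }
]

# Request payload shape for the OpenAI-compatible endpoint;
# tool_choice="auto" lets the model decide which function to call.
payload = {
    "messages": [{"role": "user", "content": "Find me a burger under $10"}],
    "tools": tools,
    "tool_choice": "auto",
}
print(json.dumps(payload, indent=2))
```

Note that `arguments` in the response comes back as a JSON string, not a dict, so it has to be decoded with `json.loads` before use.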
Multiple parallel function calls, with the same tools definition, give:
```
Choice(finish_reason='tool_calls', index=0, logprobs=None, message=ChatCompletionMessage(content=None, role='assistant', function_call=None, tool_calls=[ChatCompletionMessageToolCall(id='call__0_search_item_cmpl-f924aecb-64e8-4ba0-8fc4-e9a620700064', function=Function(arguments='{ "term": "burger", "item_price_to": 10, "merchant_delivery_fee_to": 0 } ', name='search_item'), type='function'), ChatCompletionMessageToolCall(id='call__1_search_merchant_cmpl-98995992-1197-48c0-8740-d67ee7e12255', function=Function(arguments='{ "term": "Burger King" } ', name='search_merchant'), type='function')]))
```
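On the client side, each entry in `tool_calls` can be dispatched independently. A minimal sketch, using the two calls from the response above; the `search_item` and `search_merchant` implementations here are stubs introduced purely for illustration:

```python
import json

def search_item(term, item_price_to=None, merchant_delivery_fee_to=None):
    # Stub: a real implementation would query an item catalogue.
    return f"items matching {term!r}"

def search_merchant(term):
    # Stub: a real implementation would query a merchant index.
    return f"merchants matching {term!r}"

# Hypothetical dispatch table keyed by function name.
DISPATCH = {"search_item": search_item, "search_merchant": search_merchant}

# The two tool calls from the response above, reduced to (name, arguments).
# Each arguments field is a JSON string and must be decoded before use.
tool_calls = [
    ("search_item",
     '{ "term": "burger", "item_price_to": 10, "merchant_delivery_fee_to": 0 } '),
    ("search_merchant", '{ "term": "Burger King" } '),
]

results = [DISPATCH[name](**json.loads(raw)) for name, raw in tool_calls]
print(results)
```

Each result would then typically be appended to the conversation as a `role="tool"` message referencing the corresponding `tool_call_id`.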
I ran the server with:
```
python3 -m llama_cpp.server --model 'llm_models/WizardLM-2-7B.Q8_0.gguf' --n_gpu_layers 100 --chat_format vicuna-function-calling --n_ctx 1024
```