KeyError: 'llama-3' in /fastchat/conversation.py when running ToolGen #4

Open
27yw opened this issue Nov 20, 2024 · 6 comments

@27yw

27yw commented Nov 20, 2024

Hi! Very nice work.
I set up the environment following your requirements.txt and tried to run the code below:
```python
import json
from OpenAgent.agents.toolgen.toolgen import ToolGen
from OpenAgent.tools.src.rapidapi.rapidapi import RapidAPIWrapper

with open("keys.json", 'r') as f:
    keys = json.load(f)
toolbench_key = keys['TOOLBENCH_KEY']
rapidapi_wrapper = RapidAPIWrapper(
    toolbench_key=toolbench_key,
    rapidapi_key="",
)

toolgen = ToolGen(
    "reasonwang/ToolGen-Llama-3-8B",
    indexing="Atomic",
    tools=rapidapi_wrapper,
)

messages = [
    {"role": "system", "content": ""},
    {"role": "user", "content": "I'm a football fan and I'm curious about the different team names used in different leagues and countries. Can you provide me with an extensive list of football team names and their short names? It would be great if I could access more than 7000 team names. Additionally, I would like to see the first 25 team names and their short names using the basic plan."}
]

toolgen.restart()
toolgen.start(
    single_chain_max_step=16,
    start_messages=messages
)
```

I got the error:
```
Traceback (most recent call last):
  File "/data2/ToolGen/run_toolgen_local.py", line 24, in <module>
    toolgen.start(
  File "/data2/ToolGen/OpenAgent/agents/base.py", line 54, in start
    out_node = self.do_chain(self.tree.root, single_chain_max_step)
  File "/data2/ToolGen/OpenAgent/agents/base.py", line 148, in do_chain
    new_message, error_code, total_tokens = self.get_agent_response(now_node)
  File "/data2/ToolGen/OpenAgent/agents/base.py", line 73, in get_agent_response
    new_message, error_code, total_tokens = self.parse(tools=self.io_func.tools,
  File "/data2/ToolGen/OpenAgent/agents/toolgen/toolgen.py", line 467, in parse
    conv, roles = self.convert_to_fastchat_format(
  File "/data2/ToolGen/OpenAgent/agents/toolgen/toolgen.py", line 357, in convert_to_fastchat_format
    conv = get_conv_template(self.template)
  File "/home/anaconda3/envs/toolgen/lib/python3.10/site-packages/fastchat/conversation.py", line 415, in get_conv_template
    return conv_templates[name].copy()
KeyError: 'llama-3'
```

I am using your Hugging Face model.

I am confused. My environment was set up from requirements.txt, which pins fschat==0.2.36.

Any help would be appreciated!
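
For anyone debugging the same KeyError, the installed fastchat's template registry can be inspected directly (a quick sketch; `conv_templates` is the registry dict shown in the traceback):

```python
# List the conversation templates registered by the installed fschat version.
# On fschat 0.2.36 there is no "llama-3" entry, which is what triggers the KeyError.
from fastchat.conversation import conv_templates

print("llama-3" in conv_templates)
print(sorted(conv_templates.keys()))
```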

@Reason-Wang
Owner

Hi, the released version of fastchat does not contain the template for Llama-3. You may need to install it from source. The following snippet shows how to do it.

```bash
git clone https://github.com/lm-sys/FastChat
cd FastChat
pip install -e .
```
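
After installing from source, a quick check (not part of the repo, just a sanity test) confirms the template now resolves:

```python
# Should print the Conversation object instead of raising KeyError: 'llama-3'.
from fastchat.conversation import get_conv_template

print(get_conv_template("llama-3"))
```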

@27yw
Author

27yw commented Nov 28, 2024

Thanks for your answer!
We also found another way that works if fastchat has already been installed:
first find conversation.py in the installed fschat package, then add this code:

```python
register_conv_template(
    Conversation(
        name="llama-3",
        system_template="<|begin_of_text|><|start_header_id|>system<|end_header_id|>\n\n{system_message}",
        roles=(
            "<|start_header_id|>user<|end_header_id|>\n",
            "<|start_header_id|>assistant<|end_header_id|>\n",
        ),
        sep_style=SeparatorStyle.ADD_NEW_LINE_SINGLE,
        sep="<|eot_id|>",
        stop_token_ids=[128009],
        stop_str="<|eot_id|>",
    )
)
```

Then everything will work fine.
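
Once the template is registered, it can be exercised to sanity-check the prompt markup (a minimal sketch using fastchat's Conversation API; the message text is made up):

```python
from fastchat.conversation import get_conv_template

conv = get_conv_template("llama-3")
conv.append_message(conv.roles[0], "Hello!")  # user turn
conv.append_message(conv.roles[1], None)      # leave the assistant slot open for generation
print(conv.get_prompt())                      # prints the assembled Llama-3 chat markup
```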

@27yw
Author

27yw commented Nov 28, 2024

Here is another question:
if we want to evaluate ToolGen on ToolBench, which .sh script should I run?
This one?
/scripts/inference/inference_toolgen_pipeline_virtual.sh

@Reason-Wang
Owner

To evaluate ToolGen on ToolBench:

1. Run inference on the queries to generate trajectories; for ToolGen this is inference_toolgen_pipeline_virtual.sh.
2. Convert the trajectory format: scripts/convert_answer/run_convert_answer.sh.
3. Run scripts/pass_rate/run_pass_rate.sh for pass-rate evaluation.
4. Run scripts/preference/run_preference.sh for win-rate evaluation.

Note that running the evaluation will cost GPT-4 credits.
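
For reference, the four stages can be chained from the repository root (a minimal sketch; it assumes each script runs with its default arguments and a configured environment):

```python
# Runs the four evaluation stages in order, stopping on the first failure.
import subprocess

steps = [
    "scripts/inference/inference_toolgen_pipeline_virtual.sh",  # 1. generate trajectories
    "scripts/convert_answer/run_convert_answer.sh",             # 2. convert trajectory format
    "scripts/pass_rate/run_pass_rate.sh",                       # 3. pass-rate evaluation (GPT-4)
    "scripts/preference/run_preference.sh",                     # 4. win-rate evaluation (GPT-4)
]
for script in steps:
    subprocess.run(["bash", script], check=True)
```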

@27yw
Author

27yw commented Dec 3, 2024

Thanks a lot for your help!
How can I reproduce this result?
[Screenshot 2024-12-04 02 09 39: retrieval results table]
I can only find /ToolGen/evaluation/retrieval/eval_toolgen.py, which calculates the NDCG of retrieval.
Thank you!

@Reason-Wang
Owner

Reason-Wang commented Dec 4, 2024

You can use the other scripts in scripts/retrieval/ to reproduce the results. Set corpus to "G123" for multi-domain evaluation. The reported score is the average over the subsets: for G1, it is the average of G1 instruction, tool, and category; for G2, the average of G2 instruction and category; for G3, there is only G3 instruction, so the score is the G3 instruction score itself.
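
To make the averaging concrete, here is a small sketch (the NDCG numbers are placeholders, not real results):

```python
# Placeholder NDCG values for illustration only; substitute the scores
# produced by the scripts in scripts/retrieval/.
subset_scores = {
    "G1": {"instruction": 0.90, "tool": 0.85, "category": 0.88},
    "G2": {"instruction": 0.87, "category": 0.84},
    "G3": {"instruction": 0.91},  # only one subset, so the average is the score itself
}

for group, scores in subset_scores.items():
    reported = sum(scores.values()) / len(scores)
    print(f"{group} reported score: {reported:.4f}")
```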
