What is ReAct

ReAct comes from the paper "ReAct: Synergizing Reasoning and Acting in Language Models" and is a way to improve the quality of prompts generated by Chloe (or any LLM, for that matter).

Below is the abstract from the paper:

While large language models (LLMs) have demonstrated impressive capabilities across tasks in
language understanding and interactive decision making, their abilities for reasoning (e.g.
chain-of-thought prompting) and acting (e.g. action plan generation) have primarily been studied as
separate topics. 

In this paper, we explore the use of LLMs to generate both reasoning traces and
task-specific actions in an interleaved manner, allowing for greater synergy between the two: reasoning traces help the model induce, track, and update action plans as well as handle exceptions,
while actions allow it to interface with external sources, such as knowledge bases or environments,
to gather additional information. 

We apply our approach, named ReAct, to a diverse set of language
and decision making tasks and demonstrate its effectiveness over state-of-the-art baselines, as well
as improved human interpretability and trustworthiness over methods without reasoning or acting
components. 

Concretely, on question answering (HotpotQA) and fact verification (Fever), ReAct
overcomes issues of hallucination and error propagation prevalent in chain-of-thought reasoning by
interacting with a simple Wikipedia API, and generates human-like task-solving trajectories that are
more interpretable than baselines without reasoning traces. 

On two interactive decision making
benchmarks (ALFWorld and WebShop), ReAct outperforms imitation and reinforcement learning methods by
an absolute success rate of 34% and 10% respectively, while being prompted with only one or two
in-context examples.

This package also includes all the actions that Chloe supports, such as google, wikipedia, and news.
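The interleaving of reasoning and acting described in the abstract can be sketched as a short loop. This is only an illustration, not Chloe's implementation: the `react` function, the `ACTIONS` registry, the `scripted_llm` stand-in, and the canned `wikipedia` lookup are all hypothetical names invented for this example. The `Thought` / `Action` / `Observation` text format loosely follows the prompts used in the paper.

```python
import re
from typing import Callable, Dict

# Hypothetical action: a canned lookup standing in for a real Wikipedia call.
def wikipedia(query: str) -> str:
    facts = {"ReAct": "ReAct interleaves reasoning traces with task-specific actions."}
    return facts.get(query, "No result found.")

# Hypothetical registry mapping action names to callables.
ACTIONS: Dict[str, Callable[[str], str]] = {"wikipedia": wikipedia}

# Matches lines such as "Action: wikipedia[ReAct]".
ACTION_RE = re.compile(r"Action:\s*(\w+)\[(.*)\]")

def scripted_llm(transcript: str) -> str:
    """Stand-in for a real model call: returns the next Thought/Action step."""
    if "Observation:" not in transcript:
        return "Thought: I should look this up.\nAction: wikipedia[ReAct]"
    return ("Thought: The observation answers the question.\n"
            "Action: finish[ReAct interleaves reasoning and acting.]")

def react(question: str, llm: Callable[[str], str], max_steps: int = 5) -> str:
    """Alternate model steps (thought + action) with tool observations."""
    transcript = f"Question: {question}\n"
    for _ in range(max_steps):
        step = llm(transcript)
        transcript += step + "\n"
        match = ACTION_RE.search(step)
        if match is None:
            break  # the model produced no action, so there is nothing to run
        name, arg = match.group(1), match.group(2)
        if name == "finish":
            return arg  # the model declared its final answer
        tool = ACTIONS.get(name)
        observation = tool(arg) if tool else f"Unknown action: {name}"
        # Feed the result back so the next thought can build on it.
        transcript += f"Observation: {observation}\n"
    return "No answer found."

print(react("What is ReAct?", scripted_llm))
```

In a real setup, `scripted_llm` would be replaced by a call to the model, and the growing transcript is what lets each new thought take earlier observations into account.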

Configuration

Environment Variable	Default Value	Description	Options
CHLOE_REACT_REPORT_THOUGHTS	false	Whether to report the model's thoughts (the reasoning is not very good with GPT-3.5)	true, false
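As a rough sketch of how a boolean flag like this might be read, the snippet below parses the variable with its documented default of `false`. The `report_thoughts` helper is hypothetical, and the case-insensitive matching and the error on unrecognised values are assumptions made for this example; Chloe's actual parsing may differ.

```python
import os
from typing import Mapping, Optional

def report_thoughts(env: Optional[Mapping[str, str]] = None) -> bool:
    """Return True when CHLOE_REACT_REPORT_THOUGHTS is set to "true"."""
    env = os.environ if env is None else env
    # The documented default is "false" when the variable is unset.
    raw = env.get("CHLOE_REACT_REPORT_THOUGHTS", "false")
    value = raw.strip().lower()  # assumption: case and whitespace are ignored
    if value not in ("true", "false"):
        # Assumption: reject anything outside the documented options.
        raise ValueError(f"CHLOE_REACT_REPORT_THOUGHTS must be true or false, got {raw!r}")
    return value == "true"

print(report_thoughts({"CHLOE_REACT_REPORT_THOUGHTS": "true"}))
```

Passing a mapping explicitly, as in the last line, makes the helper easy to exercise without touching the real process environment.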
