FunctionCall

In order to extend the capabilities of the model so that it can not only generate text but also perform specific tasks, query databases, interact with external systems, and so on, we define the FunctionCall class to implement the model's tool-calling capabilities. You can refer to the API documentation for FunctionCall. Next, we will start with a simple example to introduce the design ideas behind FunctionCall in LazyLLM.

FunctionCall Quick Start

Suppose we are developing an application for querying the weather. Since weather information is time-sensitive, a large model alone cannot produce specific weather information; the model must call an external weather query tool to obtain real-time weather information. We therefore define two weather query functions as follows:

from typing import Literal
import json
def get_current_weather(location: str, unit: Literal["fahrenheit", "celsius"]="fahrenheit"):
    """
    Get the current weather in a given location

    Args:
        location (str): The city and state, e.g. San Francisco, CA.
        unit (Literal['fahrenheit', 'celsius']): The temperature unit to use. Infer this from the user's location.
    """
    if 'tokyo' in location.lower():
        return json.dumps({'location': 'Tokyo', 'temperature': '10', 'unit': 'celsius'})
    elif 'san francisco' in location.lower():
        return json.dumps({'location': 'San Francisco', 'temperature': '72', 'unit': 'fahrenheit'})
    elif 'paris' in location.lower():
        return json.dumps({'location': 'Paris', 'temperature': '22', 'unit': 'celsius'})
    elif 'beijing' in location.lower():
        return json.dumps({'location': 'Beijing', 'temperature': '90', 'unit': 'fahrenheit'})
    else:
        return json.dumps({'location': location, 'temperature': 'unknown'})

def get_n_day_weather_forecast(location: str, num_days: int, unit: Literal["celsius", "fahrenheit"]='fahrenheit'):
    """
    Get an N-day weather forecast

    Args:
        location (str): The city and state, e.g. San Francisco, CA.
        num_days (int): The number of days to forecast.
        unit (Literal['celsius', 'fahrenheit']): The temperature unit to use. Infer this from the user's location.
    """
    if 'tokyo' in location.lower():
        return json.dumps({'location': 'Tokyo', 'temperature': '10', 'unit': 'celsius', "num_days": num_days})
    elif 'san francisco' in location.lower():
        return json.dumps({'location': 'San Francisco', 'temperature': '75', 'unit': 'fahrenheit', "num_days": num_days})
    elif 'paris' in location.lower():
        return json.dumps({'location': 'Paris', 'temperature': '25', 'unit': 'celsius', "num_days": num_days})
    elif 'beijing' in location.lower():
        return json.dumps({'location': 'Beijing', 'temperature': '85', 'unit': 'fahrenheit', "num_days": num_days})
    else:
        return json.dumps({'location': location, 'temperature': 'unknown'})

For the large model to call the corresponding function and generate the corresponding parameters, the function's parameters must be annotated and the function must be given a functional description when it is defined, so that the large model knows what the function does and when it can be called. This is the first step: defining the tool. The second step is to register the defined tool into LazyLLM, so that you don't have to pass the functions around when using large models later. The registration method is as follows:

from lazyllm.tools import fc_register
@fc_register("tool")
def get_current_weather(location: str, unit: Literal["fahrenheit", "celsius"]="fahrenheit"):
    ...

@fc_register("tool")
def get_n_day_weather_forecast(location: str, num_days: int, unit: Literal["celsius", "fahrenheit"]='fahrenheit'):
    ...

The registration method is very simple. After importing fc_register, apply it as a decorator directly above the definition of the function to be registered. Note that you must specify the default group tool when registering; the tool is registered under the name of the function by default.

We can also register the tool under a different name by passing a second argument during registration, for example:

from lazyllm.tools import fc_register

def get_current_weather(location: str, unit: Literal["fahrenheit", "celsius"]="fahrenheit"):
    ...

fc_register("tool")(get_current_weather, "another_get_current_weather")

Thus, the function get_current_weather is registered as a tool named another_get_current_weather.
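
The renamed tool is then referenced by its registered name, for example (a minimal sketch mirroring the by-name usage shown later in this section):

import lazyllm
from lazyllm.tools import FunctionCall

llm = lazyllm.OnlineChatModule()
fc = FunctionCall(llm, ["another_get_current_weather"])  # refer to the tool by its new name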

If we do not intend to register the tool as globally visible, we can also pass the tool itself directly when calling FunctionCall, like this:

import lazyllm
from lazyllm.tools import FunctionCall
llm = lazyllm.OnlineChatModule()
tools = [get_current_weather, get_n_day_weather_forecast]
fc = FunctionCall(llm, tools)
query = "What's the weather like today in celsius in Tokyo and Paris."
ret = fc(query)
print(f"ret: {ret}")

The code above passes the two previously defined functions directly as tools; they are visible only within the resulting fc instance. Trying to access these tools by name outside of fc will result in an error.

Then we can define the model and use FunctionCall, as shown below:

import lazyllm
from lazyllm.tools import FunctionCall
llm = lazyllm.TrainableModule("internlm2-chat-20b").start()  # or llm = lazyllm.OnlineChatModule()
tools = ["get_current_weather", "get_n_day_weather_forecast"]
fc = FunctionCall(llm, tools)
query = "What's the weather like today in celsius in Tokyo and Paris."
ret = fc(query)
print(f"ret: {ret}")
# ["What's the weather like today in celsius in Tokyo and Paris.", {'role': 'assistant', 'content': '', 'tool_calls': [{'id': '93d7e8e8721b4d22b1cb9aa14234ad70', 'type': 'function', 'function': {'name': 'get_current_weather', 'arguments': {'location': 'Tokyo', 'unit': 'celsius'}}}]}, [{'role': 'tool', 'content': '{"location": "Tokyo", "temperature": "10", "unit": "celsius"}', 'tool_call_id': '93d7e8e8721b4d22b1cb9aa14234ad70', 'name': 'get_current_weather'}]]

The result is output as a list: the first element is the current round's input, the second is the model output, and the third is the tool output. Because FunctionCall is a single-round tool call process, the returned result includes not only the tool's return value but also the current round's input and model output. If no tool call is triggered, a string is returned directly, so a caller can branch on the return type, for example:
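
ret = fc(query)
if isinstance(ret, list):
    # A tool call was triggered: [current input, model output, tool outputs].
    user_input, model_output, tool_outputs = ret
    print("tool outputs:", tool_outputs)
else:
    # No tool call was triggered: ret is the model's plain-text answer.
    print("answer:", ret)

If you want to execute a complete function call, you need to use FunctionCallAgent, as shown below: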

import lazyllm
from lazyllm.tools import FunctionCallAgent
llm = lazyllm.TrainableModule("internlm2-chat-20b").start()  # or llm = lazyllm.OnlineChatModule()
tools = ["get_current_weather", "get_n_day_weather_forecast"]
agent = FunctionCallAgent(llm, tools)
query = "What's the weather like today in celsius in Tokyo and Paris."
ret = agent(query)
print(f"ret: {ret}")
# The current weather in Tokyo is 10 degrees Celsius, and in Paris, it is 22 degrees Celsius.

In the above example, if the input query triggers a function call, FunctionCall returns a list object, and FunctionCallAgent iteratively executes model calls and tool calls until the model considers the information sufficient to draw a conclusion, or the maximum number of iterations is exceeded. The number of iterations is set by max_retries, with a default value of 5.
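
For example, assuming max_retries is exposed as a constructor argument (a sketch consistent with the default described above):

agent = FunctionCallAgent(llm, tools, max_retries=10)  # raise the iteration limit from the default of 5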

Complete code is as follows:

from typing import Literal
import json
import lazyllm
from lazyllm.tools import fc_register, FunctionCall, FunctionCallAgent

@fc_register("tool")
def get_current_weather(location: str, unit: Literal["fahrenheit", "celsius"]="fahrenheit"):
    """
    Get the current weather in a given location

    Args:
        location (str): The city and state, e.g. San Francisco, CA.
        unit (Literal['fahrenheit', 'celsius']): The temperature unit to use. Infer this from the user's location.
    """
    if 'tokyo' in location.lower():
        return json.dumps({'location': 'Tokyo', 'temperature': '10', 'unit': 'celsius'})
    elif 'san francisco' in location.lower():
        return json.dumps({'location': 'San Francisco', 'temperature': '72', 'unit': 'fahrenheit'})
    elif 'paris' in location.lower():
        return json.dumps({'location': 'Paris', 'temperature': '22', 'unit': 'celsius'})
    elif 'beijing' in location.lower():
        return json.dumps({'location': 'Beijing', 'temperature': '90', 'unit': 'fahrenheit'})
    else:
        return json.dumps({'location': location, 'temperature': 'unknown'})

@fc_register("tool")
def get_n_day_weather_forecast(location: str, num_days: int, unit: Literal["celsius", "fahrenheit"]='fahrenheit'):
    """
    Get an N-day weather forecast

    Args:
        location (str): The city and state, e.g. San Francisco, CA.
        num_days (int): The number of days to forecast.
        unit (Literal['celsius', 'fahrenheit']): The temperature unit to use. Infer this from the user's location.
    """
    if 'tokyo' in location.lower():
        return json.dumps({'location': 'Tokyo', 'temperature': '10', 'unit': 'celsius', "num_days": num_days})
    elif 'san francisco' in location.lower():
        return json.dumps({'location': 'San Francisco', 'temperature': '75', 'unit': 'fahrenheit', "num_days": num_days})
    elif 'paris' in location.lower():
        return json.dumps({'location': 'Paris', 'temperature': '25', 'unit': 'celsius', "num_days": num_days})
    elif 'beijing' in location.lower():
        return json.dumps({'location': 'Beijing', 'temperature': '85', 'unit': 'fahrenheit', "num_days": num_days})
    else:
        return json.dumps({'location': location, 'temperature': 'unknown'})

llm = lazyllm.TrainableModule("internlm2-chat-20b").start()  # or llm = lazyllm.OnlineChatModule()
tools = ["get_current_weather", "get_n_day_weather_forecast"]
fc = FunctionCall(llm, tools)
query = "What's the weather like today in celsius in Tokyo and Paris."
ret = fc(query)
print(f"ret: {ret}")
# ["What's the weather like today in celsius in Tokyo and Paris.", {'role': 'assistant', 'content': '', 'tool_calls': [{'id': '93d7e8e8721b4d22b1cb9aa14234ad70', 'type': 'function', 'function': {'name': 'get_current_weather', 'arguments': {'location': 'Tokyo', 'unit': 'celsius'}}}]}, [{'role': 'tool', 'content': '{"location": "Tokyo", "temperature": "10", "unit": "celsius"}', 'tool_call_id': '93d7e8e8721b4d22b1cb9aa14234ad70', 'name': 'get_current_weather'}]]

agent = FunctionCallAgent(llm, tools)
ret = agent(query)
print(f"ret: {ret}")
# The current weather in Tokyo is 10 degrees Celsius, and in Paris, it is 22 degrees Celsius.

Note

  • When registering a function or tool, you must specify the default group tool; otherwise the model will not be able to use the corresponding tool.
  • When using the model, there is no need to distinguish between TrainableModule and OnlineChatModule, because their output types are designed to be the same.

Design Concept of FunctionCall

The design of FunctionCall proceeds bottom-up. First, since FunctionCall must call an LLM, the model's output format must be consistent, so the outputs of TrainableModule and OnlineChatModule are aligned. Then a single round of FunctionCall is implemented, that is, the LLM and the tools are each called once. Finally, the complete FunctionCallAgent is implemented, that is, FunctionCall is iterated repeatedly until the model finishes or the maximum number of iterations is exceeded.

TrainableModule and OnlineChatModule output alignment

  1. Since the output of TrainableModule is a string while the output of OnlineChatModule is in JSON format, the two output formats need to be unified so that FunctionCall is unaware of the model type when using the model.

  2. First, for TrainableModule, the format of tool_calls in the model output is specified through the prompt, and the model output is then parsed so that only the part generated by the model, that is, the model's real output, is kept. For example:

    '\nI need to use the "get_current_weather" function to get the current weather in Tokyo and Paris. I will call the function twice, once for Tokyo and once for Paris.<|action_start|><|plugin|>\n{"name": "get_current_weather", "parameters": {"location": "Tokyo"}}<|action_end|><|im_end|>'
    

  3. Then the model output is parsed by an extractor to obtain the content field and the name and arguments fields of tool_calls, and the results are spliced together and output in the format:

    content<|tool_calls|>tool_calls
    

    • Here, content is the content information of the model output, <|tool_calls|> is the separator, and tool_calls is the string representation of the tool call, for example:
      '[{"id": "xxxx", "type": "function", "function": {"name": "func_name", "arguments": {"param1": "val1", "param2": "val2"}}}]'
      
    • Because the model output itself carries no id field, a unique random id is generated here to stay aligned with the OnlineChatModule.

    • If no tool call is triggered, the output contains neither tool_calls nor the separator, only content. If a tool call is triggered but there is no content, the output contains no content, only <|tool_calls|>tool_calls. A minimal parsing sketch follows the example below.

    For example:

    'I need to use the "get_current_weather" function to get the current weather in Tokyo and Paris. I will call the function twice, once for Tokyo and once for Paris.<|tool_calls|>[{"id": "bd75399403224eb8972640eabedd0d46", "type": "function", "function":{"name": "get_current_weather", "arguments": "{\"location\": \"Tokyo\"}"}}]'
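
    To make the splicing convention concrete, here is a minimal parsing sketch (an illustration of the format only, not LazyLLM's actual extractor):

    import json

    def split_model_output(output: str):
        # Split 'content<|tool_calls|>tool_calls' into its two parts.
        # A sketch of the convention described above, not LazyLLM internals.
        sep = "<|tool_calls|>"
        if sep not in output:  # no tool call was triggered
            return output, None
        content, _, calls = output.partition(sep)
        return content, json.loads(calls)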
    

  4. Secondly, for OnlineChatModule: since the online model supports both streaming and non-streaming output, and whether FunctionCall is triggered can only be known after all the information has been received, the streaming output of OnlineChatModule must first be converted to non-streaming. That is, if the model produces streaming output, all messages are collected before any subsequent processing (a minimal accumulation sketch follows the example below). For example:

    {
      "id": "chatcmpl-bbc37506f904440da85a9bad1a21494e",
      "object": "chat.completion",
      "created": 1718099764,
      "model": "moonshot-v1-8k",
      "choices": [
        {
          "index": 0,
          "message": {
            "role": "assistant",
            "content": "",
            "tool_calls": [
              {
                "index": 0,
                "id": "get_current_weather:0",
                "type": "function",
                "function": {
                  "name": "get_current_weather",
                  "arguments": "{\n  \"location\": \"Tokyo\",\n  \"unit\": \"celsius\"\n}"
                }
              }
            ]
          },
          "finish_reason": "tool_calls"
        }
      ],
      "usage": {
        "prompt_tokens": 219,
        "completion_tokens": 22,
        "total_tokens": 241
      }
    }
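
    A minimal sketch of this stream-to-non-streaming conversion (the chunk fields follow the OpenAI-style streaming format and are an assumption for illustration, not LazyLLM's internal code):

    def accumulate_stream(chunks):
        # Collect streamed deltas into one complete assistant message
        # before any FunctionCall parsing is done.
        content, calls = "", {}
        for chunk in chunks:
            delta = chunk["choices"][0]["delta"]
            content += delta.get("content") or ""
            for tc in delta.get("tool_calls") or []:
                slot = calls.setdefault(tc["index"], {"id": "", "type": "", "function": {"name": "", "arguments": ""}})
                slot["id"] = tc.get("id") or slot["id"]
                slot["type"] = tc.get("type") or slot["type"]
                fn = tc.get("function") or {}
                slot["function"]["name"] += fn.get("name") or ""
                slot["function"]["arguments"] += fn.get("arguments") or ""
        return {"role": "assistant", "content": content,
                "tool_calls": [calls[i] for i in sorted(calls)]}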
    

  5. After receiving the model's output, it is parsed through the extractor to obtain the content field and all fields in tool_calls except index; type and function are retained because they are needed as model input in the next round. After extracting the content and tool_calls fields, they are spliced and output according to the format above. For example:

    '<|tool_calls|>[{"id": "get_current_weather:0","type":"function","function":{"name":"get_current_weather","arguments":"{\n\"location\":\"Tokyo\",\n\"unit\":\"celsius\"\n}"}}]'
    

  6. This ensures that TrainableModule and OnlineChatModule are used in exactly the same way. To adapt the output to FunctionCall applications, the model's output is passed through a FunctionCallFormatter, whose role is to parse the model output and extract the content and tool_calls information. For example:

    [{"id": "bd75399403224eb8972640eabedd0d46", "type": "function", "function": {"name": "get_current_weather", "arguments": {"location": "Tokyo"}}}]
    or
    [{"id": "get_current_weather:0", "type": "function", "function": {"name": "get_current_weather", "arguments": {"location": "Tokyo", "unit": "celsius"}}}, {"id": "get_current_weather:1", "type": "function", "function": {"name": "get_current_weather", "arguments": {"location": "Paris", "unit": "celsius"}}}]
    
    If the tool is not called, the output is of type str and is simply the model's reply. For example:
    The weather in Tokyo today is 10 degrees Celsius.
    The weather in Tokyo today is 10 degrees Celsius, and in Paris it is 22 degrees Celsius.
    

Note

  • The model's output format is content<|tool_calls|>tool_calls; the delimiter is fixed and is used to determine whether the output contains a tool call.
  • The tool call information contains the tool's name and arguments fields as well as the id, type and function fields.

FunctionCall Output Flow

FunctionCall handles a single round of a tool call.

  • Non-function call request
    Hello World!
    
  • function call request
    What's the weather like today in Tokyo.
    
  1. The input comes in, and the large model is called first, for example:

    • Non-function call request
      Hello! How can I assist you today?
      
    • function call request
      [{"id": "bd75399403224eb8972640eabedd0d46", "type": "function", "function": {"name": "get_current_weather", "arguments": "{\"location\": \"Tokyo\"}"}}]
      
  2. The output of the model is parsed by the parser:

    • Non-function call request
      Hello! How can I assist you today?
      
    • function call request
      [{"name": "get_current_weather", "arguments": {"location": "Tokyo"}}]
      
  3. Determine whether the parsed output is a tool call. If it is, the corresponding tool is called through the ToolManager tool-management class.

    • function call request
      '{"location": "Tokyo", "temperature": "10", "unit": "celsius"}'
      
  4. If it is not a tool call, the output is returned directly. If it is a tool call, the current round's input, the model output and the tool's return results are packaged together and then output (a condensed sketch of this flow follows the list).

    • Non-function call request
      Hello! How can I assist you today?
      
    • function call request
      [{'tool_call_id': 'bd75399403224eb8972640eabedd0d46', 'name': 'get_current_weather', 'content': '{"location": "Tokyo", "temperature": "10", "unit": "celsius"}', 'role': 'tool'}]
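
Putting the four steps together, a single FunctionCall round can be sketched roughly as follows (helper names such as parse_output and call_tool are illustrative placeholders, not LazyLLM's API):

def function_call_once(llm, parse_output, call_tool, query):
    # Step 1: call the large model with the input.
    model_output = llm(query)
    # Step 2: parse the output into content and tool_calls.
    content, tool_calls = parse_output(model_output)
    # Step 4 (no tool call): return the model's answer directly.
    if not tool_calls:
        return content
    # Step 3: call each requested tool through the tool manager.
    tool_results = [call_tool(tc) for tc in tool_calls]
    # Step 4 (tool call): package input, model output and tool results.
    return [query,
            {"role": "assistant", "content": content, "tool_calls": tool_calls},
            tool_results]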
      

FunctionCallAgent Output Flow

FunctionCallAgent handles the complete tool call process.

  • Agent Input
    What's the weather like today in Tokyo.
    
  1. Input comes in and calls the FunctionCall module directly.

    • FunctionCall output results
      [{'tool_call_id': 'get_current_weather:0', 'name': 'get_current_weather', 'content': '{"location": "Tokyo", "temperature": "10", "unit": "celsius"}', 'role': 'tool'}]
      
    • Non-FunctionCall output results
      The weather in Tokyo today is 10 degrees Celsius.
      
  2. Determine whether the result of FunctionCall is a tool call and whether the maximum number of iterations has been reached. If it is a tool call, return to step 1. If it is not a tool call, or the maximum number of iterations has been reached, continue to the next step.

  3. If the maximum number of iterations is reached, an exception is thrown. If the model generates a result normally, the result is output directly. A sketch of this loop follows the list.

    • Throws an exception after reaching the maximum number of iterations.
      ValueError: After retrying 5 times, the function call agent still failed to call successfully.
      
    • Normal result output
      The weather in Tokyo today is 10 degrees Celsius.
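
The iteration described above can be summarized in a short sketch (illustrative only, not the actual FunctionCallAgent implementation):

def agent_loop(fc, query, max_retries=5):
    # Iterate FunctionCall until the model answers in plain text.
    ret = query
    for _ in range(max_retries):
        ret = fc(ret)  # one FunctionCall round; the packaged result is fed back in
        if isinstance(ret, str):  # not a tool call: this is the final answer
            return ret
    raise ValueError(f"After retrying {max_retries} times, the function call agent still failed to call successfully.")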
      

Advanced Agent

An agent is an artificial entity that can use sensors to sense the surrounding environment, make decisions autonomously, and then use actuators to perform corresponding actions. It has autonomy (can run independently without human intervention), responsiveness (can sense environmental changes and respond), sociality (multiple agents can coordinate with each other to complete tasks together), and adaptability (can continuously improve its own performance to better complete tasks).

FunctionCallAgent is the most basic agent. It generates tool call parameters through the large model, then calls the tool and feeds the tool's return results back to the large model, repeating this until the model generates the final answer or the maximum number of iterations is exceeded. This process still has some problems. For example, the large model's generation process is a black box, so people do not know its concrete reasoning; or, when faced with complex problems, the large model cannot give an answer directly. In response to these problems, researchers have proposed various solutions, which have formed various advanced agents. Below we introduce the implementations of several advanced agents in LazyLLM.

ReAct

paper

Idea: ReactAgent handles problems according to the process "Thought -> Action -> Observation -> Thought ... -> Finish". Thought shows how the model solves the problem step by step; Action represents the tool call information; Observation is the result returned by the tool; Finish is the final answer to the problem.

The execution process of this agent is the same as that of FunctionCallAgent. The only differences are the prompt and that ReactAgent must produce a Thought output at each step, whereas the ordinary FunctionCallAgent may output only the tool call information without content. An example is as follows:

import lazyllm
from lazyllm.tools import fc_register, ReactAgent
@fc_register("tool")
def multiply_tool(a: int, b: int) -> int:
    '''
    Multiply two integers and return the result integer

    Args:
        a (int): multiplier
        b (int): multiplier
    '''
    return a * b

@fc_register("tool")
def add_tool(a: int, b: int):
    '''
    Add two integers and return the result integer

    Args:
        a (int): addend
        b (int): addend
    '''
    return a + b
tools = ["multiply_tool", "add_tool"]
llm = lazyllm.TrainableModule("internlm2-chat-20b").start()   # or llm = lazyllm.OnlineChatModule()
agent = ReactAgent(llm, tools)
query = "What is 20+(2*4)? Calculate step by step."
res = agent(query)
print(res)
# 'Answer: The result of 20+(2*4) is 28.'

PlanAndSolve

paper

Idea: PlanAndSolveAgent consists of two components: first, the whole task is decomposed into smaller subtasks; second, these subtasks are executed according to the plan. Finally, the result is output as the answer.

1. After the input comes in, it first passes through the planner model, which generates a plan for solving the problem.

Plan:\n1. Identify the given expression: 20 + (2 * 4)\n2. Perform the multiplication operation inside the parentheses: 2 * 4 = 8\n3. Add the result of the multiplication to 20: 20 + 8 = 28\n4. The final answer is 28.\n\nGiven the above steps taken, the answer to the expression 20 + (2 * 4) is 28. <END_OF_PLAN>

2. The generated plan is parsed so that the solver model can execute it step by step (see the parsing sketch below).

3. FunctionCallAgent is called for each step of the plan, and the final result is returned as the final answer.

The final answer is 28.
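
To illustrate step 2, a plan string like the one above can be split into executable steps with a simple regular expression (an illustrative sketch, not the parser LazyLLM actually uses):

import re

plan = ("Plan:\n1. Identify the given expression: 20 + (2 * 4)\n"
        "2. Perform the multiplication operation inside the parentheses: 2 * 4 = 8\n"
        "3. Add the result of the multiplication to 20: 20 + 8 = 28\n"
        "4. The final answer is 28. <END_OF_PLAN>")

body = plan.split("<END_OF_PLAN>")[0]
steps = [s.strip() for s in re.split(r"\n\s*\d+\.\s*", body)
         if s.strip() and s.strip() != "Plan:"]
# each parsed step would then be handed to FunctionCallAgent in turn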

The following is an example:

import lazyllm
from lazyllm.tools import fc_register, PlanAndSolveAgent

@fc_register("tool")
def multiply(a: int, b: int) -> int:
    """
    Multiply two integers and return the result integer

    Args:
        a (int): multiplier
        b (int): multiplier
    """
    return a * b

@fc_register("tool")
def add(a: int, b: int):
    """
    Add two integers and return the result integer

    Args:
        a (int): addend
        b (int): addend
    """
    return a + b

llm = lazyllm.TrainableModule("internlm2-chat-20b").start()  # or llm = lazyllm.OnlineChatModule(stream=False)
tools = ["multiply", "add"]
agent = PlanAndSolveAgent(llm, tools=tools)
query = "What is 20+(2*4)? Calculate step by step."
ret = agent(query)
print(ret)
# The final answer is 28.

ReWOO (Reasoning WithOut Observation)

paper

Idea: ReWOOAgent consists of three parts: Planner, Worker and Solver. The Planner uses predictable reasoning to create a solution blueprint for a complex task; the Worker interacts with the environment through tool calls and fills actual evidence or observations into the instructions; the Solver processes all the plans and evidence to form a solution to the original task or problem.

  1. The input first calls the planner model to generate a blueprint for solving the problem.

    Plan: To find out the name of the cognac house that makes the main ingredient in The Hennchata, I will first search for information about The Hennchata on Wikipedia.
    #E1 = WikipediaWorker[The Hennchata]
    
    Plan: Once I have the information about The Hennchata, I will look for details about the cognac used in the drink.
    #E2 = LLMWorker[What cognac is used in The Hennchata, based on #E1]
    
    Plan: After identifying the cognac, I will search for the cognac house that produces it on Wikipedia.
    #E3 = WikipediaWorker[producer of cognac used in The Hennchata]
    
    Plan: Finally, I will extract the name of the cognac house from the Wikipedia page.
    #E4 = LLMWorker[What is the name of the cognac house in #E3]
    

  2. Parse the generated plan blueprint, call the corresponding tool, and fill the results returned by the tool into the corresponding instructions (see the parsing sketch after this list).

    Plan: To find out the name of the cognac house that makes the main ingredient in The Hennchata, I will first search for information about The Hennchata on Wikipedia.
    Evidence:
    The Hennchata is a cocktail consisting of Hennessy cognac and Mexican rice horchata agua fresca. It was invented in 2013 by Jorge Sánchez at his Chaco's Mexican restaurant in San Jose, California.
    Plan: Once I have the information about The Hennchata, I will look for details about the cognac used in the drink.
    Evidence:
    Hennessy cognac.
    Plan: After identifying the cognac, I will search for the cognac house that produces it on Wikipedia.
    Evidence:
    Drinks are liquids that can be consumed, with drinking water being the base ingredient for many of them. In addition to basic needs, drinks form part of the culture of human society. In a commercial setting, drinks, other than water, may be termed beverages.
    Plan: Finally, I will extract the name of the cognac house from the Wikipedia page.
    Evidence:
    The name of the cognac house is not specified.
    

  3. Stitch the plan and the tool execution results together, then call the solver model to generate the final answer.

    '\nHennessy '
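
To illustrate step 2, blueprint lines of the form "#E1 = WikipediaWorker[The Hennchata]" can be recovered with a regular expression like this (an illustrative sketch, not LazyLLM's actual parser):

import re

line = "#E2 = LLMWorker[What cognac is used in The Hennchata, based on #E1]"
m = re.match(r"#E(\d+)\s*=\s*(\w+)\[(.*)\]", line)
evidence_id, tool_name, tool_input = m.groups()
# evidence_id='2', tool_name='LLMWorker',
# tool_input='What cognac is used in The Hennchata, based on #E1';
# '#E1' is later replaced with the output of the first tool call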
    

Here is an example:

import lazyllm
from lazyllm import fc_register, ReWOOAgent, deploy
import wikipedia
@fc_register("tool")
def WikipediaWorker(input: str):
    """
    Worker that searches for similar page contents from Wikipedia. Useful when you need to get holistic knowledge about people, places, companies, historical events, or other subjects. The response is long and might contain some irrelevant information. Input should be a search query.

    Args:
        input (str): search query.
    """
    try:
        evidence = wikipedia.page(input).content
        evidence = evidence.split("\n\n")[0]
    except (wikipedia.PageError, wikipedia.DisambiguationError):
        evidence = f"Could not find [{input}]. Similar: {wikipedia.search(input)}"
    return evidence
@fc_register("tool")
def LLMWorker(input: str):
    """
    A pretrained LLM like yourself. Useful when you need to act with general world knowledge and common sense. Prioritize it when you are confident in solving the problem yourself. Input can be any instruction.

    Args:
        input (str): instruction
    """
    llm = lazyllm.OnlineChatModule(stream=False)
    query = f"Respond in short directly with no extra words.\n\n{input}"
    response = llm(query, llm_chat_history=[])
    return response
tools = ["WikipediaWorker", "LLMWorker"]
llm = lazyllm.TrainableModule("Qwen2-72B-Instruct-AWQ").deploy_method(deploy.vllm).start()  # or llm = lazyllm.OnlineChatModule()
agent = ReWOOAgent(llm, tools=tools)
query = "What is the name of the cognac house that makes the main ingredient in The Hennchata?"
ret = agent(query)
print(ret)
# '\nHennessy '