
OpenAI Functions + LangChain : Building a Multi Tool Agent

Sam Witteveen · 5 min read

Based on Sam Witteveen's video on YouTube. If you like this content, support the original creator by watching, liking, and subscribing.

TL;DR

Define each external capability as a function with a clear name, description, and a validated argument schema so the model can generate correct JSON inputs.

Briefing

OpenAI’s function-calling system, wired through LangChain, can turn a plain chat model into a finance assistant that reliably selects the right API tool, extracts the right parameters, and returns a grounded answer. The core workflow is: define one or more callable functions (with names, descriptions, and a strict parameter schema), let the model decide when to invoke them, execute the tool with the model-provided arguments, then feed the tool result back so the model can produce the final natural-language response.
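To make the first step concrete, here is a minimal sketch of one function definition in OpenAI's function-calling format, using the get_stock_ticker_price tool named later in this summary; the description text and parameter wording are illustrative, not copied from the video.

```python
# One OpenAI-format function definition: a name, a description, and a strict
# JSON Schema for the arguments. Wording here is illustrative.
functions = [
    {
        "name": "get_stock_ticker_price",
        "description": "Get the latest closing price for a stock ticker.",
        "parameters": {
            "type": "object",
            "properties": {
                "ticker": {
                    "type": "string",
                    "description": "Yahoo Finance ticker symbol, e.g. GOOGL",
                }
            },
            "required": ["ticker"],
        },
    }
]
```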

The example starts with a simple “finance bot” built on the Yahoo Finance API. Users ask questions like “What is the price of Google stock?” or “Has Apple gone up over the past 90 days?” The key challenge—users won’t know ticker symbols—is handled by the model’s built-in knowledge: when asked about Apple or Google, it outputs the correct Yahoo Finance ticker behind the scenes. In the manual setup, the assistant first sends a human message plus a list of function definitions to the model. The model responds not with an answer, but with a structured function call: the function name (e.g., “get_stock_ticker_price”) and JSON arguments (e.g., the ticker for Google). The code then runs the corresponding tool using those arguments, retrieves the real price, and returns that result to the model using a dedicated function message. Only after the tool output is provided does the model generate the final response such as “The current price of Google stock is 123.83.”
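A rough sketch of this "long way," assuming the classic LangChain chat API of that era and an OPENAI_API_KEY in the environment; the Yahoo Finance helper is a stand-in for the one used in the video, and `functions` is the list from the sketch above.

```python
import json

import yfinance as yf
from langchain.chat_models import ChatOpenAI
from langchain.schema import FunctionMessage, HumanMessage

def get_stock_ticker_price(ticker: str) -> float:
    """Stand-in Yahoo Finance helper: latest closing price for a ticker."""
    return float(yf.Ticker(ticker).history(period="1d")["Close"].iloc[-1])

llm = ChatOpenAI(model="gpt-3.5-turbo-0613", temperature=0)
messages = [HumanMessage(content="What is the price of Google stock?")]

# First call: the model replies with a function-call request, not a price.
ai_msg = llm.predict_messages(messages, functions=functions)
call = ai_msg.additional_kwargs["function_call"]   # {"name": ..., "arguments": ...}
args = json.loads(call["arguments"])               # e.g. {"ticker": "GOOGL"}

# Run the real tool, then hand the result back as a function message.
result = get_stock_ticker_price(**args)
messages += [ai_msg, FunctionMessage(name=call["name"], content=str(result))]

# Second call: with the tool output in context, the model writes the answer.
final = llm.predict_messages(messages, functions=functions)
print(final.content)
```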

LangChain then streamlines this “long way” by using an agent type designed for OpenAI functions. Instead of manually converting tools into OpenAI function formats and manually routing messages, the agent handles tool selection, argument passing, tool execution, and response synthesis. This approach is presented as an advantage over older prompt-based patterns (like ReAct-style tool prompting): it tends to improve tool selection and reasoning while reducing token waste from heavy in-context examples. Tradeoffs remain: customization is less straightforward than prompt tinkering, the setup is currently more tightly coupled to OpenAI’s function-calling conventions, and tool descriptions/schemas still consume tokens.
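The streamlined version reduces to a few lines. A sketch, assuming the classic initialize_agent API and a `tools` list of LangChain Tool objects wrapping the finance functions:

```python
from langchain.agents import AgentType, initialize_agent
from langchain.chat_models import ChatOpenAI

llm = ChatOpenAI(model="gpt-3.5-turbo-0613", temperature=0)

# `tools` is a list of LangChain Tool/StructuredTool objects wrapping the
# finance functions; the agent converts them to OpenAI function definitions
# and handles the call/execute/respond loop automatically.
agent = initialize_agent(tools, llm, agent=AgentType.OPENAI_FUNCTIONS, verbose=True)
agent.run("Has Apple gone up over the past 90 days?")
```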

The finance bot expands from one tool to multiple tools. One function computes percentage price change over a time window given a ticker and a number of days; another finds the best-performing stock among a list of tickers over a specified period. The model can interpret user time expressions and convert them into the tool’s expected inputs—asking for “three months” maps to “90 days,” while “a month” maps to a shorter day count. It also handles mixed ticker formats: users can provide full company names (Google, Meta, Microsoft) and the model supplies the correct Yahoo Finance tickers. The same mechanism even works for crypto comparisons; when asked about Bitcoin over three months, the model uses Yahoo’s expected symbol format (not just a generic “BTC”), enabling comparisons across stocks and cryptocurrencies.
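The two additional tools might look roughly like this; the function names and signatures below are inferred from the summary (a ticker plus a day count, and a list of tickers plus a day count), not copied from the video.

```python
from datetime import datetime, timedelta

import yfinance as yf

def get_price_change_percent(ticker: str, days_ago: int) -> float:
    """Percentage price change for `ticker` over the last `days_ago` days."""
    end = datetime.now()
    start = end - timedelta(days=days_ago)
    closes = yf.Ticker(ticker).history(start=start, end=end)["Close"]
    return round((closes.iloc[-1] - closes.iloc[0]) / closes.iloc[0] * 100, 2)

def get_best_performing(tickers: list[str], days_ago: int) -> tuple[str, float]:
    """Best-performing ticker (and its return %) over the window."""
    changes = {t: get_price_change_percent(t, days_ago) for t in tickers}
    best = max(changes, key=changes.get)
    return best, changes[best]

# Works for crypto too, since Yahoo uses symbols like "BTC-USD":
# get_best_performing(["GOOGL", "META", "MSFT", "BTC-USD"], days_ago=90)
```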

Overall, the transcript demonstrates a practical recipe for building multi-tool agents: strict schemas for tool inputs, function-call orchestration, and agent-based automation in LangChain—resulting in a conversational system that can answer finance questions grounded in external APIs.

Cornell Notes

OpenAI function calling plus LangChain can power a multi-tool finance agent that answers stock questions using the Yahoo Finance API. The system works by defining tools as functions with clear names, descriptions, and a Pydantic-based argument schema. The model first returns a structured function call (function name + JSON arguments) instead of a direct answer, then the tool runs and its result is sent back as a function message so the model can produce the final response. With multiple tools, the agent can compute percentage changes and pick the best-performing stock among a list. A major benefit is that the model converts natural time ranges like “three months” into the tool’s required “days” input and resolves company names to the correct Yahoo tickers.

How does function calling change the way a chat model answers a question like “What is the price of Google stock?”

Instead of replying with a price directly, the model returns a function call payload: it names the function to use (e.g., get_stock_ticker_price) and provides JSON arguments (the Yahoo Finance ticker for Google). The application executes the tool using those arguments, retrieves the real price from the Yahoo Finance API, then sends the tool output back to the model as a function message. Only after that does the model generate the final natural-language answer containing the price.
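Illustratively, that first response carries no answer text, only the structured call (values here are made up):

```python
# Shape of the first response: no answer text, only a structured call.
ai_msg.content                     # ""  (no direct answer yet)
ai_msg.additional_kwargs
# {"function_call": {"name": "get_stock_ticker_price",
#                    "arguments": '{"ticker": "GOOGL"}'}}
```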

Why does the tool definition need a strict argument schema (and what role does Pydantic play)?

Tool definitions require a schema so the model knows exactly what inputs to supply and so the system can validate the model's arguments before executing the tool. In the transcript, a Pydantic class serves as the argument schema (passed as args_schema to the custom tool). Without that schema, the setup can fail with validation errors when the tool is constructed. The schema also clarifies which inputs are required versus optional/defaulted.
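A minimal sketch of such a schema attached to a tool; the field names and descriptions below are illustrative, and the tool wraps the get_price_change_percent function from the earlier sketch.

```python
from langchain.tools import StructuredTool
from pydantic import BaseModel, Field

class PriceChangeInput(BaseModel):
    ticker: str = Field(..., description="Yahoo Finance ticker symbol")
    days_ago: int = Field(..., description="Look-back window in days")

price_change_tool = StructuredTool.from_function(
    func=get_price_change_percent,   # from the earlier multi-tool sketch
    name="get_price_change_percent",
    description="Percentage change in a stock's price over the last N days",
    args_schema=PriceChangeInput,
)
```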

What message types are involved in the manual orchestration flow?

The manual flow uses LangChain chat roles: a human message for the user query, an AI message that contains the model’s function call request (with no direct answer text), and then a function message that carries the tool’s result back to the model. After the function message is added to the message list, a subsequent model call produces the final answer.
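Put together, the message list just before the final model call looks roughly like this (function-call details abbreviated, values illustrative):

```python
from langchain.schema import AIMessage, FunctionMessage, HumanMessage

messages = [
    HumanMessage(content="What is the price of Google stock?"),
    AIMessage(content="", additional_kwargs={"function_call": {
        "name": "get_stock_ticker_price",
        "arguments": '{"ticker": "GOOGL"}'}}),
    FunctionMessage(name="get_stock_ticker_price", content="123.83"),
]
# A follow-up model call on this list yields the final natural-language answer.
```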

What advantage does LangChain’s OpenAI Functions agent provide over manual function-call handling?

The agent automates the routing steps: it selects which tool to call, formats the function-call request, executes the tool, and then synthesizes the final response. That removes the need to manually convert each tool into OpenAI’s function format and to manually parse and re-inject tool results. The transcript also frames this as often improving tool selection and reducing token usage compared with prompt-heavy ReAct-style prompting.

How does the system handle time expressions like “three months” when the tool expects “days”?

The tool expects a numeric days parameter, but the model interprets natural language time ranges and converts them. In the transcript, “three months” is converted to 90 days, producing the same output as when the user explicitly requests “over the past 90 days.” The same logic is implied for other ranges like “past month,” where the model would choose an appropriate day count.

How can users ask for best-performing stocks using company names instead of tickers?

Users can supply names like Google, Meta, and Microsoft, and the model resolves them to the correct Yahoo Finance tickers before calling the best-performing tool. The tool then compares the stocks over the requested window and returns which ticker performed best along with the computed return percentage (e.g., Meta at 32.4% over three months in the example).

Review Questions

  1. What sequence of model outputs and tool executions is required before the assistant can produce a final price answer under function calling?
  2. How does the argument schema influence both correctness and error handling when building custom tools in LangChain?
  3. In the multi-tool setup, how does the agent ensure that natural time phrases map to the tool’s required numeric “days” input?

Key Points

  1. Define each external capability as a function with a clear name, description, and a validated argument schema so the model can generate correct JSON inputs.

  2. Let the model return a structured function call first; execute the corresponding tool using the provided arguments rather than trusting the model’s text answer.

  3. Send tool results back to the model using a function message so it can ground its final response in real API data.

  4. Use LangChain’s OpenAI Functions agent to automate tool selection, argument passing, tool execution, and response synthesis.

  5. Add multiple tools (e.g., price change and best-performing stock) to support richer finance queries in a single conversational flow.

  6. Rely on the model’s natural-language understanding to convert time windows like “three months” into the tool’s expected “days” parameter.

  7. Expect tradeoffs: less prompt-level customization, tighter coupling to OpenAI’s function-calling format, and token usage for tool descriptions/schemas.

Highlights

The manual flow shows the model first requesting a function call (with JSON arguments), then the app running the Yahoo Finance tool, then the model producing the final answer after receiving the tool output.
LangChain’s OpenAI Functions agent removes the manual message plumbing and makes multi-tool agents practical for real conversations.
Natural time ranges are converted automatically—“three months” becomes 90 days—so users don’t need to know the tool’s input format.
The same mechanism resolves company names to Yahoo Finance tickers and can handle crypto symbols like BTC-USD for Bitcoin queries.
