LLM Agent Overview

Published at

2023/12/29

Last edited time

2023/12/29 14:25

Created

2023/12/29 13:24

Section

Prompt Enginnering

Status

Done

Series

Concept of Autonomous Agent Sys. [1]

•

Planning

◦

Subgoal and decomposition: The agent breaks down large tasks into smaller, manageable subgoals, enabling efficient handling of complex tasks.

◦

Reflection and refinement/critic: The agent can do self-criticism and self-reflection over past actions, learn from mistakes and refine them for future steps, thereby improving the quality of final results.

•

Memory

◦

Short-term memory: I would consider all the in-context learning (See Prompt Engineering) as utilizing short-term memory of the model to learn.

◦

Long-term memory: This provides the agent with the capability to retain and recall (infinite) information over extended periods, often by leveraging an external vector store and fast retrieval.

•

Tool use

◦

The agent learns to call external APIs for extra information that is missing from the model weights (often hard to change after pre-training), including current information, code execution capability, access to proprietary information sources and more.

RAG

•

Retrieval-Augmented Generation

Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks

Large pre-trained language models have been shown to store factual knowledge in their parameters, and achieve state-of-the-art results when fine-tuned on downstream NLP tasks. However, their...

https://arxiv.org/abs/2005.11401

•

Problems of LLM Models

Their ability to access and precisely manipulate knowledge is still limited

Needs to provide provenance for their decisions and updating their world knowledge

•

RAG is a technique for augmenting LLM knowledge with additional, often private or real-time, data. LLMs can reason about wide-ranging topics, but their knowledge is limited to the public data up to a specific point in time that they were trained on. If you want to build AI applications that can reason about private data or data introduced after a model’s cutoff date, you need to augment the knowledge of the model with the specific information it needs. The process of bringing the appropriate information and inserting it into the model prompt is known as Retrieval Augmented Generation (RAG). [2]

Get Insight from LangChain

What are the Agents? [3]

This is the chain responsible for deciding what step to take next. This is powered by a language model and a prompt. The inputs to this chain are:

Tools: Descriptions of available tools

User input: The high level objective

Intermediate steps: Any (action, tool output) pairs previously executed in order to achieve the user input

The output is the next action(s) to take or the final response to send to the user (AgentActions or AgentFinish). An action specifies a tool and the input to that tool.

Code

langchain.agents.agent.Agent — LangChain 0.0.352

•

Agent that calls the language model and deciding the action.

◦

This is driven by an LLMChain. The prompt in the LLMChain MUST include a variable called “agent_scratchpad” where the agent can put its intermediary work.

▪

scratchpad : a small, fast memory for the temporary storage of data.

•

Usage Example

Define the agent

•

let’s construct a custom agent that has access to a custom tool.

load the language model we’re going to use to control the agent.

from langchain.chat_models import ChatOpenAI

llm = ChatOpenAI(model="gpt-3.5-turbo", temperature=0)
Python
복사

define some tools to use

•

To pass in our tools to the agent, we just need to format them to the OpenAI function format and pass them to our model.

from langchain.agents import tool


@tool
def get_word_length(word: str) -> int:
    """Returns the length of a word."""
    return len(word)


tools = [get_word_length]


#---------- Format -------------------------
from langchain.tools.render import format_tool_to_openai_function

llm_with_tools = llm.bind(functions=[format_tool_to_openai_function(t) for t in tools])
Python
복사

Create Prompts

from langchain.prompts import ChatPromptTemplate, MessagesPlaceholder

prompt = ChatPromptTemplate.from_messages(
    [
        (
            "system",
            "You are very powerful assistant, but bad at calculating lengths of words.",
        ),
        ("user", "{input}"),
        MessagesPlaceholder(variable_name="agent_scratchpad"),
    ]
)
Python
복사

Create the custom agent

from langchain.agents.format_scratchpad import format_to_openai_function_messages
from langchain.agents.output_parsers import OpenAIFunctionsAgentOutputParser

agent = (
    {
        "input": lambda x: x["input"],
        "agent_scratchpad": lambda x: format_to_openai_function_messages(
            x["intermediate_steps"]
        ),
    }
    | prompt
    | llm_with_tools
    | OpenAIFunctionsAgentOutputParser()
)
Python
복사

Adding Memories

add a place for memory in the prompt

from langchain.prompts import MessagesPlaceholder

MEMORY_KEY = "chat_history"
prompt = ChatPromptTemplate.from_messages(
    [
        (
            "system",
            "You are very powerful assistant, but bad at calculating lengths of words.",
        ),
        MessagesPlaceholder(variable_name=MEMORY_KEY),
        ("user", "{input}"),
        MessagesPlaceholder(variable_name="agent_scratchpad"),
    ]
)
Python
복사

set up a list to track the chat history

from langchain_core.messages import AIMessage, HumanMessage

chat_history = []
Python
복사

#PUT ALL TOGETHER
agent = (
    {
        "input": lambda x: x["input"],
        "agent_scratchpad": lambda x: format_to_openai_function_messages(
            x["intermediate_steps"]
        ),
        "chat_history": lambda x: x["chat_history"],
    }
    | prompt
    | llm_with_tools
    | OpenAIFunctionsAgentOutputParser()
)
agent_executor = AgentExecutor(agent=agent, tools=tools, verbose=True)
Python
복사

cf) AgentExecutor

We can import and use the AgentExecutor class. This bundles up all of the above and adds in error handling, early stopping, tracing, and other quality-of-life improvements that reduce safeguards you need to write.

References

[1] LLM Powered Autonomous Agents | Lil'Log (lilianweng.github.io)

LLM Powered Autonomous Agents

Building agents with LLM (large language model) as its core controller is a cool concept. Several proof-of-concepts demos, such as AutoGPT, GPT-Engineer and BabyAGI, serve as inspiring examples. The potentiality of LLM extends beyond generating well-written copies, stories, essays and programs; it can be framed as a powerful general problem solver. Agent System Overview In a LLM-powered autonomous agent system, LLM functions as the agent’s brain, complemented by several key components:

https://lilianweng.github.io/posts/2023-06-23-agent/

[2] Retrieval-augmented generation (RAG) | ️ Langchain

Retrieval-augmented generation (RAG) | 🦜️🔗 Langchain

Open In Colab

https://python.langchain.com/docs/use_cases/question_answering/

[3] Agents | ️ Langchain

Agents | 🦜️🔗 Langchain

The core idea of agents is to use a language model to choose a sequence

https://python.langchain.com/docs/modules/agents/