A comprehensive reference of terminology used in AI agent development
AI Agent
An AI agent is a system that perceives its environment through inputs, makes decisions based on those inputs, and acts to achieve specific goals. In the context of modern AI development, agents often utilize large language models (LLMs) or other AI components to process information and determine actions.
Example: A customer service agent that can access product databases, interpret customer inquiries, and provide relevant responses while following company policies.
Action Space
The set of all possible actions an agent can take within its environment. In AI agent development, this typically refers to the range of functions, API calls, or outputs that an agent can produce.
Example: A research agent might have an action space that includes searching web APIs, querying databases, summarizing documents, and generating reports.
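One common way to make an action space explicit in code is an enumeration mapped to handler functions, so the agent can only ever dispatch actions the system defines. This is a minimal sketch; the action names and stub handlers are illustrative, not from any particular framework.

```python
from enum import Enum

class Action(Enum):
    SEARCH_WEB = "search_web"
    QUERY_DATABASE = "query_database"
    SUMMARIZE = "summarize"

# Hypothetical handlers; real ones would call APIs or models.
def search_web(query):
    return f"results for {query!r}"

def query_database(query):
    return f"rows matching {query!r}"

def summarize(text):
    return text[:50]

# The action space is exactly the set of actions the dispatcher accepts.
HANDLERS = {
    Action.SEARCH_WEB: search_web,
    Action.QUERY_DATABASE: query_database,
    Action.SUMMARIZE: summarize,
}

def execute(action: Action, argument: str) -> str:
    return HANDLERS[action](argument)

print(execute(Action.SEARCH_WEB, "AI agents"))
```

Constraining the agent to a closed dispatcher like this also doubles as a simple guardrail: an action the model invents but the registry lacks simply cannot run.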
AutoGPT
An open-source framework for building autonomous AI agents that can break down complex goals into subtasks and execute them using language models. AutoGPT implements recursive self-improvement mechanisms to refine its approach based on results.
Example: Using AutoGPT to create a market research agent that can gather information from multiple sources, analyze trends, and generate comprehensive reports with minimal human supervision.
Autonomous Agent
An AI agent capable of operating independently without continuous human intervention, making decisions and taking actions to achieve its goals. Autonomous agents typically incorporate planning, memory, and self-evaluation components.
Example: A social media management agent that schedules posts, responds to comments, analyzes engagement metrics, and adjusts strategy without requiring human approval for each action.
BabyAGI
An open-source task management system that uses language models to create, prioritize, and execute tasks based on the outcomes of previous tasks. BabyAGI implements a simple but effective task-driven autonomous agent architecture.
Example: Using BabyAGI to manage a research project where it progressively explores a topic, identifies knowledge gaps, and creates new research tasks to fill those gaps.
Chain-of-Thought (CoT) Prompting
A prompting technique that encourages language models to break down complex reasoning tasks into step-by-step thought processes before providing a final answer. CoT significantly improves performance on tasks requiring logical reasoning, multi-step computation, or careful analysis.
Example: Instead of directly asking "What's the sum of the squares of 13 and 14?", a CoT prompt would guide the model: "Let's calculate this step by step. First, I'll find the square of 13, which is 13×13 = 169. Then I'll find the square of 14, which is 14×14 = 196. Finally, I'll add these results: 169 + 196 = 365."
Context Window
The maximum amount of text (measured in tokens) that a language model can process at once. This includes both the input prompt and the generated output. The context window represents a fundamental limitation of current language models and influences agent design.
Example: GPT-4 has a context window of 32,000 tokens (approximately 24,000 words), while Claude 3 Opus has a context window of about 200,000 tokens. When building an agent to analyze long documents, these limitations determine how much text can be processed in a single operation.
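A rough rule of thumb for English text is about 4 characters per token; exact counts require the model's own tokenizer (e.g. tiktoken for GPT models). A sketch of checking whether a prompt fits a window under that assumption:

```python
def approx_tokens(text: str) -> int:
    # Rough heuristic: ~4 characters per token for English text.
    # Real counts require the model's tokenizer.
    return max(1, len(text) // 4)

def fits_context(prompt: str, context_window: int, reserved_for_output: int = 1000) -> bool:
    # The window must hold both the input prompt and the generated output,
    # so some of the budget is reserved for the model's reply.
    return approx_tokens(prompt) + reserved_for_output <= context_window

print(fits_context("short prompt", 32000))
```

Agent frameworks typically run a check like this before each model call, truncating or summarizing older context when the budget is exceeded.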
CrewAI
A framework for building sophisticated multi-agent systems where each agent has a specialized role, and the agents collaborate to accomplish complex tasks. CrewAI facilitates structured communication between agents and manages workflow organization.
Example: Creating a content production system with a researcher agent that gathers information, a writer agent that drafts content, an editor agent that refines the writing, and a publisher agent that formats and distributes the final content.
Decision Making
The process by which an AI agent selects an action or response based on available information, goals, and constraints. Effective decision making is central to agent performance and typically involves evaluating options, predicting outcomes, and selecting the most promising approach.
Example: A customer service agent deciding whether to offer a refund, replacement, or alternative solution based on the customer's history, the nature of the issue, and company policies.
Embeddings
Dense vector representations of text, images, or other data that capture semantic meaning in a way that allows for efficient search, comparison, and retrieval. Embeddings are fundamental to many advanced agent capabilities, especially knowledge retrieval systems.
Example: A document retrieval system converts all documents into embeddings, then finds the most relevant documents by calculating the similarity between the query embedding and the document embeddings.
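The similarity calculation between embeddings is usually cosine similarity. The vectors below are tiny made-up examples; real embeddings come from an embedding model and have hundreds or thousands of dimensions.

```python
import math

def cosine_similarity(a, b):
    # Cosine similarity: dot product of the vectors divided by
    # the product of their magnitudes; 1.0 means identical direction.
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(x * x for x in b))
    return dot / (norm_a * norm_b)

doc_embedding = [0.2, 0.8, 0.1]     # made-up document embedding
query_embedding = [0.25, 0.75, 0.0]  # made-up query embedding
print(cosine_similarity(doc_embedding, query_embedding))
```

Ranking all documents by this score against the query embedding is exactly the retrieval step described above.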
Few-Shot Learning
A technique where a model is given a few examples of a task before being asked to perform similar tasks. In agent development, few-shot learning is often implemented as in-context examples that guide the model's outputs to match a desired format or approach.
Example: Providing a language model with three examples of properly formatted customer service responses before asking it to handle a new customer inquiry in the same style.
Function Calling
A capability of advanced language models that allows them to generate structured outputs suitable for calling external functions or APIs. Function calling enables more reliable tool use by providing a clear interface between natural language processing and programmatic actions.
Example: An agent analyzing weather data might use function calling to generate a properly formatted query to a weather API: {"location": "San Francisco", "date": "2025-04-11", "metrics": ["temperature", "precipitation", "wind_speed"]}
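On the application side, a function-call response is typically parsed and dispatched to real code. A minimal sketch, where the weather lookup is a stub standing in for a real API and the JSON string mirrors the example above:

```python
import json

def get_weather(location, date, metrics):
    # Stub standing in for a real weather API call.
    return {m: "n/a" for m in metrics}

# The model's function-call output arrives as a JSON string of arguments.
model_output = (
    '{"location": "San Francisco", "date": "2025-04-11", '
    '"metrics": ["temperature", "precipitation", "wind_speed"]}'
)

args = json.loads(model_output)
result = get_weather(**args)  # unpack the structured arguments
print(result)
```

The value of the structured interface is visible here: because the model emits arguments matching the function's signature, the glue code is just `json.loads` plus `**args`.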
Guardrails
Mechanisms implemented to ensure AI agents operate within defined boundaries, preventing harmful, unethical, or undesired behaviors. Guardrails may include content filtering, input validation, output verification, and safety-oriented prompt engineering.
Example: A content generation agent might have guardrails that prevent it from creating violent content, verify factual claims against a trusted database, and limit the types of files it can access.
Hallucination
The tendency of language models to generate content that is factually incorrect, made-up, or ungrounded in the provided context. Hallucinations are a significant challenge in AI agent development, particularly for applications requiring factual accuracy.
Example: A research agent might hallucinate by citing non-existent papers, inventing statistics, or creating plausible but incorrect explanations when it doesn't know the answer to a question.
In-Context Learning
The ability of language models to adapt to new tasks based on examples or instructions provided in the prompt, without updating model weights. In-context learning is a key technique for tailoring agent behavior without fine-tuning.
Example: Teaching an agent to classify customer feedback into categories by showing it several examples of properly classified feedback in the prompt, then asking it to classify new examples.
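Assembling such a prompt is plain string construction. A sketch of the classification case just described; the example feedback and category labels are invented for illustration:

```python
# In-context examples teach the format and categories without fine-tuning.
EXAMPLES = [
    ("The app crashes every time I open it.", "bug report"),
    ("Love the new dark mode!", "praise"),
    ("Could you add an export-to-CSV option?", "feature request"),
]

def build_prompt(new_feedback: str) -> str:
    lines = ["Classify each piece of customer feedback.", ""]
    for text, label in EXAMPLES:
        lines.append(f"Feedback: {text}")
        lines.append(f"Category: {label}")
        lines.append("")
    # The unfinished final entry cues the model to complete the pattern.
    lines.append(f"Feedback: {new_feedback}")
    lines.append("Category:")
    return "\n".join(lines)

print(build_prompt("The checkout page is confusing."))
```

The trailing incomplete "Category:" line is the key detail: the model continues the established pattern, so its next tokens are the classification itself.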
JSON Mode
A specialized output format available in some language models that ensures responses are formatted as valid JSON objects. JSON mode is particularly useful for agent development as it allows for structured, parseable outputs that can be directly used by other systems.
Example: Enabling JSON mode when asking an agent to analyze sentiment in customer reviews, resulting in a structured output like:
{
  "overall_sentiment": "positive",
  "specific_aspects": {
    "product_quality": "positive",
    "shipping_speed": "negative",
    "customer_service": "neutral"
  },
  "key_points": [
    "Product exceeded expectations",
    "Shipping took longer than promised"
  ]
}
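Because JSON-mode output is guaranteed parseable, downstream code can consume it directly. A minimal sketch of handling the response above, with a literal string standing in for the model reply:

```python
import json

# Stand-in for a JSON-mode model reply.
model_reply = """
{
  "overall_sentiment": "positive",
  "specific_aspects": {
    "product_quality": "positive",
    "shipping_speed": "negative",
    "customer_service": "neutral"
  },
  "key_points": [
    "Product exceeded expectations",
    "Shipping took longer than promised"
  ]
}
"""

data = json.loads(model_reply)  # guaranteed to parse in JSON mode
# Structured output lets other systems act on specific fields directly,
# e.g. flagging aspects with negative sentiment for follow-up.
negatives = [k for k, v in data["specific_aspects"].items() if v == "negative"]
print(negatives)
```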
Knowledge Retrieval
The process by which an AI agent accesses specific information relevant to a task or query. Knowledge retrieval systems typically involve vector databases, search algorithms, and relevance ranking to identify and extract the most useful information.
Example: A technical support agent using knowledge retrieval to find relevant documentation, previous similar issues, and potential solutions when presented with a customer's problem.
LangChain
A popular open-source framework for developing applications powered by language models. LangChain provides components for working with language models, embedding models, document processing, memories, agents, and various tools. It emphasizes composability and standardized interfaces.
Example: Using LangChain to build a research agent that can retrieve information from multiple sources, process documents, maintain conversation history, and produce comprehensive reports.
Large Language Model (LLM)
A type of AI model trained on vast amounts of text data that can generate human-like text, understand context, follow instructions, and perform various language-based tasks. LLMs serve as the foundation for most modern AI agents, providing capabilities such as reasoning, generation, and comprehension.
Example: GPT-4, Claude, Gemini, and Llama are examples of large language models that can be used as the core of AI agent systems.
LlamaIndex
A data framework designed to connect custom data sources to large language models. LlamaIndex (formerly GPT Index) specializes in creating, maintaining, and querying indexes of structured and unstructured data, making it particularly well-suited for knowledge-intensive applications.
Example: Using LlamaIndex to build a legal research agent that can ingest, index, and query large collections of case law, contracts, and legal opinions to provide relevant information for legal questions.
Memory
A system that allows AI agents to store and retrieve information across multiple interactions or processing steps. Memory components can include conversation history, key facts, user preferences, and intermediate results.
Example: A customer service agent using memory to recall that a customer previously mentioned having a premium subscription, allowing it to offer appropriate service options without requiring the customer to repeat this information.
Multi-Agent System
A system where multiple AI agents work together, often with specialized roles, to accomplish complex tasks. Multi-agent systems typically involve structured communication protocols, role assignments, and coordination mechanisms.
Example: A content creation system with specialized agents for research, outlining, writing, editing, fact-checking, and publishing, each performing its role and passing results to the next agent in the workflow.
Orchestration
The process of coordinating multiple components, agents, or services to work together effectively. Orchestration typically involves managing workflows, handling errors, scheduling tasks, and ensuring proper communication between components.
Example: An e-commerce assistant that orchestrates interactions between a product search service, inventory checking system, recommendation engine, and order processing service to help customers find and purchase products.
Planning
The process by which an AI agent formulates a sequence of actions to achieve a goal. Planning typically involves breaking down complex tasks into subtasks, considering dependencies, and establishing an execution order.
Example: A research agent creating a plan to investigate a topic by first defining key questions, then identifying information sources, gathering relevant data, analyzing findings, and finally synthesizing results into a comprehensive report.
Prompt Engineering
The practice of designing, refining, and optimizing inputs to language models to guide them toward desired outputs. Prompt engineering is a key skill in AI agent development, encompassing techniques such as few-shot learning, chain-of-thought prompting, and system role definition.
Example: Creating a carefully structured prompt for a customer service agent that includes the company's tone guidelines, specific product information, examples of good responses, and instructions for handling difficult situations.
RAG (Retrieval-Augmented Generation)
An approach that enhances language model outputs by first retrieving relevant information from external knowledge sources, then using that information to generate more accurate, informed responses. RAG combines the strengths of retrieval-based and generative approaches.
Example: A technical support agent using RAG to retrieve relevant sections from product manuals and troubleshooting guides before generating a specific, accurate solution to a customer's technical problem.
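The retrieve-then-generate pattern can be sketched end to end with word overlap standing in for embedding similarity; a production system would use a vector database and a real model call. All document text here is invented:

```python
# Toy knowledge base of support snippets.
DOCS = [
    "To reset the router, hold the reset button for ten seconds.",
    "The warranty covers manufacturing defects for two years.",
    "Firmware updates are installed from the admin panel.",
]

def retrieve(query, docs, k=1):
    # Toy relevance score: shared words. Real RAG uses embedding similarity.
    q_words = set(query.lower().split())
    scored = sorted(docs, key=lambda d: len(q_words & set(d.lower().split())),
                    reverse=True)
    return scored[:k]

def build_augmented_prompt(query):
    # Retrieved passages are prepended so generation is grounded in them.
    context = "\n".join(retrieve(query, DOCS))
    return f"Context:\n{context}\n\nQuestion: {query}\nAnswer using only the context above."

print(build_augmented_prompt("How do I reset the router?"))
```

The final instruction, "Answer using only the context above," is what ties the generative step to the retrieved evidence and reduces hallucination.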
ReAct
A framework that interleaves reasoning and action steps, allowing agents to plan, execute, observe, and refine their approach. ReAct prompts typically follow a "Thought, Action, Observation" cycle that makes decision-making explicit and verifiable.
Example: A research agent using ReAct to solve a complex problem: "Thought: I need to find recent climate data. Action: Search for 'latest IPCC climate report'. Observation: Found the IPCC Sixth Assessment Report. Thought: Now I need specific information on sea level rise..."
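The Thought/Action/Observation cycle is implemented as a loop: the model proposes an action, the harness executes it, and the observation is appended to the transcript for the next step. The model and search tool below are scripted stubs so the sketch is self-contained:

```python
# Scripted stand-in for a language model emitting ReAct-format steps.
SCRIPTED_STEPS = iter([
    "Thought: I need recent climate data.\nAction: search[latest IPCC climate report]",
    "Thought: I have what I need.\nAction: finish[IPCC Sixth Assessment Report]",
])

def model(transcript):
    return next(SCRIPTED_STEPS)

def search(query):
    return "Found the IPCC Sixth Assessment Report."  # stub tool

def react_loop(question, max_steps=5):
    transcript = f"Question: {question}"
    for _ in range(max_steps):
        step = model(transcript)          # Thought + Action
        transcript += "\n" + step
        action = step.split("Action: ")[1]
        if action.startswith("finish["):
            return action[len("finish["):-1]
        observation = search(action[len("search["):-1])
        transcript += f"\nObservation: {observation}"  # fed back to the model
    return "no answer"

answer = react_loop("What is the latest IPCC report?")
print(answer)
```

The growing transcript is what makes the decision-making explicit and verifiable: every thought, action, and observation is recorded in order.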
Self-Reflection
The ability of an AI agent to evaluate its own outputs, decisions, or reasoning processes, and make improvements based on this evaluation. Self-reflection enables agents to catch errors, refine strategies, and improve performance over time.
Example: A writing assistant that generates a draft, then critically reviews its own work by evaluating clarity, coherence, accuracy, and style before producing an improved version based on its self-critique.
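The draft-critique-revise pattern is a loop that terminates when the critic finds nothing to fix. In practice all three roles would be model calls; here they are deterministic stubs so the control flow is visible:

```python
def generate_draft(topic):
    # Stub "model" producing a draft with a deliberate typo.
    return f"teh quick overview of {topic}"

def critique(text):
    # Stub critic: flags a known misspelling; a real critic is a model call.
    return ["replace 'teh' with 'the'"] if "teh" in text else []

def revise(text, issues):
    for issue in issues:
        if "teh" in issue:
            text = text.replace("teh", "the")
    return text

def write_with_reflection(topic, max_rounds=3):
    draft = generate_draft(topic)
    for _ in range(max_rounds):
        issues = critique(draft)
        if not issues:
            break  # the critic is satisfied
        draft = revise(draft, issues)
    return draft

print(write_with_reflection("AI agents"))
```

The `max_rounds` cap matters: without it, a critic that always finds something would loop forever.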
System Prompt
A set of instructions or context provided to a language model that persists across the entire interaction, establishing the model's role, constraints, and capabilities. The system prompt serves as a foundation for the agent's behavior and is typically not visible to end users.
Example: A system prompt for a financial advisor agent might include instructions to follow regulatory guidelines, avoid making specific investment recommendations, explain concepts clearly, and maintain a professional tone throughout the conversation.
Task Decomposition
The process of breaking down complex tasks into simpler, manageable subtasks that can be addressed individually. Task decomposition is essential for planning and tackling problems that are too large or complex to solve in a single step.
Example: An agent decomposing the task "create a market analysis report" into subtasks like "identify key competitors," "gather market size data," "analyze pricing trends," "identify customer segments," and "summarize findings and recommendations."
Tool Use
The ability of an AI agent to interact with and utilize external functions, APIs, or services to accomplish tasks beyond its native capabilities. Tool use significantly expands what an agent can do by connecting it to specialized systems and data sources.
Example: A travel planning agent using tools to search flight databases, check hotel availability, look up weather forecasts, and access mapping services to create a comprehensive travel itinerary.
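A common implementation is a registry mapping tool names to functions, with descriptions the model can read when deciding which tool to call. The tool names and stub bodies below are illustrative, not from any specific framework:

```python
TOOLS = {}

def tool(name, description):
    # Decorator registering a function as an agent tool.
    def register(fn):
        TOOLS[name] = {"fn": fn, "description": description}
        return fn
    return register

@tool("flight_search", "Search flights between two cities.")
def flight_search(origin, destination):
    return f"flights from {origin} to {destination}"  # stub

@tool("weather", "Get the forecast for a city.")
def weather(city):
    return f"forecast for {city}"  # stub

def call_tool(name, **kwargs):
    # The agent harness dispatches the model's chosen tool call here.
    return TOOLS[name]["fn"](**kwargs)

print(call_tool("weather", city="Paris"))
```

The descriptions are typically serialized into the prompt (or a function-calling schema), so the registry serves both the model, which reads it, and the harness, which dispatches through it.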
Vector Database
A specialized database designed to store and efficiently query vector embeddings. Vector databases enable semantic search by finding entries that are conceptually similar rather than just matching keywords, making them essential components for knowledge retrieval in AI agents.
Example: Storing thousands of product description embeddings in a vector database to allow a shopping assistant to find semantically similar products when a customer describes what they're looking for in natural language.
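The core interface is add-and-query by vector similarity. A brute-force in-memory sketch (real vector databases use approximate nearest-neighbor indexes to stay fast at scale; the product names and two-dimensional "embeddings" are invented):

```python
import math

class TinyVectorStore:
    """Brute-force stand-in for a vector database."""

    def __init__(self):
        self.items = []  # (item_id, vector) pairs

    def add(self, item_id, vector):
        self.items.append((item_id, vector))

    def query(self, vector, k=2):
        # Rank every stored item by cosine similarity to the query vector.
        def cos(a, b):
            dot = sum(x * y for x, y in zip(a, b))
            return dot / (math.sqrt(sum(x * x for x in a)) *
                          math.sqrt(sum(x * x for x in b)))
        ranked = sorted(self.items, key=lambda it: cos(it[1], vector), reverse=True)
        return [item_id for item_id, _ in ranked[:k]]

store = TinyVectorStore()
store.add("red sneakers", [0.9, 0.1])   # made-up product embeddings
store.add("running shoes", [0.8, 0.3])
store.add("winter coat", [0.1, 0.9])
print(store.query([0.85, 0.2], k=2))
```

A query vector near the two shoe embeddings returns the shoes, not the coat, even though none of the words match: that is the semantic search described above.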
Workflow
A defined sequence of steps, processes, or operations that an agent follows to complete a task. Workflows provide structure to agent behavior and can incorporate decision points, loops, and parallel processes.
Example: A content moderation agent using a workflow that first classifies content type, then applies specific moderation rules based on the classification, flags potential violations, reviews edge cases, and finally takes appropriate action.
YAML Configuration
A human-readable data serialization format often used to configure AI agent behaviors, tool sets, workflows, and other aspects of agent systems. YAML configuration allows for easy modification of agent parameters without changing code.
Example: Using a YAML file to define a research agent's tools, API keys, memory configuration, maximum token usage, and response formatting preferences.
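A configuration along those lines might look like the following sketch; every key name is illustrative rather than tied to a specific framework, and secrets such as API keys are better referenced from environment variables than stored in the file:

```yaml
agent:
  name: research-agent
  model: gpt-4              # illustrative model name
  max_tokens: 4000
  api_key_env: OPENAI_API_KEY   # read the secret from the environment
  tools:
    - web_search
    - document_reader
  memory:
    type: conversation_buffer
    max_messages: 50
  response_format: markdown
```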
Zero-Shot Learning
The ability of language models to perform tasks without being given specific examples first. In agent development, zero-shot learning refers to providing instructions for a new task without accompanying examples, relying on the model's pre-trained capabilities to understand and execute the task.
Example: Asking an agent to "Analyze this customer feedback and categorize the sentiment as positive, negative, or neutral" without providing examples of how to categorize sentiment, relying on the model's pre-existing understanding of sentiment analysis.