LLM Agents

The AI Operating System

Create production-ready stateful agents in a single API call

Quickstart Get a demo

build stateful agents with letta

Agents API

View Docs

Build stateful agents with advanced memory and infinite context, using any model. Built-in persistence and memory management, with full support for custom tools and MCP.

ADE

Get Started

Use the Agent Development Environment (ADE) to visualize your agent's memory, reasoning steps, and tool calls. Observe, test, and edit your agent’s state in real time.

BACKED BY RESEARCH, Trusted by developers

Your Agents Need an OS: Today's AI agents struggle with fragmented context and limited memory. Just as computers need operating systems to manage resources, your agents need intelligent context management to unlock their full potential.

agent development environment (ADE)

Agents as APIs

Letta agents are exposed as a REST API endpoints, ready to be integrated into your applications - auth and identities included.

Stateful Agents

Letta persists all state automatically in a model-agnostic representation. Move agents between LLM providers without losing their memories.

Backed by Research

Letta manages context and memory with techniques designed by AI PhDs from UC Berkeley, including the creators of MemGPT.

Framework agnostic

Program your agents and connect them to your applications through Letta’s Agents API, SDKs, and framework integrations.

View all integrations

from letta_client import Letta

client = Letta(token="LETTA_API_KEY")

agent_state = client.agents.create(
    model="openai/gpt-4.1",
    embedding="openai/text-embedding-3-small",
    memory_blocks=[
        {
          "label": "human",
          "value": "The human's name is Chad. They like vibe coding."
        },
        {
          "label": "persona",
          "value": "My name is Sam, the all-knowing sentient AI."
        }
    ]
)

import { LettaClient } from '@letta-ai/letta-client'

const client = new LettaClient({ token: "LETTA_API_KEY" });

const agentState = await client.agents.create({
    model: "openai/gpt-4.1",
    embedding: "openai/text-embedding-3-small",
    memoryBlocks: [
        {
          label: "human",
          value: "The human's name is Chad. They like vibe coding."
        },
        {
          label: "persona",
          value: "My name is Sam, the all-knowing sentient AI."
        }
    ]
});

import { lettaCloud } from '@letta-ai/vercel-ai-sdk-provider';
import { generateText } from 'ai';

const { text } = await generateText({
  model: lettaCloud('your-agent-id'),
  prompt: 'Write a vegetarian lasagna recipe for 4 people.',
});

import { LettaClient } from '@letta-ai/letta-client'

const client = new LettaClient({ token: "LETTA_API_KEY" });

const agentState = await client.agents.create({
    model: "openai/gpt-4.1",
    embedding: "openai/text-embedding-3-small",
    memoryBlocks: [
        {
          label: "human",
          value: "The human's name is Chad. They like vibe coding."
        },
        {
          label: "persona",
          value: "My name is Sam, the all-knowing sentient AI."
        }
    ]
});

import { LettaClient } from '@letta-ai/letta-client'

const client = new LettaClient({ token: "LETTA_API_KEY" });

const agentState = await client.agents.create({
    model: "openai/gpt-4.1",
    embedding: "openai/text-embedding-3-small",
    memoryBlocks: [
        {
          label: "human",
          value: "The human's name is Chad. They like vibe coding."
        },
        {
          label: "persona",
          value: "My name is Sam, the all-knowing sentient AI."
        }
    ]
});

Production-ready, proven at scale

Scale from prototypes to millions of agents, all on the same stack. Ensure that your data, state, and agent memories are safe from vendor lock-in.

Get started today

Try Letta Cloud Get a demo

PLAY DEMO

GitHub

Contribute to the Letta open source

Visit our GitHub

Documentation

Learn how to build stateful agents

Read our docs

Discord

Join our developer Discord community

Join our Discord

Stay up to date with Letta

Latest articles

Letta

Jun 17, 2025

Stateful Voice Agents

Introducing Stateful Voice Agents: agents with persistent memory and low-latency response times needed for natural conversation.

Letta

May 29, 2025

Letta Leaderboard: Benchmarking LLMs on Agentic Memory

We're excited to announce the Letta Leaderboard, a comprehensive benchmark suite that evaluates how effectively LLMs manage agentic memory.

Letta

May 14, 2025

Memory Blocks: The Key to Agentic Context Management

Memory blocks offer an elegant abstraction for context window management. By structuring the context into discrete, functional units, we can give LLM agents more consistent, usable memory.

Around the web

Sleep Time Compute - AI That "Thinks" 24/7 (Breakthrough)

What if AI models could figure out answers to your questions BEFORE you even ask them? An incredible new research paper introduces "Sleep-Time Compute," allowing models to think while they aren't being used.

MemGPT with Real-life Example: Bridging the Gap Between AI and OS

In this tutorial, we discuss and show how to run MemGPT - an LLM with the potential for infinite context understanding.

MemGPT: Unlimited Memory without Token Constraints for Generation

MemGPT, standing for Memory-GPT, is a system devised to enhance the performance of Large Language Models (LLMs) by introducing a more advanced memory management scheme, helping to overcome the challenges posed by fixed context windows. Below are some of the key features of MemGPT:

The AI Operating System

Create production-ready stateful agents in a single API call

Agents API

ADE

Framework agnostic

Production-ready, proven at scale

Latest articles

Stateful Voice Agents

Letta Leaderboard: Benchmarking LLMs on Agentic Memory

Memory Blocks: The Key to Agentic Context Management

Around the web

Product

DEVELOPERS

Company

Newsletter