Introducing the Agent Development Environment
Watch a tutorial video on the Letta ADE on our YouTube channel!
We're excited to announce the Agent Development Environment (ADE), a visual development environment that brings unprecedented transparency to agent design and debugging. At its core, the ADE makes the once-opaque world of context windows and agent reasoning visible and manageable for developers.
What makes agents special? They go beyond traditional LLMs in two fundamental ways:
- Autonomy & Reasoning - Agents can autonomously take multiple actions and achieve complex tasks that require reasoning and multiple LLM invocations.
- State & Memory - Unlike standard LLMs that start fresh each time, agents can learn and grow from their interactions. An agent you've worked with before should remember important context and build on past conversations, just like a human colleague would.
Although features like reasoning, memory, and state are fundamental to agents, they are rarely transparently exposed to agent developers, making agents difficult to program and debug. And agents today are not easy to get right: they can get derailed, hallucinate, or run into context window overflows. These challenges are compounded by the "black box" nature of both LLM providers and agent frameworks. When something goes wrong, it's been nearly impossible to peek inside and understand why an agent made the decisions it did.
This is where the ADE comes in. Just as IDEs help developers understand what's happening inside their code, the ADE lets you see what's happening inside your agent's mind. We built it on a simple but powerful principle: designing great agents is all about designing great context windows. This means carefully crafting the system instructions, managing memory, choosing the right tools, and controlling how external data flows in. The ADE puts all these critical controls at your fingertips.
State & Context Management
The ADE puts context management front and center. As a developer, you get to shape each piece of the context window:
- The system prompt (the unchanging instructions that guide your agent)
- The tools your agent can use (which it can call on dynamically) along with their inputs and outputs
- The agent's core memory (editable memory that stays in-context)
For each of these components, you can control how much of the context window they are allocated (such as by setting character limits to sections of memory or tool responses).
As you chat with your agent, messages naturally fill up the remaining context window space. Letta handles this intelligently, using recursive summarization and message pruning to automatically limit the context size.
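To make the pruning idea concrete, here is a minimal sketch of recursive summarization. It is not Letta's implementation: a character budget stands in for the model's token budget, and `summarize` is any function that condenses a list of messages into one short string.

```python
# A toy sketch of recursive summarization: when the history exceeds the
# budget, the oldest messages are evicted and replaced by a single summary
# message, repeating until the history fits.

def compress_history(messages, summarize, budget=500):
    """Summarize the oldest messages until the history fits the budget."""
    def size(msgs):
        return sum(len(m["content"]) for m in msgs)

    while size(messages) > budget and len(messages) > 1:
        # Evict the oldest half of the history and replace it with a summary.
        cutoff = max(1, len(messages) // 2)
        evicted, kept = messages[:cutoff], messages[cutoff:]
        summary = {"role": "system", "content": summarize(evicted)}
        messages = [summary] + kept
    return messages

history = [{"role": "user", "content": "x" * 100} for _ in range(10)]
history = compress_history(history, lambda msgs: f"summary of {len(msgs)} messages")
```

In a real agent, `summarize` would itself be an LLM call, and the summary of summaries is what makes the compression recursive.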
Letta allows you to impose an artificial context window limit lower than the model’s actual context window limit. Just like humans, giving an agent too much to think about at once can actually make it less effective - not to mention slower and more expensive to run. While models like Claude technically support massive 200k token contexts, maxing out that space isn't always the best move.
Memory Architecture
Think of agent memory like a brain with two parts: what's actively in mind (in-context memory) and what's stored away for later (external memory). We've built on our original ideas with MemGPT for Letta’s memory management, carving out a dedicated space in the context window for editable in-context memory (which we call “Core Memory”).
In-Context Memory: Core Memory
Core memory works through "Blocks" - each representing a persisted, editable section of the in-context memory. These blocks live in the "Core Memory" section of the context window, and your agent can update them using the core_memory_append and core_memory_replace tools.
Most developers use core memory with two blocks: a “human” block containing memories about the human, and a “persona” block with memories about the agent itself.
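Here is a small sketch of what a block with these editing semantics might look like. The class name, character limit, and error handling are illustrative, not Letta's implementation - the point is that each block is a labeled, size-bounded region the agent edits through explicit tools.

```python
# A hypothetical core memory block with append/replace semantics and a
# character budget, mirroring the core_memory_append / core_memory_replace
# tools described above (names reused for illustration only).

class MemoryBlock:
    def __init__(self, label, value="", limit=2000):
        self.label = label
        self.value = value
        self.limit = limit  # character budget for this section of the context window

    def core_memory_append(self, content):
        updated = (self.value + "\n" + content).strip()
        if len(updated) > self.limit:
            raise ValueError(f"block '{self.label}' would exceed {self.limit} chars")
        self.value = updated

    def core_memory_replace(self, old, new):
        if old not in self.value:
            raise ValueError(f"'{old}' not found in block '{self.label}'")
        self.value = self.value.replace(old, new)

# The conventional two-block setup:
human = MemoryBlock("human", "Name: Sarah. Prefers concise answers.")
persona = MemoryBlock("persona", "I am a helpful research assistant.")
human.core_memory_append("Works as a data engineer.")
human.core_memory_replace("Sarah", "Sarah Chen")
```

The replace-by-substring design is deliberate: it forces the agent to quote what it is overwriting, which makes memory edits auditable.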
External Memory: Archival and Recall Memory
Letta automatically keeps track of everything in the conversation history as "recall memory" - your agent can search through this with conversation_search. For more free-form storage, there's "archival memory" - a vector database your agent can write to using archival_memory_insert. As a developer, you can also add data here directly or upload files through a data source. When your agent needs this archived information, it can find it with archival_memory_search.
We make sure your agent always knows what's available by keeping memory statistics right in the context window. This way, it knows when it should dig into these memory banks for more information.
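To show the insert/search pattern in miniature: real deployments use learned embeddings and a vector database, but a toy bag-of-words cosine similarity is enough to illustrate how an agent retrieves from archival memory (the class and scoring below are illustrative, not Letta's implementation).

```python
# A toy archival memory: passages go in via insert, and search ranks them
# by bag-of-words cosine similarity against the query.
from collections import Counter
from math import sqrt

class ArchivalMemory:
    def __init__(self):
        self.passages = []

    def archival_memory_insert(self, text):
        self.passages.append(text)

    def archival_memory_search(self, query, top_k=3):
        q = Counter(query.lower().split())
        def score(text):
            t = Counter(text.lower().split())
            dot = sum(q[w] * t[w] for w in q)
            norm = sqrt(sum(v * v for v in q.values())) * sqrt(sum(v * v for v in t.values()))
            return dot / norm if norm else 0.0
        return sorted(self.passages, key=score, reverse=True)[:top_k]

archive = ArchivalMemory()
archive.archival_memory_insert("The project deadline is Friday.")
archive.archival_memory_insert("Sarah prefers coffee over tea.")
archive.archival_memory_insert("The weather today is sunny.")
results = archive.archival_memory_search("project deadline")
```

Swapping the scoring function for real embedding similarity gives you the semantic search that archival memory actually performs.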
Transparent Reasoning
One of our core principles is that agent reasoning should never be a black box. That's why Letta requires all agents to show their work, regardless of whether they're powered by Gemini, OpenAI, or Anthropic.
We've designed Letta so agents must be explicit about what they're thinking versus what they're saying to users. When an agent wants to communicate with a user, it has to deliberately use the send_message(...) tool. This separation lets agents understand that they can think and act autonomously, and that communicating with the user is an explicit action.
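A small sketch makes the separation tangible. The shape of the step payload below is illustrative, not Letta's exact schema - the point is that reasoning is always present and developer-visible, while only an explicit send_message call ever reaches the user.

```python
# Routing one agent step: reasoning is logged for the developer, and text
# reaches the user only through an explicit send_message tool call.

def run_step(step, user_outbox):
    """Route one agent step: log reasoning, surface only send_message to the user."""
    print(f"[reasoning] {step['reasoning']}")  # visible to developers, not users
    call = step.get("tool_call")
    if call and call["name"] == "send_message":
        user_outbox.append(call["arguments"]["message"])
    # Any other tool call (memory edits, search, ...) stays internal.

outbox = []
run_step({"reasoning": "First update memory; the user is not addressed yet.",
          "tool_call": {"name": "core_memory_append",
                        "arguments": {"content": "User's name is Sam."}}}, outbox)
run_step({"reasoning": "Now reply to the user.",
          "tool_call": {"name": "send_message",
                        "arguments": {"message": "Nice to meet you, Sam!"}}}, outbox)
```

Notice that the first step produces reasoning and a memory edit but nothing user-visible - exactly the autonomy the send_message convention is designed to enable.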
Tool Integration
Tools are what allow agents to take actions in the real world and achieve complex tasks through tool composition. Letta comes with essential tools built in (like send_message and memory management tools), and you can add your own custom tools or tap into 7,000+ tools from Composio.
Tools are just as much a part of your context window as your prompts: designing your tool arguments, returns, and docstrings are a critical part of designing your context window. Be careful not to give your agent too many tools at once. Just like having too much context, too many tool options can make an agent indecisive. (OpenAI has some great insights on this in their function-calling guidelines.)
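One way to see why docstrings and signatures are part of the context window: when a Python function becomes a tool, its name, docstring, and parameter types are typically extracted into a schema the LLM reads. The extractor below is a simplified illustration (OpenAI-style in spirit, not Letta's exact format).

```python
# A sketch of turning a Python function into an LLM-facing tool schema:
# the docstring becomes the tool description the model reasons over.
import inspect

def to_tool_schema(fn):
    sig = inspect.signature(fn)
    return {
        "name": fn.__name__,
        "description": inspect.getdoc(fn),
        "parameters": {name: p.annotation.__name__
                       for name, p in sig.parameters.items()},
    }

def get_weather(city: str) -> str:
    """Return the current weather for a city."""
    return f"Sunny in {city}"  # hypothetical example tool

schema = to_tool_schema(get_weather)
```

Every word of that description lands in the agent's context, which is why a vague docstring produces a confused agent.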
Tool Development and Debugging
Want to create your own tools? The Letta tool editor lets you write and test Python code directly in the ADE. When editing tools, you can try running them with mock inputs and view the resulting tool response, logs, and error messages.
Tool updates are reflected immediately in your agent, so you can iterate quickly without having to restart or redeploy a service.
Pre-built Tool Ecosystem
Our Composio integration gives you access to thousands of pre-built tools right out of the box. For tools that need authentication, you'll just need to create a Composio account and connect the services you want to use. Check out our Composio guide for the full details: https://docs.letta.com/guides/agents/composio
Production Deployment
The ADE is built for real-world applications. The simulator has three modes:
- Debug mode when you need to see everything
- Interactive mode for a balance of visibility and usability
- Simple mode to preview exactly what your end users will see
Interestingly, many developers we've talked to actually like showing their users the agent's reasoning process - it can be fascinating to watch how the agent thinks! Through our REST API, you can access all this data (reasoning, tool calls, messages) by specifying the agent_id.
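In application code, the three modes above amount to filtering the same message stream differently. The message type names and filtering rules below are illustrative (not Letta's exact API payload), but they show the pattern an app would apply to the per-agent data fetched via the REST API.

```python
# A sketch of mode-style filtering over an agent's message stream:
# debug shows everything, interactive hides raw tool calls, simple
# shows only what the end user would see.

def visible_messages(messages, mode):
    if mode == "debug":
        return messages  # reasoning, tool calls, and replies
    if mode == "interactive":
        return [m for m in messages if m["type"] in ("reasoning", "assistant")]
    if mode == "simple":
        return [m for m in messages if m["type"] == "assistant"]
    raise ValueError(f"unknown mode: {mode}")

stream = [
    {"type": "reasoning", "text": "The user greeted me; I should respond warmly."},
    {"type": "tool_call", "text": "send_message(message='Hello!')"},
    {"type": "assistant", "text": "Hello!"},
]
```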
Conclusion
We built the ADE because we needed it ourselves - we wanted to see inside our agents' heads while we were building and researching them. We hope it helps you create more powerful, reliable agents through better control over tools, memory, and context windows.
During our public beta, you can try out the ADE at https://app.letta.com. Connect it to your local Letta Server or a remote self-hosted deployment (setup instructions included). If you have early access to Letta Cloud, you can create and deploy agents without running your own server.
We're working on making the ADE even more accessible, with plans for native, offline operation that won't require login or internet access. To stay in the loop:
- Join our growing Discord community
- Keep an eye on our docs at https://docs.letta.com
- Drop us feedback on our Discord or at contact@letta.com
And for those who made it to the end of this post, a sneak preview of what’s coming next: