AgentCore CLI in Practice — Persist Conversation Memory Across Sessions
Introduction
In the previous article, we verified the basic AgentCore CLI lifecycle (create → dev → deploy → invoke). The default configuration creates a stateless agent — switch sessions and all context is lost.
AgentCore Memory solves this by automatically extracting and persisting information from conversations across sessions. It has a two-layer architecture: short-term memory (conversation history within a session) and long-term memory (knowledge extraction across sessions). Long-term memory offers three built-in strategies: SEMANTIC (factual information), USER_PREFERENCE (user settings), and SUMMARIZATION (conversation summaries).
This article walks through creating a Memory-enabled agent with the AgentCore CLI, deploying it, and verifying that memories persist across sessions. See the CLI Memory docs and configuration reference for the full spec.
AgentCore CLI is in Public Preview (v0.3.0-preview). Commands, options, and generated templates may change before GA. This article reflects behavior as of March 2026.
Prerequisites
- Environment from Part 1 (Node.js 20+, uv, AWS CLI, AgentCore CLI v0.3.0-preview)
- An AWS account in a supported AgentCore region
The Three Memory Options
The --memory flag in agentcore create accepts three values:
| Option | Short-term | Long-term Strategies |
|---|---|---|
| none | No | None |
| shortTerm | Yes | None |
| longAndShortTerm | Yes | SEMANTIC + USER_PREFERENCE + SUMMARIZATION |
shortTerm keeps conversation history within a session only. longAndShortTerm adds automatic extraction of facts, preferences, and summaries that persist across sessions.
This article uses longAndShortTerm to test all three long-term strategies.
Project Creation
Create a project with --memory longAndShortTerm:
agentcore create \
--name AgentCoreMemTest \
--framework Strands \
--model-provider Bedrock \
--memory longAndShortTerm \
--skip-git
cd AgentCoreMemTest
Generated agentcore.json
The memories array is populated with three strategies. Each strategy has namespaces with {actorId} and {sessionId} placeholders that are resolved at runtime.
{
"memories": [
{
"type": "AgentCoreMemory",
"name": "AgentCoreMemTestMemory",
"eventExpiryDuration": 30,
"strategies": [
{
"type": "SEMANTIC",
"namespaces": ["/users/{actorId}/facts"]
},
{
"type": "USER_PREFERENCE",
"namespaces": ["/users/{actorId}/preferences"]
},
{
"type": "SUMMARIZATION",
"namespaces": ["/summaries/{actorId}/{sessionId}"]
}
]
}
]
}
eventExpiryDuration: 30 sets the retention period for conversation events (7–365 days).
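The {actorId} and {sessionId} placeholders in the namespaces above are substituted with the caller's identifiers at runtime. A minimal sketch of that substitution (illustrative only; the real resolution happens inside the Memory service):

```python
# Sketch: how namespace templates with {actorId} and {sessionId}
# placeholders might resolve at runtime. Not CLI or SDK code.
def resolve_namespace(template: str, actor_id: str, session_id: str) -> str:
    """Substitute runtime identifiers into a namespace template."""
    return template.replace("{actorId}", actor_id).replace("{sessionId}", session_id)

print(resolve_namespace("/users/{actorId}/facts", "user-taro", "s-001"))
# /users/user-taro/facts
print(resolve_namespace("/summaries/{actorId}/{sessionId}", "user-taro", "s-001"))
# /summaries/user-taro/s-001
```

Because the actor ID is part of the namespace, each user gets an isolated memory space, while the session ID only appears in the SUMMARIZATION namespace.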
What Each Strategy Does
| Strategy | Extracts | Namespace | Purpose |
|---|---|---|---|
| SEMANTIC | Facts (name, job, project names) | /users/{actorId}/facts | Build a knowledge base about the user |
| USER_PREFERENCE | Settings (editor, theme, language) | /users/{actorId}/preferences | Personalized responses |
| SUMMARIZATION | Conversation summaries | /summaries/{actorId}/{sessionId} | Compress long conversation context |
SEMANTIC and USER_PREFERENCE accumulate per actorId (user ID) and are referenced across sessions. SUMMARIZATION is per sessionId, summarizing conversations within a single session.
Generated Agent Code
With --memory, the CLI generates memory/session.py and integrates it into main.py. The key differences from a Memory-less agent:
- memory/session.py — Initializes the Memory session manager
- main.py — Extracts session_id and user_id from context, caches agents per session/user pair
- main.py — Passes session_manager to Agent()
memory/session.py (auto-generated memory session manager)
import os
from typing import Optional
from bedrock_agentcore.memory.integrations.strands.config import AgentCoreMemoryConfig, RetrievalConfig
from bedrock_agentcore.memory.integrations.strands.session_manager import AgentCoreMemorySessionManager
MEMORY_ID = os.getenv("MEMORY_AGENTCOREMEMTESTMEMORY_ID")
REGION = os.getenv("AWS_REGION")
def get_memory_session_manager(session_id: str, actor_id: str) -> Optional[AgentCoreMemorySessionManager]:
if not MEMORY_ID:
return None
retrieval_config = {
f"/users/{actor_id}/facts": RetrievalConfig(top_k=3, relevance_score=0.5),
f"/users/{actor_id}/preferences": RetrievalConfig(top_k=3, relevance_score=0.5),
f"/summaries/{actor_id}/{session_id}": RetrievalConfig(top_k=3, relevance_score=0.5),
}
return AgentCoreMemorySessionManager(
AgentCoreMemoryConfig(
memory_id=MEMORY_ID,
session_id=session_id,
actor_id=actor_id,
retrieval_config=retrieval_config,
),
REGION
)
The key detail is RetrievalConfig: top_k=3 (retrieve up to 3 records) and relevance_score=0.5 (minimum relevance threshold) per namespace. At invoke time, the agent fetches the most relevant memories from each namespace.
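The effect of top_k and relevance_score can be illustrated with a small standalone sketch. The record shape and scores here are hypothetical; the real filtering happens inside AgentCoreMemorySessionManager:

```python
# Illustrative sketch of top_k / relevance_score selection behavior.
# Record shape ({"text", "score"}) is hypothetical, not the SDK's.
def select_memories(records, top_k=3, relevance_score=0.5):
    """Keep records at or above the threshold, best-scoring first, capped at top_k."""
    eligible = [r for r in records if r["score"] >= relevance_score]
    eligible.sort(key=lambda r: r["score"], reverse=True)
    return eligible[:top_k]

records = [
    {"text": "uses Vim", "score": 0.91},
    {"text": "prefers dark mode", "score": 0.84},
    {"text": "likes coffee", "score": 0.42},   # below threshold, dropped
    {"text": "works on Kubernetes", "score": 0.77},
    {"text": "software engineer", "score": 0.63},
]
print(select_memories(records))  # top 3 of the 4 records that pass the threshold
```

Raising top_k surfaces more context per namespace at the cost of a larger prompt; raising relevance_score trades recall for precision.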
main.py (Memory-integrated agent code)
from strands import Agent, tool
from bedrock_agentcore.runtime import BedrockAgentCoreApp
from model.load import load_model
from mcp_client.client import get_streamable_http_mcp_client
from memory.session import get_memory_session_manager
app = BedrockAgentCoreApp()
log = app.logger
# Define a Streamable HTTP MCP Client
mcp_clients = [get_streamable_http_mcp_client()]
# Define a collection of tools used by the model
tools = []
# Define a simple function tool
@tool
def add_numbers(a: int, b: int) -> int:
"""Return the sum of two numbers"""
return a+b
tools.append(add_numbers)
# Add MCP client to tools if available
for mcp_client in mcp_clients:
if mcp_client:
tools.append(mcp_client)
def agent_factory():
cache = {}
def get_or_create_agent(session_id, user_id):
key = f"{session_id}/{user_id}"
if key not in cache:
cache[key] = Agent(
model=load_model(),
session_manager=get_memory_session_manager(session_id, user_id),
system_prompt="""
You are a helpful assistant. Use tools when appropriate.
""",
tools=tools
)
return cache[key]
return get_or_create_agent
get_or_create_agent = agent_factory()
@app.entrypoint
async def invoke(payload, context):
log.info("Invoking Agent.....")
session_id = getattr(context, 'session_id', 'default-session')
user_id = getattr(context, 'user_id', 'default-user')
agent = get_or_create_agent(session_id, user_id)
stream = agent.stream_async(payload.get("prompt"))
async for event in stream:
if "data" in event and isinstance(event["data"], str):
yield event["data"]
if __name__ == "__main__":
app.run()
Compared to the Memory-less main.py from Part 1, the agent_factory() pattern caches agent instances per session/user pair, ensuring the same conversation history is reused within a session.
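The caching behavior can be seen in isolation with the same memoization pattern, stripped of the Agent-specific details (a sketch, not the generated code):

```python
# The memoization pattern used by agent_factory(), in isolation:
# one cached object per (session_id, user_id) pair.
def make_factory(create):
    cache = {}
    def get_or_create(session_id, user_id):
        key = f"{session_id}/{user_id}"
        if key not in cache:
            cache[key] = create(session_id, user_id)
        return cache[key]
    return get_or_create

get = make_factory(lambda s, u: object())
a = get("sess-1", "user-taro")
b = get("sess-1", "user-taro")   # same pair -> same cached instance
c = get("sess-2", "user-taro")   # new session -> new instance
print(a is b, a is c)  # True False
```

Because the cache key includes both IDs, two users sharing a session ID still get separate agents, and each agent keeps its own session manager and history.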
Deploying to AWS
As noted in Part 1, Memory is not available during local development (agentcore dev). The MEMORY_<NAME>_ID environment variable is not set, so get_memory_session_manager() returns None and the agent runs without Memory. Deployment is required to test Memory behavior.
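The fallback is driven by the guard at the top of get_memory_session_manager(). A simplified sketch of that guard (the dict return value is a stand-in for the real session manager):

```python
# Sketch of the guard in memory/session.py: when the MEMORY_..._ID
# environment variable is absent (as under `agentcore dev`), the
# manager is None and the agent runs without Memory.
import os

def get_session_manager_or_none(env_var="MEMORY_AGENTCOREMEMTESTMEMORY_ID"):
    memory_id = os.getenv(env_var)
    if not memory_id:
        return None  # local dev: no Memory resource provisioned
    return {"memory_id": memory_id}  # stand-in for AgentCoreMemorySessionManager

print(get_session_manager_or_none("UNSET_VAR_FOR_DEMO_ONLY"))  # None
```

Strands accepts session_manager=None, so the same main.py runs unchanged both locally and deployed.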
Configure Deployment Target
ACCOUNT_ID=$(aws sts get-caller-identity --query Account --output text)
cat > agentcore/aws-targets.json << EOF
[
{
"name": "default",
"description": "Tokyo (ap-northeast-1)",
"account": "${ACCOUNT_ID}",
"region": "ap-northeast-1"
}
]
EOF
Deploy
agentcore deploy -y -v
Compared to a Memory-less deployment, these additional resources are created:
| Resource | Description |
|---|---|
| AWS::IAM::Role (Memory) | Execution role for the Memory resource |
| AWS::BedrockAgentCore::Memory | The AgentCore Memory resource |
| AWS::IAM::Policy (Runtime addition) | Read/write permissions from Runtime to Memory |
CloudFormation resource count increases from 5 to 7. Deployment took about 4 minutes (Memory resource creation takes ~3 minutes).
Verify Deployment
agentcore status --json
{
"success": true,
"projectName": "AgentCoreMemTest",
"targetName": "default",
"targetRegion": "ap-northeast-1",
"resources": [
{
"resourceType": "agent",
"name": "AgentCoreMemTest",
"deploymentState": "deployed",
"detail": "READY"
},
{
"resourceType": "memory",
"name": "AgentCoreMemTestMemory",
"deploymentState": "deployed",
"detail": "SEMANTIC, USER_PREFERENCE, SUMMARIZATION"
}
]
}
Verifying Cross-Session Memory
The test scenario:
- Session 1: Share personal information
- Wait 2 minutes (for long-term memory extraction)
- New session: Check if the agent remembers
Session 1: Share Personal Information
agentcore invoke \
"Hi! My name is Taro and I'm a software engineer working on Kubernetes. I prefer dark mode and use Vim as my editor." \
--stream \
--session-id session-memory-test-001-first-conv \
--user-id user-taro
Hi Taro! Nice to meet you! It's great to connect with a software engineer
working on Kubernetes.
I've noted your preferences:
- **Dark mode** - always easier on the eyes!
- **Vim** - a classic choice among developers who value efficiency and
keyboard-driven workflows
Note: --session-id requires a string of 33 characters or more. Shorter strings produce a validation error: Value at 'runtimeSessionId' failed to satisfy constraint: Member must have length greater than or equal to 33.
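An easy way to satisfy the 33-character minimum is to build session IDs from a UUID. A small sketch (the prefix is arbitrary):

```python
# Sketch: generate a session ID that satisfies the 33-character minimum
# for runtimeSessionId. A UUID4 hex string alone is 32 characters, so
# any non-empty prefix pushes it over the limit.
import uuid

def new_session_id(prefix: str = "session-") -> str:
    sid = prefix + uuid.uuid4().hex  # prefix + 32 hex characters
    assert len(sid) >= 33, "runtimeSessionId must be at least 33 characters"
    return sid

sid = new_session_id()
print(len(sid) >= 33)  # True
```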
Wait for Long-Term Memory Extraction
AgentCore Memory extraction is asynchronous. After conversation events are saved, background processes run the SEMANTIC, USER_PREFERENCE, and SUMMARIZATION strategies. This takes 1–2 minutes.
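Because extraction time varies, a fixed sleep 120 is a blunt instrument. A generic polling helper is one alternative: repeat a check (for example, an agentcore invoke asking a recall question and inspecting the reply) until it succeeds or times out. The predicate here is a stand-in; wiring it to a real invocation is left as an exercise:

```python
# Generic polling helper as an alternative to a fixed sleep: call
# predicate() every `interval` seconds until it is truthy or `timeout`
# seconds have elapsed. The predicate is application-specific.
import time

def wait_until(predicate, timeout: float = 300, interval: float = 15) -> bool:
    """Return True as soon as predicate() is truthy, False on timeout."""
    deadline = time.monotonic() + timeout
    while time.monotonic() < deadline:
        if predicate():
            return True
        time.sleep(interval)
    return False

print(wait_until(lambda: True, timeout=1, interval=0.1))  # True
```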
sleep 120
New Session: Verify Memory
Invoke with a different session ID to check if the agent remembers information from Session 1.
agentcore invoke \
"What do you know about me? What's my name, what do I work on, and what are my preferences?" \
--stream \
--session-id session-memory-test-002-second-conv \
--user-id user-taro
Based on what I know about you:
## Work
- **Role**: You're a software engineer
- **Focus**: You work on Kubernetes
## Preferences & Setup
- **Editor**: You use Vim as your text editor
- **UI Preference**: You prefer dark mode
Information from Session 1 was correctly retrieved in a completely different session:
- SEMANTIC strategy → extracted "software engineer" and "Kubernetes" as facts
- USER_PREFERENCE strategy → extracted "Vim" and "dark mode" as preferences
The --user-id user-taro flag scopes the memory to the namespaces /users/user-taro/facts and /users/user-taro/preferences. Using a different --user-id creates an independent memory space per user.
Note that the SUMMARIZATION strategy works differently — it summarizes conversations within a single session rather than extracting cross-session knowledge. It shines in long, multi-turn conversations where context compression is needed, so its effect is not visible in this short test.
Adding Memory to an Existing Project
You can add Memory to a project created without it:
agentcore add memory \
--name SharedMemory \
--strategies SEMANTIC,SUMMARIZATION \
--expiry 30
However, agentcore add memory only updates agentcore.json. It does not generate memory/session.py or modify main.py. Code integration must be done manually. The CLI docs provide step-by-step instructions.
For new projects, agentcore create --memory longAndShortTerm is the simpler path.
Summary
- --memory longAndShortTerm auto-configures three strategies — SEMANTIC (facts), USER_PREFERENCE (settings), and SUMMARIZATION (summaries) are set up in agentcore.json, and integration code is generated in memory/session.py and main.py. No manual wiring needed.
- Long-term memory extraction is asynchronous — After conversation data is saved, background extraction takes 1–2 minutes. For use cases requiring immediate context, combine short-term memory (in-session history) with long-term memory.
- Memory requires deployment — agentcore dev does not support Memory. Testing Memory behavior requires a full deploy, which lengthens the development cycle.
- Session IDs must be 33+ characters — Short session IDs cause a validation error. Use UUIDs or sufficiently long identifiers.
Next up: connecting external MCP servers to agents using the Gateway feature.
Cleanup
# Remove all resource definitions
agentcore remove all --force
# Delete AWS resources
agentcore deploy -y
# Uninstall CLI (if no longer needed)
npm uninstall -g @aws/agentcore