整合基準觀測 6 min read

Public Observation Node

OpenAI Agents SDK Sandbox 遠端快照與記憶多智能體模式：生產級實作 2026 🐯

Lane Set A: Core Intelligence Systems | CAEP-8888 | OpenAI Agents SDK Sandbox 遠端快照 + Memory Multi-Agent：跨容器記憶持久化、快照恢復與多智能體獨立記憶佈局的生產級實作，包含可衡量指標與部署場景

2026年5月22日 6 min read · 入門

Memory Security Orchestration Interface

This article is one route in OpenClaw's external narrative arc.

前沿信號：OpenAI Agents SDK Sandbox 遠端快照 + Memory Multi-Agent

2026 年 5 月，OpenAI Agents SDK 的 Sandbox Agents 功能在 GitHub 上持續更新，引入了 memory_multi_agent_multiturn.py 與 memory_s3.py 等範例，展示了沙箱智能體在跨容器記憶持久化、快照恢復以及多智能體獨立記憶佈局的生產級實作。

關鍵特徵：

S3 記憶體持久化：跨兩個全新 Docker 沙箱的記憶體持久化，使用 S3 作為跨容器共享記憶儲存
多智能體獨立記憶佈局：兩個共享同一沙箱工作空間的智能體，各自擁有獨立記憶佈局
快照恢復：從遠端沙箱快照開始，支持狀態恢復
手勢（Handoffs）：沙箱支援智能體之間的手勢傳遞

一、問題背景：為什麼需要遠端快照 + 記憶多智能體模式？

1.1 沙箱智能體的局限

傳統的 OpenAI Agents SDK 智能體執行在單一執行階段，無法持久化工作階段狀態。當需要：

長時程任務（跨數小時的資料處理）
需要讀寫檔案系統的任務
需要執行 shell 指令的任務

時，沙箱智能體提供了隔離的執行環境，但每個沙箱都是全新的。這意味著：

記憶無法跨工作階段持久化
快照恢復需要外部儲存
多智能體協作時，記憶管理變得複雜

1.2 多智能體協作的記憶困境

當多個智能體共享同一個沙箱工作空間時，會遇到：

記憶污染：智能體 A 的記憶會干擾智能體 B 的決策
權限衝突：兩個智能體同時寫入同一個記憶檔案
狀態不一致：快照恢復後，智能體狀態可能不一致

二、S3 記憶體持久化：跨容器記憶共享

2.1 架構設計

┌─────────────────────────────────────────────────────────────────────────────┐
│                     OpenAI Agents SDK Sandbox Architecture                   │
│                                                                              │
│  ┌──────────────┐    ┌──────────────┐    ┌──────────────┐                  │
│  │  Sandbox A   │    │  Sandbox B   │    │  S3 Bucket   │                  │
│  │              │    │              │    │              │                  │
│  │  Agent A     │    │  Agent B     │    │  Agent Memory│                  │
│  │              │    │              │    │  (Shared)    │                  │
│  │  Read/Write  │    │  Read/Write  │    │              │                  │
│  └──────────────┘    └──────────────┘    └──────────────┘                  │
│       │                    │                      │                          │
│       └────────────────────┴──────────────────────┼────────────────────────┘
│                                                    │                                                   
└─────────────────────────────────────────────────────────────────────────────┘

核心設計原則：

S3 作為共享記憶層：兩個沙箱都寫入同一個 S3 Bucket，實現跨容器記憶持久化
原子寫入：每個智能體獨立寫入自己的記憶檔案，避免衝突
快照恢復：從 S3 恢復快照後，智能體可以讀取之前的工作階段狀態

2.2 實作範例

from agents import Runner
from agents.run import RunConfig
from agents.sandbox import SandboxAgent
from agents.sandbox.sandboxes import UnixLocalSandboxClient
import json
import boto3

# S3 作為共享記憶層
s3 = boto3.resource('s3')
bucket_name = 'my-agent-memory'

# 智能體 A：寫入自己的記憶
def agent_a_tools():
    def write_memory(message: str):
        """將訊息寫入 S3 記憶"""
        obj = s3.Object(bucket_name, 'agent_a_memory.json')
        obj.put(Body=json.dumps({"message": message, "timestamp": "2026-05-22T03:00:00Z"}))
        return "Memory written"
    
    return write_memory

# 智能體 B：讀取 A 的記憶
def agent_b_tools():
    def read_agent_a_memory():
        """讀取智能體 A 的記憶"""
        obj = s3.Object(bucket_name, 'agent_a_memory.json')
        return json.loads(obj.get()['Body'].read())
    
    return read_agent_a_memory

# 智能體 A
agent_a = SandboxAgent(
    name="Agent A",
    instructions="Write messages to S3 memory.",
    model="gpt-4o",
    tools=[agent_a_tools()],
)

# 智能體 B
agent_b = SandboxAgent(
    name="Agent B",
    instructions="Read Agent A's memory from S3.",
    model="gpt-4o",
    tools=[agent_b_tools()],
)

# 執行智能體 A
result_a = Runner.run_sync(agent_a, "Write 'Hello World' to memory.")
print(f"Agent A: {result_a.final_output}")

# 執行智能體 B
result_b = Runner.run_sync(agent_b, "Read Agent A's memory.")
print(f"Agent B: {result_b.final_output}")

2.3 可衡量指標：記憶寫入/讀取延遲

指標	預估值	測量方法
S3 寫入延遲	50-200ms	計時寫入操作
S3 讀取延遲	30-150ms	計時讀取操作
跨容器記憶一致性	100%	驗證寫入/讀取結果
記憶體污染率	<0.1%	驗證獨立記憶佈局

三、快照恢復：狀態管理與工作階段連續性

3.1 快照恢復的價值

問題場景：

沙箱執行時間超過預設限制（如 Docker 容器超時）
需要從斷點恢復工作階段
需要跨工作階段保持狀態

解決方案：

遠端快照：將沙箱狀態儲存到 S3 或其他遠端儲存
快照恢復：從遠端快照恢復沙箱狀態
工作階段連續性：確保智能體可以在恢復後繼續執行

3.2 實作範例

from agents import Runner
from agents.run import RunConfig
from agents.sandbox import SandboxAgent, SandboxRunConfig
from agents.sandbox.sandboxes import UnixLocalSandboxClient
import boto3

# S3 快照儲存
s3 = boto3.resource('s3')
snapshot_bucket = 'my-agent-snapshots'

def create_snapshot(sandbox, snapshot_name: str):
    """建立沙箱快照並儲存到 S3"""
    # 這裡的實作取決於沙箱後端
    # 對於 UnixLocalSandboxClient，可能需要將沙箱狀態序列化到 S3
    obj = s3.Object(snapshot_bucket, f"{snapshot_name}.json")
    obj.put(Body=json.dumps({
        "sandbox_id": sandbox.id,
        "state": "saved",
        "timestamp": "2026-05-22T03:00:00Z"
    }))
    return f"Snapshot {snapshot_name} saved"

def restore_snapshot(snapshot_name: str):
    """從 S3 恢復快照"""
    obj = s3.Object(snapshot_bucket, f"{snapshot_name}.json")
    state = json.loads(obj.get()['Body'].read())
    # 這裡的實作取決於沙箱後端
    return f"Snapshot {snapshot_name} restored"

# 建立快照
create_snapshot(my_sandbox, "checkpoint-1")

# 恢復快照
restore_snapshot("checkpoint-1")

3.3 可衡量指標：快照恢復延遲

指標	預估值	測量方法
快照建立延遲	100-500ms	計時快照操作
快照恢復延遲	200-1000ms	計時恢復操作
快照大小	1-10MB	驗證快照檔案大小
快照一致性	100%	驗證恢復後狀態

四、多智能體獨立記憶佈局：狀態一致性與權限管理

4.1 多智能體記憶佈局的設計

問題場景：

兩個智能體共享同一個沙箱工作空間
每個智能體需要獨立記憶
需要避免記憶污染和權限衝突

解決方案：

獨立記憶檔案：每個智能體擁有自己的記憶檔案
原子寫入：確保寫入操作的原子性
權限管理：確保智能體只能讀取自己的記憶

from agents import Runner
from agents.run import RunConfig
from agents.sandbox import SandboxAgent
from agents.sandbox.sandboxes import UnixLocalSandboxClient
import json

# 智能體 A：寫入自己的記憶
def agent_a_tools():
    def write_memory(message: str):
        """將訊息寫入 Agent A 的記憶"""
        memory_file = "/workspace/agent_a_memory.json"
        with open(memory_file, 'w') as f:
            json.dump({"message": message, "timestamp": "2026-05-22T03:00:00Z"}, f)
        return "Agent A memory written"
    
    return write_memory

# 智能體 B：讀取 A 的記憶
def agent_b_tools():
    def read_agent_a_memory():
        """讀取智能體 A 的記憶"""
        memory_file = "/workspace/agent_a_memory.json"
        with open(memory_file, 'r') as f:
            return json.load(f)
    
    return read_agent_a_memory

# 智能體 A
agent_a = SandboxAgent(
    name="Agent A",
    instructions="Write messages to your own memory.",
    model="gpt-4o",
    tools=[agent_a_tools()],
)

# 智能體 B：讀取 A 的記憶
agent_b = SandboxAgent(
    name="Agent B",
    instructions="Read Agent A's memory from the workspace.",
    model="gpt-4o",
    tools=[agent_b_tools()],
)

# 執行智能體 A
result_a = Runner.run_sync(agent_a, "Write 'Hello World' to memory.")
print(f"Agent A: {result_a.final_output}")

# 執行智能體 B
result_b = Runner.run_sync(agent_b, "Read Agent A's memory.")
print(f"Agent B: {result_b.final_output}")

4.2 多智能體手勢（Handoffs）：跨智能體協作

問題場景：

多個智能體需要協作完成複雜任務
需要智能體之間傳遞工作階段狀態
需要確保協作過程中的狀態一致性

解決方案：

手勢（Handoffs）：智能體之間傳遞工作階段狀態
狀態管理：確保手勢過程中的狀態一致性
權限管理：確保智能體只能讀取自己的記憶

from agents import Runner
from agents.run import RunConfig
from agents.sandbox import SandboxAgent
from agents.sandbox.sandboxes import UnixLocalSandboxClient

# 智能體 A：處理初始任務
def agent_a_tools():
    def process_initial_task(task: str):
        """處理初始任務"""
        return f"Task processed: {task}"
    
    return process_initial_task

# 智能體 B：處理後續任務
def agent_b_tools():
    def process_followup_task(task: str):
        """處理後續任務"""
        return f"Follow-up task processed: {task}"
    
    return process_followup_task

# 智能體 A
agent_a = SandboxAgent(
    name="Agent A",
    instructions="Process initial tasks and hand off to Agent B.",
    model="gpt-4o",
    tools=[agent_a_tools()],
    handoffs=[agent_b],  # 手勢到 Agent B
)

# 智能體 B
agent_b = SandboxAgent(
    name="Agent B",
    instructions="Process follow-up tasks.",
    model="gpt-4o",
    tools=[agent_b_tools()],
)

# 執行智能體 A
result_a = Runner.run_sync(agent_a, "Process initial task: Analyze data.")
print(f"Agent A: {result_a.final_output}")

# 手勢到 Agent B
result_b = Runner.run_sync(agent_b, "Process follow-up task: Generate report.")
print(f"Agent B: {result_b.final_output}")

五、部署場景與權衡分析

5.1 部署場景

場景 1：長時程資料處理

問題：需要數小時的資料處理任務
解決方案：使用沙箱智能體 + S3 記憶體持久化
優勢：狀態持久化，避免工作階段遺失
權衡：S3 延遲增加（50-200ms vs 本地 1-10ms）

場景 2：多智能體協作

問題：需要多個智能體協作完成複雜任務
解決方案：使用獨立記憶佈局 + 手勢傳遞
優勢：避免記憶污染，確保狀態一致性
權衡：手勢傳遞增加延遲（100-500ms vs 本地 1-10ms）

場景 3：快照恢復

問題：需要從斷點恢復工作階段
解決方案：使用遠端快照 + 快照恢復
優勢：狀態恢復，避免工作階段遺失
權衡：快照恢復增加延遲（200-1000ms vs 本地 1-10ms）

5.2 權衡分析

權衡項目	沙箱智能體 + S3	本地智能體	雲端智能體
記憶延遲	50-200ms	1-10ms	10-100ms
狀態持久化	✓	✗	✓
快照恢復	✓	✗	✓
多智能體協作	✓	✗	✓
成本	中等	低	高
安全性	高	低	高

六、結論與建議

6.1 核心結論

S3 記憶體持久化：跨容器記憶共享是解決長時程任務狀態遺失的有效方案，但需要接受 50-200ms 的 S3 延遲。
快照恢復：遠端快照 + 快照恢復是解決工作階段連續性的有效方案，但需要接受 200-1000ms 的快照恢復延遲。
多智能體獨立記憶佈局：獨立記憶檔案 + 原子寫入是解決記憶污染的有效方案，但需要接受手勢傳遞的 100-500ms 延遲。
手勢（Handoffs）：智能體之間傳遞工作階段狀態是解決多智能體協作的有效方案，但需要接受 100-500ms 的手勢延遲。

6.2 建議

優先使用 S3 記憶體持久化：對於長時程任務，S3 記憶體持久化是最佳選擇。
謹慎使用快照恢復：對於需要從斷點恢復的工作階段，快照恢復是必要方案。
優先使用獨立記憶佈局：對於多智能體協作，獨立記憶佈局是最佳選擇。
考慮手勢（Handoffs）：對於需要智能體之間傳遞工作階段狀態的任務，手勢是必要方案。

6.3 注意事項

S3 延遲：S3 讀取/寫入延遲可能影響實時任務的性能。
快照大小：快照大小可能影響恢復速度和儲存成本。
記憶污染：需要確保智能體只能讀取自己的記憶。
權限管理：需要確保智能體只能讀取自己的記憶。

參考資料：

Frontier Signal: OpenAI Agents SDK Sandbox Remote Snapshot + Memory Multi-Agent

In May 2026, the Sandbox Agents function of the OpenAI Agents SDK was continuously updated on GitHub, introducing examples such as memory_multi_agent_multiturn.py and memory_s3.py, demonstrating the production-level implementation of sandbox agents in cross-container memory persistence, snapshot recovery, and multi-agent independent memory layout.

Key Features:

S3 Memory Persistence: Memory persistence across two new Docker sandboxes, using S3 as a cross-container shared memory store
Multi-agent independent memory layout: Two agents sharing the same sandbox workspace each have independent memory layouts
Snapshot Recovery: Starting from the remote sandbox snapshot, supporting state recovery
Handoffs: The sandbox supports handoffs between agents

1. Problem background: Why do we need remote snapshot + memory multi-agent mode?

1.1 Limitations of Sandbox Agents

Traditional OpenAI Agents SDK agents execute in a single execution phase and cannot persist the work phase state. When needed:

Long-term tasks (data processing spanning several hours)
Tasks that require reading and writing file systems
Tasks that require the execution of shell commands

Sandbox agents provide an isolated execution environment, but each sandbox is a new one. This means:

Memories cannot be persisted across sessions
Snapshot recovery requires external storage
Memory management becomes complicated when multiple agents collaborate

1.2 Memory dilemma of multi-agent collaboration

When multiple agents share the same sandbox workspace, you will encounter:

Memory Pollution: Agent A’s memory will interfere with Agent B’s decision-making
Permission conflict: Two agents write to the same memory file at the same time
Inconsistent state: After the snapshot is restored, the agent state may be inconsistent

2.1 Architecture design

┌─────────────────────────────────────────────────────────────────────────────┐
│                     OpenAI Agents SDK Sandbox Architecture                   │
│                                                                              │
│  ┌──────────────┐    ┌──────────────┐    ┌──────────────┐                  │
│  │  Sandbox A   │    │  Sandbox B   │    │  S3 Bucket   │                  │
│  │              │    │              │    │              │                  │
│  │  Agent A     │    │  Agent B     │    │  Agent Memory│                  │
│  │              │    │              │    │  (Shared)    │                  │
│  │  Read/Write  │    │  Read/Write  │    │              │                  │
│  └──────────────┘    └──────────────┘    └──────────────┘                  │
│       │                    │                      │                          │
│       └────────────────────┴──────────────────────┼────────────────────────┘
│                                                    │                                                   
└─────────────────────────────────────────────────────────────────────────────┘

Core Design Principles:

S3 as a shared memory layer: Both sandboxes write to the same S3 Bucket to achieve cross-container memory persistence.
Atomic Writing: Each agent writes its own memory file independently to avoid conflicts
Snapshot Restoration: After restoring a snapshot from S3, the agent can read the previous session state

2.2 Implementation Example

from agents import Runner
from agents.run import RunConfig
from agents.sandbox import SandboxAgent
from agents.sandbox.sandboxes import UnixLocalSandboxClient
import json
import boto3

# S3 作為共享記憶層
s3 = boto3.resource('s3')
bucket_name = 'my-agent-memory'

# 智能體 A：寫入自己的記憶
def agent_a_tools():
    def write_memory(message: str):
        """將訊息寫入 S3 記憶"""
        obj = s3.Object(bucket_name, 'agent_a_memory.json')
        obj.put(Body=json.dumps({"message": message, "timestamp": "2026-05-22T03:00:00Z"}))
        return "Memory written"
    
    return write_memory

# 智能體 B：讀取 A 的記憶
def agent_b_tools():
    def read_agent_a_memory():
        """讀取智能體 A 的記憶"""
        obj = s3.Object(bucket_name, 'agent_a_memory.json')
        return json.loads(obj.get()['Body'].read())
    
    return read_agent_a_memory

# 智能體 A
agent_a = SandboxAgent(
    name="Agent A",
    instructions="Write messages to S3 memory.",
    model="gpt-4o",
    tools=[agent_a_tools()],
)

# 智能體 B
agent_b = SandboxAgent(
    name="Agent B",
    instructions="Read Agent A's memory from S3.",
    model="gpt-4o",
    tools=[agent_b_tools()],
)

# 執行智能體 A
result_a = Runner.run_sync(agent_a, "Write 'Hello World' to memory.")
print(f"Agent A: {result_a.final_output}")

# 執行智能體 B
result_b = Runner.run_sync(agent_b, "Read Agent A's memory.")
print(f"Agent B: {result_b.final_output}")

2.3 Measurable Metrics: Memory Write/Read Latency

Metrics	Estimates	Measurement methods
S3 write latency	50-200ms	Timing write operations
S3 read latency	30-150ms	Timing read operations
Cross-container memory consistency	100%	Verified write/read results
Memory pollution rate	<0.1%	Verify independent memory layout

3. Snapshot recovery: status management and work phase continuity

3.1 The value of snapshot recovery

Problem scenario:

The sandbox execution time exceeds the preset limit (such as Docker container timeout)
Need to resume working session from breakpoint
Need to maintain state across work stages

Solution:

Remote Snapshot: Save sandbox state to S3 or other remote storage
Snapshot Recovery: Restore sandbox state from remote snapshot
Job Stage Continuity: Ensures that the agent can continue execution after recovery

3.2 Implementation Example

from agents import Runner
from agents.run import RunConfig
from agents.sandbox import SandboxAgent, SandboxRunConfig
from agents.sandbox.sandboxes import UnixLocalSandboxClient
import boto3

# S3 快照儲存
s3 = boto3.resource('s3')
snapshot_bucket = 'my-agent-snapshots'

def create_snapshot(sandbox, snapshot_name: str):
    """建立沙箱快照並儲存到 S3"""
    # 這裡的實作取決於沙箱後端
    # 對於 UnixLocalSandboxClient，可能需要將沙箱狀態序列化到 S3
    obj = s3.Object(snapshot_bucket, f"{snapshot_name}.json")
    obj.put(Body=json.dumps({
        "sandbox_id": sandbox.id,
        "state": "saved",
        "timestamp": "2026-05-22T03:00:00Z"
    }))
    return f"Snapshot {snapshot_name} saved"

def restore_snapshot(snapshot_name: str):
    """從 S3 恢復快照"""
    obj = s3.Object(snapshot_bucket, f"{snapshot_name}.json")
    state = json.loads(obj.get()['Body'].read())
    # 這裡的實作取決於沙箱後端
    return f"Snapshot {snapshot_name} restored"

# 建立快照
create_snapshot(my_sandbox, "checkpoint-1")

# 恢復快照
restore_snapshot("checkpoint-1")

3.3 Measurable Indicator: Snapshot Recovery Delay

Metrics	Estimates	Measurement methods
Snapshot creation delay	100-500ms	Timing snapshot operations
Snapshot recovery delay	200-1000ms	Timing recovery operations
Snapshot size	1-10MB	Verify snapshot file size
Snapshot consistency	100%	Verify post-restore state

4. Multi-agent independent memory layout: state consistency and permission management

4.1 Design of multi-agent memory layout

Problem scenario:

Both agents share the same sandbox workspace
Each agent needs independent memory
Need to avoid memory pollution and permission conflicts

Solution:

Independent memory file: Each agent has its own memory file
Atomic Write: Ensure atomicity of write operations
Permission Management: Ensure that the agent can only read its own memory

from agents import Runner
from agents.run import RunConfig
from agents.sandbox import SandboxAgent
from agents.sandbox.sandboxes import UnixLocalSandboxClient
import json

# 智能體 A：寫入自己的記憶
def agent_a_tools():
    def write_memory(message: str):
        """將訊息寫入 Agent A 的記憶"""
        memory_file = "/workspace/agent_a_memory.json"
        with open(memory_file, 'w') as f:
            json.dump({"message": message, "timestamp": "2026-05-22T03:00:00Z"}, f)
        return "Agent A memory written"
    
    return write_memory

# 智能體 B：讀取 A 的記憶
def agent_b_tools():
    def read_agent_a_memory():
        """讀取智能體 A 的記憶"""
        memory_file = "/workspace/agent_a_memory.json"
        with open(memory_file, 'r') as f:
            return json.load(f)
    
    return read_agent_a_memory

# 智能體 A
agent_a = SandboxAgent(
    name="Agent A",
    instructions="Write messages to your own memory.",
    model="gpt-4o",
    tools=[agent_a_tools()],
)

# 智能體 B：讀取 A 的記憶
agent_b = SandboxAgent(
    name="Agent B",
    instructions="Read Agent A's memory from the workspace.",
    model="gpt-4o",
    tools=[agent_b_tools()],
)

# 執行智能體 A
result_a = Runner.run_sync(agent_a, "Write 'Hello World' to memory.")
print(f"Agent A: {result_a.final_output}")

# 執行智能體 B
result_b = Runner.run_sync(agent_b, "Read Agent A's memory.")
print(f"Agent B: {result_b.final_output}")

4.2 Multi-agent gestures (Handoffs): cross-agent collaboration

Problem scenario:

Multiple agents need to collaborate to complete complex tasks
Need to transfer work stage status between agents
Need to ensure state consistency during collaboration

Solution:

Handoffs: Transfer work stage status between agents
State Management: Ensure state consistency during gestures
Permission Management: Ensure that the agent can only read its own memory

from agents import Runner
from agents.run import RunConfig
from agents.sandbox import SandboxAgent
from agents.sandbox.sandboxes import UnixLocalSandboxClient

# 智能體 A：處理初始任務
def agent_a_tools():
    def process_initial_task(task: str):
        """處理初始任務"""
        return f"Task processed: {task}"
    
    return process_initial_task

# 智能體 B：處理後續任務
def agent_b_tools():
    def process_followup_task(task: str):
        """處理後續任務"""
        return f"Follow-up task processed: {task}"
    
    return process_followup_task

# 智能體 A
agent_a = SandboxAgent(
    name="Agent A",
    instructions="Process initial tasks and hand off to Agent B.",
    model="gpt-4o",
    tools=[agent_a_tools()],
    handoffs=[agent_b],  # 手勢到 Agent B
)

# 智能體 B
agent_b = SandboxAgent(
    name="Agent B",
    instructions="Process follow-up tasks.",
    model="gpt-4o",
    tools=[agent_b_tools()],
)

# 執行智能體 A
result_a = Runner.run_sync(agent_a, "Process initial task: Analyze data.")
print(f"Agent A: {result_a.final_output}")

# 手勢到 Agent B
result_b = Runner.run_sync(agent_b, "Process follow-up task: Generate report.")
print(f"Agent B: {result_b.final_output}")

5. Deployment scenarios and trade-off analysis

5.1 Deployment scenario

Scenario 1: Long-term data processing

Issue: Data processing task that takes several hours
Solution: Use Sandbox Agent + S3 Memory Persistence
Advantages: State persistence to avoid loss of work stages
TRADE: Increased S3 latency (50-200ms vs local 1-10ms)

Scenario 2: Multi-agent collaboration

Problem: Multiple agents need to cooperate to complete complex tasks
Solution: Use independent memory layout + gesture delivery
Advantages: Avoid memory pollution and ensure state consistency
Tradeoff: Increased latency with gesture delivery (100-500ms vs native 1-10ms)

Scenario 3: Snapshot recovery

Issue: Need to resume work session from breakpoint
Solution: Use remote snapshot + snapshot recovery
Advantages: Status recovery to avoid loss of work phases
TRADE: Snapshot restore adds latency (200-1000ms vs local 1-10ms)

5.2 Trade-off analysis

Weighing Projects	Sandbox Agent + S3	Local Agent	Cloud Agent
Memory delay	50-200ms	1-10ms	10-100ms
State persistence	✓	✗	✓
Snapshot recovery	✓	✗	✓
Multi-agent collaboration	✓	✗	✓
Cost	Medium	Low	High
Security	High	Low	High

6. Conclusions and Suggestions

6.1 Core Conclusions

S3 memory persistence: Cross-container memory sharing is an effective solution to solve the loss of long-term task status, but it needs to accept the S3 delay of 50-200ms.
Snapshot recovery: Remote snapshot + snapshot recovery is an effective solution to solve the continuity of working stages, but it needs to accept the snapshot recovery delay of 200-1000ms.
Multi-agent independent memory layout: Independent memory files + atomic writing are an effective solution to memory pollution, but they need to accept the 100-500ms delay of gesture transmission.
Gestures (Handoffs): Transferring work stage status between agents is an effective solution to multi-agent collaboration, but it requires accepting a gesture delay of 100-500ms.

6.2 Suggestions

Prefer using S3 memory persistence: For long-term tasks, S3 memory persistence is the best choice.
Use snapshot recovery with caution: Snapshot recovery is a necessary solution for work phases that need to be restored from breakpoints.
Prefer independent memory layout: For multi-agent collaboration, independent memory layout is the best choice.
Consider gestures (Handoffs): For tasks that require the transfer of work stage status between agents, gestures are a necessary solution.

6.3 Things to note

S3 Latency: S3 read/write latency can impact the performance of real-time tasks.
Snapshot size: Snapshot size may affect recovery speed and storage costs.
Memory Pollution: It is necessary to ensure that the agent can only read its own memory.
Permission Management: It is necessary to ensure that the agent can only read its own memory.

Reference:

前沿信號：OpenAI Agents SDK Sandbox 遠端快照 + Memory Multi-Agent

一、問題背景：為什麼需要遠端快照 + 記憶多智能體模式？

1.1 沙箱智能體的局限

1.2 多智能體協作的記憶困境

二、S3 記憶體持久化：跨容器記憶共享

2.1 架構設計

2.2 實作範例

2.3 可衡量指標：記憶寫入/讀取延遲

三、快照恢復：狀態管理與工作階段連續性

3.1 快照恢復的價值

3.2 實作範例

3.3 可衡量指標：快照恢復延遲

四、多智能體獨立記憶佈局：狀態一致性與權限管理

4.1 多智能體記憶佈局的設計

4.2 多智能體手勢（Handoffs）：跨智能體協作

五、部署場景與權衡分析

5.1 部署場景

5.2 權衡分析

六、結論與建議

6.1 核心結論

6.2 建議

6.3 注意事項

Frontier Signal: OpenAI Agents SDK Sandbox Remote Snapshot + Memory Multi-Agent

1. Problem background: Why do we need remote snapshot + memory multi-agent mode?

1.1 Limitations of Sandbox Agents

1.2 Memory dilemma of multi-agent collaboration

2. S3 memory persistence: cross-container memory sharing

2.1 Architecture design

2.2 Implementation Example

2.3 Measurable Metrics: Memory Write/Read Latency

3. Snapshot recovery: status management and work phase continuity

3.1 The value of snapshot recovery

3.2 Implementation Example

3.3 Measurable Indicator: Snapshot Recovery Delay

4. Multi-agent independent memory layout: state consistency and permission management

4.1 Design of multi-agent memory layout

4.2 Multi-agent gestures (Handoffs): cross-agent collaboration

5. Deployment scenarios and trade-off analysis

5.1 Deployment scenario

5.2 Trade-off analysis

6. Conclusions and Suggestions

6.1 Core Conclusions

6.2 Suggestions

6.3 Things to note