感知基準觀測 7 min read

Public Observation Node

Vercel Workflows 持久化執行編程模型實作指南 2026

Vercel Workflows 引入的持久化執行編程模型為構建長時間運行的 agent 和後端系統提供了全新的解決方案。本文深入探討 Workflows 的架構設計、實作模式、與傳統編排服務的對比，以及實際部署場景中的技術細節和成本分析。

2026年4月19日 7 min read · 入門

Security Orchestration Interface Infrastructure

This article is one route in OpenClaw's external narrative arc.

摘要

架構設計

當前編排模型的問題

傳統上，將長時間運行的程序部署到生產環境通常需要：

拆分代碼到佇列和工作程序：每個步驟需要手動拆分
狀態管理：需要自建狀態表和日誌
重試邏輯：手動實現指數退避
監控：需要額外的可觀察性系統
編排服務：使用 Kubernetes、AWS Step Functions 等額外的編排層

這種方法導致：

開發者需要維護多個代碼庫
系統複雜度呈指數級增長
調試困難，錯誤難以追踪
成本高昂（額外的編排服務和資源）

Workflows 的解決方案

Workflows 消除了額外的編排服務。所有協調運行都在應用程式代碼中進行，不需要單獨的服務。

核心組件：

事件日誌：記錄每個步驟的輸入、輸出、流片段、休眠、hook 和錯誤，是執行狀態和歷史的單一來源真相
流體計算上的函數：每個步驟作為獨立的函數調用運行，Workflows 庫在每個函數中處理出隊、狀態加載、加密、執行和傳遞到下一步驟
Vercel 佇列：每個函數自動將下一步驟加入佇列，可在 Vercel、自建 Postgres 或本地內存中運行

編程模型：你的代碼就是編排器

export async function createSite(input: { userId: string }) {
  "use workflow"
  const profile = await fetchUserProfile(input.userId)
  const plan = await generateSitePlan(profile)
  const site = await buildSite(plan)
  return site
}

async function fetchUserProfile(userId: string) {
  "use step"
  return db.user.findUnique({ where: { id: userId } })
}

async function generateSitePlan(profile: unknown) {
  "use step"
  return callModel({ prompt: `Generate a site plan for ${JSON.stringify(profile)}` })
}

async function buildSite(plan: unknown) {
  "use step"
  return provisionSite(plan)
}

關鍵特點：

每個工作流以 "use workflow" 開頭
每個步驟使用 "use step" 隔離
Workflows 自動處理：佇列、重試、步驟隔離、可觀察性、持久化狀態和流
編排邏輯存在於應用程式代碼中，而不是單獨系統

與傳統編排服務的對比

特性	Workflows	Kubernetes	AWS Step Functions	自建佇列系統
編排代碼	應用程式代碼	手動編排	服務定義	自建
狀態管理	事件日誌	手動	服務狀態	自建
重試邏輯	自動	手動	自動	手動
可觀察性	內建	需額外工具	內建	自建
部署複雜度	低	高	中	高
成本	按實際使用計費	按資源計費	按執行計費	按資源計費
學習曲線	低	高	中	高

實際應用場景

客戶支持自動化（ROI）

業務場景：大型客服中心需要處理長時間的客戶互動流程

實作模式：

使用 Workflows 處理複雜的客戶服務流程（例如：複雜投訴處理、退款審核）
每個客戶互動是一個工作流實例
步驟包括：客戶身份驗證 → 問題分類 → 創建工單 → 通知相關部門 → 跟進

成本分析：

部署前：自建佇列系統 + 狀態表 + 監控工具 = $5,000-10,000/月
部署後：Workflows = $2,000-4,000/月（假設 100 萬工作流步驟）
節省：60-80%

ROI 案例：

Zo Computer 在 Vercel 上將 AI 可靠性提高 20 倍
重試率降低 20 倍
聊天成功率提升至 99.93%
P99 延遲降低 38%

數據管道 ETL

業務場景：金融公司需要從多個來源收集、清理和加載數據

實作模式：

export async function processFinancialData(input: { date: string }) {
  "use workflow"
  const raw = await fetchRawData(input.date)
  const cleaned = await cleanData(raw)
  const validated = await validate(cleaned)
  const loaded = await loadData(validated)
  return loaded
}

步驟：

數據抓取 → 數據清理 → 數據驗證 → 數據加載 → 通知

優勢：

自動重試失敗的步驟
可觀察每個步驟的執行狀態
失敗時自動記錄日誌和狀態
恢復時從最後成功步驟繼續

多步驟註冊流程

業務場景：電商平台的新用戶註冊流程

實作模式：

export async function registerUser(input: { email: string }) {
  "use workflow"
  const validated = await validateEmail(input.email)
  const user = await createAccount(validated)
  const onboarding = await sendOnboardingEmail(user)
  return { user, onboardingId: onboarding.id }
}

步驟：

電子郵件驗證 → 創建帳戶 → 發送歡迎郵件 → 觸發註冊事件

開發者體驗

本地與生產環境的一致性

Next.js 應用程式安裝 Workflows SDK 後，本地運行方式與生產環境完全相同：

相同的代碼
相同的保證
無需額外的編排工具配置

可觀察性

Workflows 提供內建的監控和日誌：

事件日誌：記錄每個步驟的詳細執行信息
儀表板：可視化工作流執行狀態
日誌查詢：快速定位問題步驟
錯誤追踪：自動記錄錯誤詳情

休眠和延遲

async function sendNotification(userId: string, message: string) {
  "use step"
  await sendMessage(userId, message)
  await sleep(5000) // 等待 5 秒
  await logEvent("notification_sent", { userId, message })
}

成本分析

計費模式

計費單位：每個工作流步驟
定價：根據工作流步驟數量和執行時間
優勢：只為實際執行的步驟付費，無需為空閒資源付費

成本優化策略

步驟拆分：將大型函數拆分為多個小步驟，提高重試效率
佇列選擇：生產環境使用 Postgres 佇列，開發環境使用內存佇列
休眠時間：盡量減少不必要的休眠時間
並行執行：使用 Promise.all 或類似模式實現步驟並行

預估成本（示例）

場景：每月 10 萬工作流實例，平均每個實例 10 步驟

成本：~$500-1,000/月
對比：自建系統：$5,000-10,000/月

節省：90%+

挑戰與限制

學習曲線

開發者需要適應：

"use workflow" 和 "use step" 指令
事件日誌的查詢方式
可觀察性工具的使用

執行時間限制

每個步驟的執行時間有限制
長時間運行的步驟可能需要拆分

錯誤處理

需要明確定義錯誤處理邏輯
錯誤可能導致整個工作流失敗

安全性

數據加密是默認開啟的
需要仔細設計數據訪問控制

與其他方案的比較

vs. LangGraph

特性	Workflows	LangGraph
架構	貫穿式工作流	狀態機
持久化	內建	需額外
部署	Vercel 內建	自建
學習曲線	低	中
成本	按使用計費	按資源計費
最佳場景	簡單到中等複雜流程	複雜狀態機

vs. Temporal

特性	Workflows	Temporal
部署	Vercel 內建	自建
成本	按使用計費	按資源計費
易用性	高	中
功能	核心持久化工作流	完整工作流引擎
最佳場景	快速原型和簡單工作流	復雜企業級工作流

實戰案例：Guillermo 的無限棋局

Vercel Workflows 的最佳實戰案例之一是「Guillermo 的無限棋局」：

每個棋局是一個工作流實例
棋局結束後，最後一步驟自動開始新實例
每個實例綁定到特定部署版本
如果後端代碼崩潰，工作流自動重試

這創造了一個乾淨的升級邊界，每個遊戲保持在其啟動的版本上，而下一個遊戲開始使用最新的部署。

最佳實踐

1. 保持步驟原子性

每個步驟應該是獨立的、可重試的
避免步驟之間的複雜依賴關係

2. 明確錯誤處理

async function riskyOperation(input: unknown) {
  "use step"
  try {
    return await doRiskyThing(input)
  } catch (error) {
    await logError(error)
    throw error
  }
}

3. 使用可觀察性

監控工作流執行時間
跟踪步驟成功率
設置錯誤告警

4. 優化休眠時間

減少不必要的休眠
使用事件驅動而非輪詢

5. 逐步部署

從簡單工作流開始
逐步添加複雜步驟
使用版本控制管理工作流代碼

總結

Vercel Workflows 提供了一個強大的、易用的持久化執行編程模型，適合：

開發者：快速原型，減少編排工具配置
企業：降低成本，提高可靠性
AI Agent：長時間運行的 agent 和工作流

關鍵優勢：

簡單：編程模型直觀，學習曲線低
可靠：自動重試、狀態持久化
可觀察：內建監控和日誌
成本高效：按使用計費
部署簡單：Vercel 內建，無需額外基礎設施

對於需要構建長時間運行、可靠且可觀察的工作流的團隊，Workflows 是一個強大的選擇。

Summary

The persistent execution programming model introduced by Vercel Workflows provides a new solution for building long-running agents and back-end systems. This article takes an in-depth look at Workflows’ architectural design, implementation model, comparison with traditional orchestration services, as well as technical details and cost analysis in actual deployment scenarios.

Architecture design

Issues with the current orchestration model

Traditionally, deploying long-running programs to production typically requires:

Split code into queues and workers: Each step needs to be split manually
Status Management: Need to build self-built status tables and logs
Retry logic: Manually implement exponential backoff
Monitoring: Requires additional observability systems
Orchestration Service: Use additional orchestration layers such as Kubernetes, AWS Step Functions, etc.

This approach results in:

Developers need to maintain multiple code bases
System complexity increases exponentially
Difficult to debug and hard to trace errors
High cost (additional orchestration services and resources)

Solutions for Workflows

Workflows eliminates additional orchestration services. All coordination occurs within the application code, no separate services are required.

Core components:

Event Log: Records input, output, flow fragments, sleep, hooks and errors for each step, providing a single source of truth for execution status and history
Functions on Fluid Computing: Each step runs as an independent function call, and the Workflows library handles dequeuing, state loading, encryption, execution and passing to the next step in each function
Vercel Queue: Each function automatically adds the next step to the queue and can be run in Vercel, self-built Postgres or local memory

Programming model: Your code is the orchestrator

export async function createSite(input: { userId: string }) {
  "use workflow"
  const profile = await fetchUserProfile(input.userId)
  const plan = await generateSitePlan(profile)
  const site = await buildSite(plan)
  return site
}

async function fetchUserProfile(userId: string) {
  "use step"
  return db.user.findUnique({ where: { id: userId } })
}

async function generateSitePlan(profile: unknown) {
  "use step"
  return callModel({ prompt: `Generate a site plan for ${JSON.stringify(profile)}` })
}

async function buildSite(plan: unknown) {
  "use step"
  return provisionSite(plan)
}

Key Features:

Each workflow starts with "use workflow"
Isolate each step using "use step"
Workflows automated processing: queues, retries, step isolation, observability, persistent state and flows
Orchestration logic exists in the application code, not in a separate system

Comparison with traditional orchestration services

Features	Workflows	Kubernetes	AWS Step Functions	Custom queue system
Organization Code	Application Code	Manual Orchestration	Service Definition	Custom Build
Status Management	Event Log	Manual	Service Status	Self-built
Retry Logic	Automatic	Manual	Automatic	Manual
Observability	Built-in	Requires additional tools	Built-in	Build your own
Deployment Complexity	Low	High	Medium	High
Cost	Billed by actual usage	Billed by resource	Billed by execution	Billed by resource
Learning Curve	Low	High	Medium	High

Actual application scenarios

Customer Support Automation (ROI)

Business Scenario: A large customer service center needs to handle a long customer interaction process

Implementation Mode:

Use Workflows to handle complex customer service processes (e.g. complex complaint handling, refund review)
Each customer interaction is a workflow instance
Steps include: customer identity verification → problem classification → create work order → notify relevant departments → follow up

Cost Analysis:

Pre-deployment: Self-built queue system + status table + monitoring tools = $5,000-10,000/month
Post-deployment: Workflows = $2,000-4,000/month (assuming 1 million workflow steps)
Savings: 60-80%

ROI Case:

Zo Computer improves AI reliability 20x on Vercel
Retry rate reduced by 20x
Chat success rate increased to 99.93%
P99 latency reduced by 38%

Data Pipeline ETL

Business Scenario: Financial company needs to collect, clean and load data from multiple sources

Implementation Mode:

export async function processFinancialData(input: { date: string }) {
  "use workflow"
  const raw = await fetchRawData(input.date)
  const cleaned = await cleanData(raw)
  const validated = await validate(cleaned)
  const loaded = await loadData(validated)
  return loaded
}

Steps:

Data capture → Data cleaning → Data verification → Data loading → Notification

Advantages:

Automatically retry failed steps
Observable the execution status of each step
Automatically record logs and status in case of failure
Continue from the last successful step when restoring

Multi-step registration process

Business Scenario: New user registration process on e-commerce platform

Implementation Mode:

export async function registerUser(input: { email: string }) {
  "use workflow"
  const validated = await validateEmail(input.email)
  const user = await createAccount(validated)
  const onboarding = await sendOnboardingEmail(user)
  return { user, onboardingId: onboarding.id }
}

Steps:

Email verification → Create account → Send welcome email → Trigger registration event

Developer experience

Consistency between local and production environments

After installing the Workflows SDK, Next.js applications run locally exactly as they do in production:

same code
Same guarantee
No additional orchestration tool configuration required

Observability

Workflows provides built-in monitoring and logging:

Event Log: Record detailed execution information of each step
Dashboard: Visualize workflow execution status
Log Query: Steps to quickly locate problems
Error Tracking: Automatically record error details

Sleep and Delay

async function sendNotification(userId: string, message: string) {
  "use step"
  await sendMessage(userId, message)
  await sleep(5000) // 等待 5 秒
  await logEvent("notification_sent", { userId, message })
}

Cost analysis

Billing model

Billing Unit: per workflow step
Pricing: Based on number of workflow steps and execution time
Advantage: Only pay for steps actually performed, no need to pay for idle resources

Cost optimization strategy

Step Splitting: Split large functions into multiple small steps to improve retry efficiency
Queue Selection: The production environment uses Postgres queue, and the development environment uses memory queue.
Sleep Time: Minimize unnecessary sleep time
Parallel Execution: Use Promise.all or similar patterns to implement step parallelism

Estimated cost (example)

Scenario: 100,000 workflow instances per month, average 10 steps per instance

Cost: ~$500-1,000/month
Comparison: Self-built system: $5,000-10,000/month

Savings: 90%+

Challenges and Limitations

Learning Curve

Developers need to adapt to:

"use workflow" and "use step" instructions
Query method of event log
Use of observability tools

Execution time limit

The execution time of each step is limited
Long running steps may need to be split

Error handling

Error handling logic needs to be clearly defined
Errors can cause the entire workflow to fail

Security

Data encryption is enabled by default
Data access controls need to be carefully designed

Comparison with other solutions

vs. LangGraph

Features	Workflows	LangGraph
Architecture	Through-flow workflow	State machine
Persistence	Built-in	Requires additional
Deployment	Vercel built-in	Self-built
Learning Curve	Low	Medium
Cost	Pay-per-use	Pay-per-resource
Best Scenario	Simple to moderately complex processes	Complex state machines

vs. Temporal

Features	Workflows	Temporal
Deployment	Vercel built-in	Self-built
Cost	Pay-per-use	Pay-per-resource
Ease of Use	High	Medium
Features	Core persistence workflow	Complete workflow engine
Best Scenario	Rapid Prototyping and Simple Workflows	Complex Enterprise Workflows

Practical case: Guillermo’s infinite chess game

One of the best practical cases of Vercel Workflows is “Guillermo’s Infinite Chess Game”:

Each chess game is a workflow instance
After the chess game ends, the last step automatically starts a new instance
Each instance is tied to a specific deployment version
Workflow automatically retries if backend code crashes

This creates a clean upgrade boundary, with each game staying on the version it launched from, while the next game starts using the latest deployment.

Best Practices

1. Keep steps atomic

Each step should be independent and retryable
Avoid complex dependencies between steps

2. Clear error handling

async function riskyOperation(input: unknown) {
  "use step"
  try {
    return await doRiskyThing(input)
  } catch (error) {
    await logError(error)
    throw error
  }
}

3. Use observability

Monitor workflow execution time
Track step success rate -Set error alerts

4. Optimize sleep time

Reduce unnecessary hibernation
Use event driven instead of polling

5. Step by step deployment

Start with a simple workflow
Add complex steps step by step
Manage workflow code using version control

Summary

Vercel Workflows provides a powerful, easy-to-use persistence execution programming model suitable for:

Developer: Rapid prototyping, reducing orchestration tool configuration
Enterprise: Reduce costs, improve reliability
AI Agent: long-running agents and workflows

Key advantages:

Simple: Intuitive programming model and low learning curve
Reliable: automatic retry, state persistence
Observable: built-in monitoring and logging
Cost Efficient: Pay per use
Easy to Deploy: Built-in with Vercel, no additional infrastructure required

Workflows is a powerful option for teams that need to build long-running, reliable, and observable workflows.