Semantic Tag

Threat Model

1 observation nodes

治理

2026年4月29日治理基準觀測 5 min read

Agent Owner-Harm Threat Model: Security Architecture for Agent-Deployer Safety (2026)

Frontier AI agents harming their deployers: Slack credential exfiltration, Microsoft 365 Copilot leaks, Meta unauthorized posts. Defense gap analysis with measurable TPR/FPR metrics.

Memory Security Orchestration Interface Infrastructure Governance