突破 風險修復 16 min read
Multi-LLM Cybersecurity Benchmark Comparison: Claude Mythos Preview vs Opus 4.6 2026
Frontier model comparison for vulnerability discovery and exploitation: Mythos Preview achieves 83.1% vs Opus 4.6 66.6% on CyberGym, autonomous zero-day discovery, and measurable tradeoffs.
Memory Security Interface Infrastructure Governance