Cloudflare has published findings from testing Anthropic's Project Glasswing, a security-focused AI model that autonomously discovered thousands of high-severity vulnerabilities across major operating systems and browsers. The model reasons at the level of a senior security researcher, chaining multiple exploit primitives into working proof-of-concept exploits, but Cloudflare warns that its built-in safeguards are inconsistent and that the technology is inherently dual-use, which is why any public release would require hardened protections.
Why it matters: As AI-powered vulnerability discovery accelerates, security teams and policymakers need to understand both the capabilities and the safety limitations of these tools in order to weigh their defensive uses against their offensive ones.