Real-world simulations for long-horizon AI agents.

ARIMLABS builds production-fidelity simulations where AI agents run multi-hour tasks, and the benchmarks that measure what frontier models actually do.

01 research areas

RLVR Environments

4 areas

Reinforcement learning with verifiable rewards: training environments where reward is computed from environment state, not human labels. A minimal sketch follows the list below.

  • Cyber — offense and defense rollouts
  • SRE — incident response simulations
  • Agentic reasoning
  • Software engineering and tool use
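
To make the RLVR idea concrete, here is a minimal, hypothetical sketch: a toy environment where the agent must harden an SSH setting, and the reward is computed by checking the resulting environment state rather than comparing against a labeled answer. The environment, action format, and scoring below are illustrative assumptions, not ARIMLABS code.

```python
from dataclasses import dataclass

@dataclass
class StepResult:
    observation: str
    reward: float
    done: bool

class PatchVulnEnv:
    """Toy RLVR environment: fix a vulnerable SSH setting.

    Illustrative only; real environments wrap full systems.
    """

    MAX_STEPS = 10

    def reset(self) -> str:
        self.config = {"ssh_password_auth": "yes"}  # vulnerable initial state
        self.steps = 0
        return f"config: {self.config}"

    def step(self, action: str) -> StepResult:
        self.steps += 1
        # Apply the agent's action, here a simple "key = value" edit command.
        if "=" in action:
            key, _, value = action.partition("=")
            self.config[key.strip()] = value.strip()
        # Verifiable reward: inspect the resulting state directly.
        # No human label is consulted at any point.
        fixed = self.config.get("ssh_password_auth") == "no"
        return StepResult(
            observation=f"config: {self.config}",
            reward=1.0 if fixed else 0.0,
            done=fixed or self.steps >= self.MAX_STEPS,
        )

env = PatchVulnEnv()
env.reset()
print(env.step("ssh_password_auth = no"))  # reward=1.0, done=True
```

The same pattern scales from this toy to the rollouts above: the environment defines a checkable success condition (service patched, flag captured, incident contained), and reward is derived from that check.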

Benchmarks

2 areas

Our focus is measuring cybersecurity capabilities in frontier models. We split benchmarks across two tracks — isolated skills tested as short tasks, and long-horizon engagements that unfold over hours or days inside realistic networks. A sketch of the task schema follows the list below.

  • Isolated capabilities
    CTF-style challenges, vulnerability identification, exploit development, code audit — single-step or short-task tests that isolate one skill.
  • Long-horizon engagements
    Multi-hour red-team operations, defensive triage under active adversary, persistent offensive ops — full kill chains that unfold over hours or days.

LLM Safety Benchmarking

3 areas

Testing for containment failure, deception, and hidden capability in frontier models. One concrete sandbagging check is sketched after this list.

  • Containment
  • Deception and sandbagging
  • Capability elicitation
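
One way a sandbagging check can work, as a minimal sketch: run the same task set under a neutral framing and under a framing that discloses the evaluation, then flag a pass-rate gap that exceeds sampling noise. The `run_task` stub, framing strings, and threshold below are all illustrative assumptions.

```python
import random
import statistics

def run_task(task: str, framing: str) -> bool:
    # Stub standing in for a real model call. It simulates a model that
    # strategically underperforms when told it is being evaluated.
    success_rate = 0.8 if framing == "neutral" else 0.6
    return random.random() < success_rate

def sandbagging_gap(tasks: list[str], trials: int = 50) -> float:
    def pass_rate(framing: str) -> float:
        runs = [run_task(t, framing) for t in tasks for _ in range(trials)]
        return statistics.mean(runs)  # average of True/False = pass rate
    return pass_rate("neutral") - pass_rate("evaluation_disclosed")

gap = sandbagging_gap(["exploit-dev-01", "triage-03", "persistence-02"])
print(f"pass-rate gap: {gap:.2f}")
if gap > 0.10:  # illustrative threshold, well above sampling noise here
    print("flag for manual review: possible sandbagging")
```

A gap on its own is evidence, not proof; in practice it would trigger deeper capability-elicitation runs rather than a verdict.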

02 writing