▸ internship · cohort forming
A few weeks. One benchmark. Real nerds only.
We hire the best — people with outstanding ideas. In 4–8 weeks you'll design and ship a public benchmark for frontier AI models: a capability eval, or a realistic cyber range.
- term: 4–8 weeks
- stipend: $1,200 + up to $5k API
- mode: remote / Warsaw
- ship: a public benchmark
01 what you get
- $1,200 stipend + up to $5,000 in frontier-model API tokens · paid
- A public benchmark with your name on the spec · yours
- 1:1 calls with the leadership team · mentorship
- Real fluency in how frontier AI is evaluated · you'll learn
- The fastest path to a full-time seat here · what's next
02 who we hire
- Outstanding ML-research instinct: you read papers and run them · research
- Taste for what makes a model capable, and what fakes it · judgment
- Niche depth in SWE or cybersecurity: CTF, low-level engineering, ML research · craft
- Real nerds: the ones who felt the pain during the degree, not just collected it · we mean it
Any rigorous technical background is welcome — physics, applied mathematics, EE — not just CS.
03 apply
What you've built or broken, and the benchmark you'd build here. Two minutes.
▸ Your own words only — answers that read as AI slop are rejected immediately.