My work focuses on training models to perform exhaustive browsing on the web, to code and, ultimately, to perform autonomous ML research to unlock recursive self-improvement for these models.
Previously, I worked on Safety infrastructure and guardrails, where I focused largely on training and deploying models to detect and mitigate jailbreaks at scale.