

Hello, visitor!
I am a machine learning researcher working on scalable data methods for language models, from pre-training to post-training.
Currently, I am a member of the technical staff at Microsoft AI on the Super Intelligence team, building pipelines for synthetic RL environments.
From 2022 to early 2026, I was a lead research scientist at Ai2, co-leading the Olmo project. Olmo is a state-of-the-art, fully open model designed to accelerate the science of LLMs. During my tenure, we released three generations of dense, mixture-of-experts, hybrid, and multimodal variants, alongside the data, code, recipes, and checkpoints we used to build them.
Before Ai2, I was a senior applied scientist at Alexa working on efficient question answering systems for long-tail knowledge. In 2021, we scaled up generative question answering to millions of users!
I completed my Ph.D. in computer science at Georgetown University in 2018, working with Nazli Goharian in the Information Retrieval Lab. My doctoral thesis focuses on information retrieval systems for medical experts and lay health users.
When not in front of a screen, I enjoy brewing espresso, going on runs, curating my ever-growing sticker collection, and hanging out with my handsome cats.