Here is a list of papers that I have authored; it is also available on my Semantic Scholar and Google Scholar profiles.

α indicates equal contribution; ω indicates core contributors.

2026

2025

  • Olmo 3

    preprint
  • DR Tulu: Reinforcement Learning with Evolving Rubrics for Deep Research

    preprint
  • FlexOlmo: Open Language Models for Flexible Data Use

    NeurIPS 2025

    spotlight
  • The Common Pile v0.1: An 8TB Dataset of Public Domain and Openly Licensed Text

    NeurIPS 2025

  • 2 OLMo 2 Furious

    COLM 2025

  • Tülu 3: Pushing Frontiers in Open Language Model Post-Training

    COLM 2025

  • OLMoTrace: Tracing Language Model Outputs Back to Trillions of Training Tokens

    ACL 2025 System Demonstration

    best paper award
  • OLMoE: Open Mixture-of-Experts Language Models

    ICLR 2025

    oral
  • Language models scale reliably with over-training and on downstream tasks

    ICLR 2025

  • Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Multimodal Models

    CVPR 2025

    best paper honorable mention

2024

2023

  • Queer In AI: A Case Study in Community-Led Participatory AI

    FAccT 2023

    best paper award
  • The Semantic Reader Project: Augmenting Scholarly Documents through AI-Powered Interactive Reading Interfaces

    preprint
  • The Semantic Scholar Open Data Platform

    preprint

2022

2021

2020

2018

2017

2016

2015

2014

The content of this website is licensed under CC BY 4.0