I am a Ph.D. student in Computer Science at the University of Toronto, where I am fortunate to be advised by Chris Maddison and Jimmy Ba. Currently, I am also a visiting scholar at Stanford University, hosted by Tatsunori Hashimoto.
Previously, I was a student researcher at Google Research and a research intern at Microsoft Research. In summer 2019, I was a visiting student at UCLA, where I worked with Cho-Jui Hsieh. I obtained my Bachelor's degree in Information Engineering from Zhejiang University.
Research
My research focuses on the scaling, evaluation, and alignment of language models and agents, especially as they approach or exceed superhuman performance levels.
Collaboration opportunities:
I am always open to discussing research ideas and collaborations. If you are a student at UofT interested in language models, agents, AI safety, or other related topics, please do not hesitate to reach out to me!
(* denotes equal contribution)
Observational Scaling Laws and the Predictability of Language Model Performance
Yangjun Ruan,
Chris J Maddison,
and Tatsunori Hashimoto
In Advances in Neural Information Processing Systems
(NeurIPS),
2024
[Spotlight]
TL;DR: We introduce observational scaling laws that unify a large set of public LMs in a shared capability space, enabling a low-cost, high-resolution, and broad-coverage scaling analysis for complex LM capabilities.
Graph-based Uncertainty Metrics for Long-form Language Model Outputs
Mingjian Jiang,
Yangjun Ruan,
Prasanna Sattigeri,
Salim Roukos,
and Tatsunori Hashimoto
In Advances in Neural Information Processing Systems
(NeurIPS),
2024
[Spotlight]
TL;DR: We introduce a family of graph-based uncertainty metrics for long-form LLM generations, and demonstrate consistent gains over existing methods.
Weighted Ensemble Self-Supervised Learning
Yangjun Ruan,
Saurabh Singh,
Warren Morningstar,
Alexander A. Alemi,
Sergey Ioffe,
Ian Fischer,
and Joshua V. Dillon
In International Conference on Learning Representations
(ICLR),
2023
TL;DR: An efficient training-time ensemble method for improving self-supervised representation learning, achieving SOTA results on ImageNet SSL & few-shot benchmarks.
Optimal Representations for Covariate Shift
Yangjun Ruan*,
Yann Dubois*,
and Chris J Maddison
In International Conference on Learning Representations
(ICLR),
2022
TL;DR: We derive a self-supervised objective for learning optimally robust representations under covariate shift, offering insights into CLIP’s robustness and further enhancing its distributional robustness.
Improving Lossless Compression Rates via Monte Carlo Bits-Back Coding
Yangjun Ruan*,
Karen Ullrich*,
Daniel Severo*,
James Townsend,
Ashish Khisti,
Arnaud Doucet,
Alireza Makhzani,
and Chris J Maddison
In International Conference on Machine Learning
(ICML),
2021
[Oral]
TL;DR: We introduce the Monte Carlo bits-back coding framework for deriving asymptotically optimal compression algorithms from tighter variational bounds.
Services
- Conference reviewer: NeurIPS (2020-), ICLR (2021-), ICML (2021-)
- Workshop reviewer: NeurIPS Workshop on DGMs Applications (2021), ICML Workshop on Pretraining (2022)
Selected Awards & Honors
- Ontario Graduate Scholarship, 2023
- DiDi Graduate Student Award, 2021
- Chu Kochen Scholarship (highest honor at Zhejiang University), 2019
- Cross-disciplinary Scholars in Science and Technology (CSST), UCLA, 2019
- National Scholarship (top 1.5%), 2017, 2018, 2019
- Meritorious Winner, Interdisciplinary Contest in Modeling (ICM), 2018