* denotes equal contribution
2024
Observational Scaling Laws and the Predictability of Language Model Performance
Yangjun Ruan,
Chris J Maddison,
and Tatsunori Hashimoto
In Advances in Neural Information Processing Systems
(NeurIPS),
2024
[Spotlight]
TL;DR: We introduce observational scaling laws that unify a large set of public LMs in a shared capability space, enabling a low-cost, high-resolution, and broad-coverage scaling analysis for complex LM capabilities.
Graph-based Uncertainty Metrics for Long-form Language Model Outputs
Mingjian Jiang,
Yangjun Ruan,
Prasanna Sattigeri,
Salim Roukos,
and Tatsunori Hashimoto
In Advances in Neural Information Processing Systems
(NeurIPS),
2024
[Spotlight]
TL;DR: We introduce a family of graph-based uncertainty metrics for long-form LLM generations, and demonstrate consistent gains over existing methods.
2023
Calibrating Language Models via Augmented Prompt Ensembles
Mingjian Jiang*,
Yangjun Ruan*,
Sicong Huang,
Saifei Liao,
Silviu Pitis,
Roger Baker Grosse,
and Jimmy Ba
ICML Workshop on Deployment Challenges for Generative AI,
2023
TL;DR: A prompt-augmented ensemble method for calibrating LLMs that generalizes to open-ended generation.
Weighted Ensemble Self-Supervised Learning
Yangjun Ruan,
Saurabh Singh,
Warren Morningstar,
Alexander A. Alemi,
Sergey Ioffe,
Ian Fischer,
and Joshua V. Dillon
In International Conference on Learning Representations
(ICLR),
2023
TL;DR: An efficient training-time ensemble method for improving self-supervised representation learning, achieving SOTA results on ImageNet SSL & few-shot benchmarks.
2022
Augment with Care: Contrastive Learning for the Boolean Satisfiability Problem
Haonan Duan*,
Pashootan Vaezipoor*,
Max B Paulus,
Yangjun Ruan,
and Chris J Maddison
In International Conference on Machine Learning
(ICML),
2022
TL;DR: A label-efficient contrastive pre-training method for combinatorial optimization.
Optimal Representations for Covariate Shift
Yangjun Ruan*,
Yann Dubois*,
and Chris J Maddison
In International Conference on Learning Representations
(ICLR),
2022
TL;DR: We derive a self-supervised objective for learning optimally robust representations under covariate shift, offering insights into CLIP’s robustness and further enhancing its distributional robustness.
2021
Improving Lossless Compression Rates via Monte Carlo Bits-Back Coding
Yangjun Ruan*,
Karen Ullrich*,
Daniel Severo*,
James Townsend,
Ashish Khisti,
Arnaud Doucet,
Alireza Makhzani,
and Chris J Maddison
In International Conference on Machine Learning
(ICML),
2021
[Oral]
TL;DR: We introduce the Monte Carlo bits-back coding framework for deriving asymptotically optimal compression algorithms from tighter variational bounds.
2020
Learning to Learn by Zeroth-Order Oracle
Yangjun Ruan,
Yuanhao Xiong,
Sashank Reddi,
Sanjiv Kumar,
and Cho-Jui Hsieh
In International Conference on Learning Representations
(ICLR),
2020
TL;DR: A meta-learned zeroth-order optimizer that outperforms hand-designed algorithms.
2019
FastSpeech: Fast, Robust and Controllable Text to Speech
Yi Ren*,
Yangjun Ruan*,
Xu Tan,
Tao Qin,
Sheng Zhao,
Zhou Zhao,
and Tie-Yan Liu
In Advances in Neural Information Processing Systems
(NeurIPS),
2019
TL;DR: A non-autoregressive Transformer-based text-to-speech model that improves inference speed by 270x and enables controllable speech synthesis.
Data Transmission in Mobile Edge Networks: Whether and Where to Compress?
Jinke Ren*,
Yangjun Ruan*,
and Guanding Yu
IEEE Communications Letters,
2019
TL;DR: An analysis of the optimal compression ratio for minimizing end-to-end latency in mobile edge networks.