Lunjun Zhang

Research

I work on unsupervised learning and reinforcement learning.

I am fascinated by the following questions:

How can AI agents autonomously learn from experience in open-ended environments without external supervision?
What mechanisms of recursive self-improvement can lead to unbounded growth in model capabilities?
Can reinforcement learning shift the scaling laws of generative models toward a hard takeoff?

Lunjun Zhang, Yuwen Xiong, Ze Yang, Sergio Casas, Rui Hu, Raquel Urtasun

International Conference on Learning Representations (ICLR), 2024

Discrete diffusion on tokenized experience can lead to a GPT-like learning paradigm for robotics.

Lunjun Zhang, Anqi Joyce Yang, Yuwen Xiong, Sergio Casas, Bin Yang, Mengye Ren, Raquel Urtasun

Conference on Computer Vision and Pattern Recognition (CVPR), 2023

Self-supervision combined with object priors can enable scalable object discovery in the wild.

Lunjun Zhang, Bradly Stadie

Foundation Models for Decision Making (FMDM) workshop, NeurIPS 2022

Deep Reinforcement Learning (Deep RL) workshop, NeurIPS 2022

Recasting goal-conditioned RL into the imitation learning framework.

Lunjun Zhang, Ge Yang, Bradly Stadie

International Conference on Machine Learning (ICML), 2021 (Long Talk)

Learning world models that endow agents with the ability to do temporally extended reasoning.

Lunjun Zhang, Bradly Stadie, Jimmy Ba

Conference on Uncertainty in Artificial Intelligence (UAI), 2020

Recasting the problem of finding intrinsic rewards as hyperparameter optimization.