
I obtained my PhD at the Department of Computer Science, University of Toronto, supervised by Roger Grosse. Previously, I did my undergraduate studies at the Department of Electronic Engineering, Tsinghua University.
My research focuses on machine learning, especially the combination of Bayesian methods and deep neural networks. I aim to leverage probabilistic methods to improve the quality, reliability, and efficiency of machine learning systems. Specifically, I investigate how to provide uncertainty estimates in probabilistic models and how to exploit that uncertainty to improve robustness and guide exploration. I am also interested in improving the learning efficiency and out-of-distribution generalization of intelligent systems, for example in continual learning and meta-learning.
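As a minimal sketch of what "uncertainty estimation in probabilistic models" means concretely, the snippet below computes the exact posterior predictive mean and variance in Bayesian linear regression; the toy data, prior precision `alpha`, and noise precision `beta` are illustrative assumptions, not taken from any of the papers listed here.

```python
# Minimal illustrative sketch: posterior predictive uncertainty in
# Bayesian linear regression (toy data and hyperparameters are assumed).
import numpy as np

rng = np.random.default_rng(0)

# Toy 1-D regression data.
X = rng.uniform(-3, 3, size=(20, 1))
y = np.sin(X[:, 0]) + 0.1 * rng.standard_normal(20)

# Design matrix with a bias feature; prior w ~ N(0, alpha^-1 I),
# observation noise variance beta^-1.
Phi = np.hstack([X, np.ones_like(X)])
alpha, beta = 1.0, 100.0

# Posterior over weights: N(m, S).
S_inv = alpha * np.eye(2) + beta * Phi.T @ Phi
S = np.linalg.inv(S_inv)
m = beta * S @ Phi.T @ y

# Posterior predictive mean and variance at test inputs.
X_test = np.linspace(-4, 4, 5)[:, None]
Phi_test = np.hstack([X_test, np.ones_like(X_test)])
pred_mean = Phi_test @ m
pred_var = 1.0 / beta + np.einsum("nd,de,ne->n", Phi_test, S, Phi_test)

# Predictive standard deviation grows away from the training data.
print(pred_mean)
print(np.sqrt(pred_var))
```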
CV / GitHub / Google Scholar / Twitter
Research
Peer-Reviewed Papers
- Information-theoretic Online Memory Selection for Continual Learning. ICLR 2022.
- Understanding the Variance Collapse of SVGD in High Dimensions. ICLR 2022.
- Scalable Variational Gaussian Processes via Harmonic Kernel Decomposition. ICML 2021.
- Beyond Marginal Uncertainty: How Accurately can Bayesian Regression Models Estimate Posterior Predictive Correlations? [oral] AISTATS 2021.
- Fast-rate PAC-Bayes Generalization Bounds via Shifted Rademacher Processes. NeurIPS 2019.
- Functional Variational Bayesian Neural Networks. ICLR 2019.
- Aggregated Momentum: Stability Through Passive Damping. ICLR 2019.
- Differentiable Compositional Kernel Learning for Gaussian Processes. ICML 2018.
- Noisy Natural Gradient as Variational Inference. ICML 2018.
- A Spectral Approach to Gradient Estimation for Implicit Distributions. ICML 2018.
- Kernel Implicit Variational Inference. ICLR 2018.
- Learning Structured Weight Uncertainty in Bayesian Neural Networks. AISTATS 2017.
- On the Spectral Efficiency of Massive MIMO Systems With Low-Resolution ADCs. IEEE Communications Letters, 2016.