publications
2024
- Connecting the Dots: LLMs can Infer and Verbalize Latent Structure from Disparate Training Data. arXiv preprint arXiv:2406.14546, 2024
- LLM Processes: Numerical Predictive Distributions Conditioned on Natural Language. arXiv preprint arXiv:2405.12856, 2024
2023
- Tools for Verifying Neural Models’ Training Data. arXiv preprint arXiv:2307.00682, 2023
2020
- Self-Tuning Stochastic Optimization with Curvature-Aware Gradient Filtering. Workshop on "I Can’t Believe It’s Not Better!" (ICBINB), NeurIPS 2020
2019
- On empirical comparisons of optimizers for deep learning. arXiv preprint arXiv:1910.05446, 2019
- Faster neural network training with data echoing. arXiv preprint arXiv:1907.05550, 2019
- Guided evolutionary strategies: Augmenting random search with surrogate gradients. In International Conference on Machine Learning (ICML), 2019
2018
- Backpropagation through the Void: Optimizing control variates for black-box gradient estimation. In International Conference on Learning Representations (ICLR), 2018