Reinforcement Learning · Large Language Models

Hi, I'm Scott!

Looking for a Summer 2026 ML Engineer/Scientist internship! 🔍

I'm an MScAC student at the University of Toronto working on Reinforcement Learning and Large Language Models. I'm fortunate to be supervised by Prof. Sheila McIlraith and Prof. Si Xujie. I previously worked on LLM evaluation in Jimmy Ba's group, guided by Silviu Pitis, Michael Zhang, and Pashootan Vaezipoor, among others.

I love music 🎵, beatboxing 🎤, table tennis 🏓, and bodybuilding 🏋️. Feel free to connect if anything comes to mind or you just want to say hi!

Email GitHub LinkedIn

Selected Publications

Report Cards: Qualitative Evaluation of Language Models Using Natural Language Summaries

Blair Yang *, Fuyang Cui *, Keiran Paster, Jimmy Ba, Pashootan Vaezipoor, Silviu Pitis, Michael R. Zhang

NeurIPS 2024 SoLaR Workshop (Spotlight) · 2024

An automated framework for qualitatively evaluating LLMs on specialized, open-ended, and agentic tasks.

PDF Website

Notes & Projects

Dec 30, 2025

Multimodal Retrieval for Automated Music Video Synthesis

A framework for automating music video editing by bridging the 'semantic gap' between abstract lyrics and concrete footage using Multi-Stream Synergy Retrieval.

Dec 5, 2025

Beyond the Fork: Is High Entropy Enough?

Investigating 'Silent Errors' in LLM reasoning, where models are confident but wrong.

Dec 1, 2025

CausalPool: Do VLMs Learn Physics or Just Match Patterns?

Investigating the 'Causal Disconnect' in Vision-Language Models through counterfactual supervision.
