Publications

Filter by type:
Introspective X Training: Feedback Conditioning Improves Scaling Across all LLM Training Stages

PDF

How to Instruct Your Robot: Dense Language Annotations Power Robot Policy Learning

PDF

DeltaPrompts: Escaping the Zero-Delta Trap in Multimodal Distillation

PDF

Privasis: Synthesizing the Largest 'Public' Private Dataset from Scratch

PDF Project

Golden Goose: A Simple Trick to Synthesize Unlimited RLVR Tasks from Unverifiable Internet Text

PDF Dataset

Long Grounded Thoughts: Synthesizing Visual Problems and Reasoning Chains at Scale

PDF Dataset

Goedel-Prover-V2: Scaling Formal Theorem Proving with Scaffolded Data Synthesis and Self-Correction

PDF Code Project

Socratic-MCTS: Test-Time Visual Reasoning by Asking the Right Questions

PDF

Prismatic Synthesis: Gradient-based Data Diversification Boosts Generalization in LLM Reasoning

PDF

LongPerceptualThoughts: Distilling System-2 Reasoning for System-1 Perception

PDF Code Dataset

Retro-Search: Exploring Untaken Paths for Deeper and Efficient Reasoning

Preprint

Photorealistic Object Insertion with Diffusion-Guided Inverse Rendering

PDF Project

RefFusion: Reference Adapted Diffusion Models for 3D Scene Inpainting

PDF

Structured Domain Randomization: Bridging the Reality Gap by Context-Aware Synthetic Data

PDF Video

Training Deep Networks with Synthetic Data: Bridging the Reality Gap by Domain Randomization

PDF