publications
2025
- Can Vision-Language Models Answer Face to Face Questions in the Real-World? arXiv 2025 (* joint first authors)
2024
- AirLetters: An Open Video Dataset of Characters Drawn in the Air. ECCV HANDS Workshop 2024
- SEE-2-SOUND: Zero-Shot Spatial Environment-to-Spatial Sound. SIGGRAPH Posters 2025, ICML Workshop 2024
2023
- CPPE-5: Medical Personal Protective Equipment Dataset. SN Computer Science 2023