Multilinguality, Language Diversity, and Low-Resource Languages
Annie En-Shiun Lee is an Assistant Professor at Ontario Tech University (OTU) and a Status-only Assistant Professor at University of Toronto. Her goal is to make language technology as inclusive and accessible to as many people as possible. She directs the Lee Language Lab (L³), focusing on language diversity, multilinguality, and multiculturalism, aligning with OTU’s vision for “Tech with a Conscience”. Hers research has been published in Nature Digital Medicine, ACM Computing Surveys, *ACL, SIGCSE, IEEE TKDE, and Bioinformatics.
Dr. Lee is the demo co-chair for NAACL 2024 and has received numerous recognitions, including Outstanding Paper Award and Best Theme Paper Award at NAACL 2025, Women in AI Researcher of the Year Award 2025. for MScAC 2024, as well as nominations for the Tim McTiernan Student Mentorship Award 2025 and Women in AI Researcher of the Year Award 2025.
She was an Assistant Professor (teaching stream) at the University of Toronto for the MScAC. She earned her PhD from the University of Waterloo, was a visiting researcher at the Fields Institute and the Chinese University of Hong Kong, and worked as a research scientist at VerticalScope (Research lead) and Stradigi AI.
A lightweight performance proxy that predicts LM accuracy using ~30× less compute. Enables faster model selection, fine-tuning, and prompt iteration while maintaining high predictive reliability.
Freezes early transformer layers to preserve syntactic knowledge during fine-tuning. Boosts zero-shot and cross-domain performance with minimal additional training, improving stability and efficiency.
1.2M image–question pairs across 30 languages, capturing global culinary knowledge. A benchmark for cross-cultural multimodal reasoning with applications in cultural AI research.
Award-winning Teaching NLP workshop on empowering multilinguality in NLP education. Showcases teaching strategies, open resources, and collaborative projects funded by NSERC USRA and the Fields Institute.
Expanded typological and geographic language database with improved NLP integration. Includes a Python package for multilingual benchmarking, cross-lingual transfer, and dataset alignment.
Multimodal app for Taiwanese Hokkien featuring speech, text, and audio examples. Developed as a student-led project and awarded the Student Engagement Award by U of T Computer Science.
Interactive translation quality assistant that detects and corrects machine translation errors. Enhances translator efficiency while maintaining linguistic fluency and semantic accuracy.
This project aims to strengthen the multilingual understanding and reasoning capabilities of large language models (LLMs), with a focus on low-resource languages.
Browse the complete, up-to-date publication list, citations, and co-authors.
Choose the path that fits you. Please carefully follow the instructions below:
Propose joint research, co-supervision, visiting positions, R&D projects, evaluations, or joint award submissions with L³.
Research and project opportunities for undergraduate students with L³.
Application process and requirements for graduate study with L³.
For former L³ students requesting an academic reference letter.