Can Large Vision-Language Models Correct Grounding Errors and Reason By Themselves?

Yuan-Hong Liao , Rafid Mahmood , Sanja Fidler , David Acuna

January, 2025

Preprint

Type

Conference paper

Publication

CVPR 2025