Abstract: Human-gaze–target prediction aims to predict the target point or object that humans are looking at in images. However, existing methods predominantly rely on vision-only features, which ...