Abstract: Zero-shot semantic segmentation still struggles to handle unseen object classes, despite its critical applications in medical imaging, autonomous driving, and ...
Abstract: Visual simultaneous localization and mapping (SLAM) systems that assume static environments often struggle with dynamic objects, resulting in degraded localization robustness. To address ...
A fundamental challenge for GUI agents is robustly grounding natural language instructions, which requires not only precise spatial alignment (locating elements accurately) but also correct semantic ...