Abstract: Object detection using visible-infrared images has become increasingly crucial for all-day applications of uncrewed aerial vehicles (UAVs). However, existing multimodal detection methods ...
Abstract: Referring video object segmentation (RVOS) seeks to segment the objects in a video that are referred to by linguistic expressions. Existing RVOS solutions follow a "fuse then select" paradigm: ...