Cross-modal information retrieval refers to the process of linking and querying data across distinct modalities, such as images, text, audio, and video. This field addresses the inherent semantic gap ...
Beijing Zhongke Journal Publishing Co. Ltd. With the popularization of social networks, different modalities of data such as images, text, and audio are growing rapidly on the Internet. Subsequently, ...
The image-sentence retrieval task aims to search for images given sentences and to retrieve sentences given image queries. Current retrieval methods are all supervised methods that require a large number ...
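
The two retrieval directions described above can be illustrated with a minimal sketch. It assumes that images and sentences have already been mapped into one shared embedding space by some encoder, which is not shown here. The vectors, the helper names `cosine` and `rank`, and the choice of cosine similarity are all illustrative assumptions, not the method of any of the works summarized above.

```python
import math

def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)

def rank(query, candidates):
    """Return candidate indices ordered from most to least similar to the query."""
    scores = [cosine(query, c) for c in candidates]
    return sorted(range(len(candidates)), key=lambda i: scores[i], reverse=True)

# Toy embeddings, made up for illustration; assumed to share one space.
image_embeddings = [
    [0.9, 0.1, 0.0],  # image 0
    [0.1, 0.9, 0.1],  # image 1
    [0.0, 0.2, 0.9],  # image 2
]
sentence_embeddings = [
    [0.8, 0.2, 0.1],  # sentence 0
    [0.0, 0.1, 1.0],  # sentence 1
    [0.2, 0.8, 0.0],  # sentence 2
]

# Sentence-to-image retrieval: best-matching image for each sentence query.
sentence_to_image = [rank(s, image_embeddings)[0] for s in sentence_embeddings]

# Image-to-sentence retrieval: best-matching sentence for each image query.
image_to_sentence = [rank(i, sentence_embeddings)[0] for i in image_embeddings]

print(sentence_to_image)
print(image_to_sentence)
```

The same ranking function serves both directions because both modalities are assumed to live in one space; only the roles of query and candidate set are swapped.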