We use cookies to improve your experience with our site.
逯波, 王国仁, 袁野. 用于大范围跨媒体检索的一种新方法[J]. 计算机科学技术学报, 2012, 27(6): 1140-1149. DOI: 10.1007/s11390-012-1292-2
引用本文: 逯波, 王国仁, 袁野. 用于大范围跨媒体检索的一种新方法[J]. 计算机科学技术学报, 2012, 27(6): 1140-1149. DOI: 10.1007/s11390-012-1292-2
Bo Lu, Guo-Ren Wang, Ye Yuan. A Novel Approach Towards Large Scale Cross-Media Retrieval[J]. Journal of Computer Science and Technology, 2012, 27(6): 1140-1149. DOI: 10.1007/s11390-012-1292-2
Citation: Bo Lu, Guo-Ren Wang, Ye Yuan. A Novel Approach Towards Large Scale Cross-Media Retrieval[J]. Journal of Computer Science and Technology, 2012, 27(6): 1140-1149. DOI: 10.1007/s11390-012-1292-2

用于大范围跨媒体检索的一种新方法

A Novel Approach Towards Large Scale Cross-Media Retrieval

  • 摘要: 随着因特网和多媒体技术的快速发展,跨媒体检索收到越来越多的关注.一般情况下,跨媒体检索是指用户发送一个查询媒体对象,检索出与查询对象相关的其多模态的媒体对象. 然而,多模态的复杂性和异构性使得跨模态检索面临着两大挑战.(1)如何为多模态的媒体对象建造一个统一并且紧凑的模型,(2)当面临大规模的跨媒体数据集时,如何有效的改善检索性能. 在本文中,我们提出了一种新颖的方法来解决这两个难题.首先,利用多模态媒体对象之间的语义关联信息来建造一个多模态语义关系图.然后,将多模态语义关系图中的所有媒体对象映射到一个同构的语义空间中.进一步的,基于不同的媒体对象有着不同的数据分布,提出一个有效的索引结构MK-tree来管理语义空间中的媒体对象,并进一步的改善跨媒体检索的性能.通过在真实的大规模跨媒体数据集上的实验表明,本文提出的方法极大的改善了跨媒体检索的准确性和有效性.

     

    Abstract: With the rapid development of Internet and multimedia technology, cross-media retrieval is concerned to retrieve all the related media objects with multi-modality by submitting a query media object. Unfortunately, the complexity and the heterogeneity of multi-modality have posed the following two major challenges for cross-media retrieval: 1) how to construct a unified and compact model for media objects with multi-modality, 2) how to improve the performance of retrieval for large scale cross-media database. In this paper, we propose a novel method which is dedicate to solving these issues to achieve effective and accurate cross-media retrieval. Firstly, a multi-modality semantic relationship graph (MSRG) is constructed using the semantic correlation amongst the media objects with multi-modality. Secondly, all the media objects in MSRG are mapped onto an isomorphic semantic space. Further, an efficient indexing MK-tree based on heterogeneous data distribution is proposed to manage the media objects within the semantic space and improve the performance of cross-media retrieval. Extensive experiments on real large scale cross-media datasets indicate that our proposal dramatically improves the accuracy and efficiency of cross-media retrieval, outperforming the existing methods significantly.

     

/

返回文章
返回