We use cookies to improve your experience with our site.

一种基于内容和文本的高效图像检索方法

CATIRI: An Efficient Method for Content-and-Text Based Image Retrieval

  • 摘要: 在图像检索中结合视觉和文本信息可以有效地减轻传统技术的语义鸿沟问题,因此最近受到了大量的关注,基于这种结合方式的图像检索也被称为基于内容和文本的图像检索(CTBIR)。然而,据我们所知,这方面现有的工作多集中于提高图像的检索质量,对于如何提高检索效率却鲜有提及。如今,图像数据在我们的日常生活中被广泛使用,数据规模急剧扩大,因此对图像检索效率的研究具有重要的意义和价值。这篇文章提出了一种高效的图像检索方法,名为CATIRI,该方法使用一种三段式解决方案框架,核心是一种新型的索引结构MHIM-tree。MHIM-tree集成了曼哈顿哈希方法和倒排索引、M-tree等多种结构。为了在查询中使用此索引MHIM-tree,我们提出了一组重要的度量指标,显示了它们的内在性质。并基于MHIM-tree和这些度量,设计了一种top-k查询算法来完成基于内容和文本的图像检索。基于测试数据集的实验结果说明,CATIRI方法的检索效率比竞争算法要高将近一个数量级。

     

    Abstract: The combination of visual and textual information in image retrieval remarkably alleviates the semantic gap of traditional image retrieval methods, and thus it has attracted much attention recently. Image retrieval based on such a combination is usually called the content-and-text based image retrieval (CTBIR). Nevertheless, existing studies in CTBIR mainly make efforts on improving the retrieval quality. To the best of our knowledge, little attention has been focused on how to enhance the retrieval efficiency. Nowadays, image data is widespread and expanding rapidly in our daily life. Obviously, it is important and interesting to investigate the retrieval efficiency. To this end, this paper presents an efficient image retrieval method named CATIRI (content-and-text based image retrieval using indexing). CATIRI follows a three-phase solution framework that develops a new indexing structure called MHIM-tree. The MHIM-tree seamlessly integrates several elements including Manhattan Hashing, Inverted index, and M-tree. To use our MHIM-tree wisely in the query, we present a set of important metrics and reveal their inherent properties. Based on them, we develop a top-k query algorithm for CTBIR. Experimental results based on benchmark image datasets demonstrate that CATIRI outperforms the competitors by an order of magnitude.

     

/

返回文章
返回