基于异构特征融合的RGB-D手持物体识别

吕雄; 蒋树强; 王双

doi:10.1007/s11390-015-1527-0

基于异构特征融合的RGB-D手持物体识别

RGB-D Hand-Held Object Recognition Based on Heterogeneous Feature Fusion

摘要

摘要: 物体识别在人机交互和多媒体检索中有很多应用.然而因为巨大的类内多样性和偶发的类间相似性,依赖RGB数据的准确物体识别依然是一个巨大挑战。最近,随着廉价的RGB-D设备出现,利用深度信息可以更好的解决这个挑战。同时在物体识别中一个特殊但是很重要的情况就是手持物体识别,因为使用手操作物体在人与人和人机交互中非常普遍并且直观。在本文中,我们研究了这个问题并且提出了一种有效的解决框架。这个框架首先结合骨骼信息和深度信息检测和分割手中所持物体。然后在物体识别阶段,从不同模态中提取异质特征进行融合以提高识别准确度。特别是我们把手工设计的特征和深度学习特征合并然后研究了几种多步融合方法。我们还介绍了手持物体数据集(Hand-held Object Dataset),并使用它评价手持物体识别的方法。

Abstract: Object recognition has many applications in human-machine interaction and multimedia retrieval. However, due to large intra-class variability and inter-class similarity, accurate recognition relying only on RGB data is still a big challenge. Recently, with the emergence of inexpensive RGB-D devices, this challenge can be better addressed by leveraging additional depth information. A very special yet important case of object recognition is hand-held object recognition, as manipulating objects with hands is common and intuitive in human-human and human-machine interactions. In this paper, we study this problem and introduce an effective framework to address it. This framework first detects and segments the hand-held object by exploiting skeleton information combined with depth information. In the object recognition stage, this work exploits heterogeneous features extracted from different modalities and fuses them to improve the recognition accuracy. In particular, we incorporate handcrafted and deep learned features and study several multi-step fusion variants. Experimental evaluations validate the effectiveness of the proposed method.

HTML全文

参考文献()

施引文献

资源附件()