We use cookies to improve your experience with our site.

Indexed in:

SCIE, EI, Scopus, INSPEC, DBLP, CSCD, etc.

Submission System
(Author / Reviewer / Editor)
Tam V. Nguyen, Jiashi Feng, Shuicheng Yan. Seeing Human Weight from a Single RGB-D Image[J]. Journal of Computer Science and Technology, 2014, 29(5): 777-784. DOI: 10.1007/s11390-014-1467-0
Citation: Tam V. Nguyen, Jiashi Feng, Shuicheng Yan. Seeing Human Weight from a Single RGB-D Image[J]. Journal of Computer Science and Technology, 2014, 29(5): 777-784. DOI: 10.1007/s11390-014-1467-0

Seeing Human Weight from a Single RGB-D Image

Funds: This work is partially supported by Singapore Ministry of Education under Research Grant No. MOE2012-TIF-2-G-016, and also partially by the National Natural Science Foundation of China under Grant No. 61328205.
More Information
  • Author Bio:

    Tam V. Nguyen is a research scientist at ARTIC Centre,Department for Technology, Innovation and Enterprise, SingaporePolytechnic. He obtained his Ph.D. degree in electrical and computerengineering from National University of Singapore in 2013. Prior tothat, he obtained his M.E. and B.S. degrees from Chonnam NationalUniversity, South Korea, in 2009, andUniversity of Science, Vietnam, in 2005, respectively. His researchinterests include computer vision, multimedia and machine learning.He is the recipient of numerous awards including the Best PaperAward at NUS GSS 2013, the 2nd prize winner of ICPR 2012 Contest onAction Recognition, and the best technical demonstration from ACMMM 2012.

  • Received Date: December 24, 2013
  • Revised Date: July 03, 2014
  • Published Date: September 04, 2014
  • Human weight estimation is useful in a variety of potential applications, e.g., targeted advertisement, entertainment scenarios and forensic science. However, estimating weight only from color cues is particularly challenging since these cues are quite sensitive to lighting and imaging conditions. In this article, we propose a novel weight estimator based on a single RGB-D image, which utilizes the visual color cues and depth information. Our main contributions are three-fold. First, we construct the W8-RGBD dataset including RGB-D images of different people with ground truth weight. Second, the novel sideview shape feature and the feature fusion model are proposed to facilitate weight estimation. Additionally, we consider gender as another important factor for human weight estimation. Third, we conduct comprehensive experiments using various regression models and feature fusion models on the new weight dataset, and encouraging results are obtained based on the proposed features and models.
  • [1]
    Velardo C, Dugelay J L. Weight estimation from visual body appearance. In Proc. the 4th IEEE International Conference on Biometrics: Theory, Applications and Systems, Sept. 2010, pp.1-6.
    [2]
    Buckley R G, Stehman C R, DosSantos F L et al. Bedside method to estimate actual body weight in the emergency department. The Journal of Emergency Medicine, 2012, 42(1): 100-104.
    [3]
    Bloomfield R, Steel E, MacLennan G, Noble D W. Accuracy of weight and height estimation in an intensive care unit: Implications for clinical practice and research. Critical Care Medicine, 2006, 34(8): 2153-2157.
    [4]
    Weise T, Bouaziz S, Li H, Pauly M. Realtime performancebased facial animation. ACM Transactions on Graphics, 2011, 30(4): Article No.77.
    [5]
    Shotton J, Fitzgibbon A, Cook M et al. Real-time human pose recognition in parts from single depth images. In Proc. IEEE Conference on Computer Vision and Pattern Recognition, June 2011, pp.1297-1304.
    [6]
    Xia L, Chen C C, Aggarwal J K. Human detection using depth information by Kinect. In Proc. the 2011 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops, June 2011, pp.15-22.
    [7]
    Sun C, Zhang T, Bao B, Xu C, Mei T. Discriminative exemplar coding for sign language recognition with Kinect. IEEE Transactions on Cybernetics, 2013, 43(5): 1418-1428.
    [8]
    Sun C, Zhang T, Bao B K, Xu C. Latent support vector machine for sign language recognition with Kinect. In Proc. the 20th IEEE International Conference on Image Processing, Sept. 2013, pp.4190-4194.
    [9]
    Liu S, Nguyen T, Feng J et al. Hi, magic closet, tell me what to wear! In Proc. the 20th ACM Multimedia, Oct.29-Nov.2, 2012, pp.1333-1334.
    [10]
    Velardo C, Dugelay J, Paleari M, Ariano P. Building the space scale or how to weight a person with no gravity. In Proc. International Conference on Emerging Signal Processing Applications, Jan. 2012, pp.67-70.
    [11]
    Dalal N, Triggs B. Histograms of oriented gradients for human detection. In Proc. IEEE Computer Society Conference on Computer Vision and Pattern Recognition, June 2005, pp.886-893.
    [12]
    Mikolajczyk K, Schmid C, Zisserman A. Human detection based on a probabilistic assembly of robust part detectors. In Proc. the 8th European Conference on Computer Vision, May 2004, pp.69-82.
    [13]
    Basso F, Munaro M, Michieletto S et al. Fast and robust multi-people tracking from RGB-D data for a mobile robot. Advances in Intelligent Systems and Computing, 2013, 193: 265-276. 784 J. Comput. Sci. & Technol., Sept. 2014, Vol.29, No.5
    [14]
    Spinello L, Arras K. Leveraging RGB-D data: Adaptive fusion and domain adaptation for object detection. In Proc. IEEE International Conference on Robotics and Automation, May 2012, pp.4469-4474.
    [15]
    Janoch A, Karayev S, Jia Y et al. A category-level 3-D object dataset: Putting the Kinect to work. In Proc. IEEE International Conference on Computer Vision Workshops, Nov. 2011, pp.1168-1174.
    [16]
    Achanta R, Shaji A, Smith K et al. SLIC superpixels compared to state-of-the-art superpixel methods. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2012, 34(11): 2274-2282.
    [17]
    Gonzalez R, Woods R. Digital Image Processing. AddisonWesley Pub., 1992.
    [18]
    Jabid T, Kabir M H, Chae O. Gender classification using local directional pattern (LDP). In Proc. the 20th International Conference on Pattern Recognition, Aug. 2010, pp.2162-2165.
    [19]
    Viola P, Jones M. Robust real-time face detection. International Journal of Computer Vision, 2004, 57(2): 137-154.
    [20]
    Chang C C, Lin C J. LIBSVM: A library for support vector machines. ACM Transactions on Intelligent Systems and Technology, 2011, 2(3): Article No.27.
    [21]
    Guo G, Mu G, Fu Y, Huang T S. Human age estimation using bio-inspired features. In Proc. IEEE Conference on Computer Vision and Pattern Recognition, June 2009, pp.112-119.
    [22]
    Yang M, Zhu S, Lv F, Yu K. Correspondence driven adaptation for human profile recognition. In Proc. the 24th IEEE Conference on Computer Vision and Pattern Recognition, June 2011, pp.505-512.
  • Related Articles

    [1]Xiao-Long Yang, Xiao-Hong Jia, Yuan Liang, Lu-Bin Fan. 6D Object Pose Estimation in Cluttered Scenes from RGB Images[J]. Journal of Computer Science and Technology, 2022, 37(3): 719-730. DOI: 10.1007/s11390-021-1311-2
    [2]Jin-Gong Jia, Yuan-Feng Zhou, Xing-Wei Hao, Feng Li, Christian Desrosiers, Cai-Ming Zhang. Two-Stream Temporal Convolutional Networks for Skeleton-Based Human Action Recognition[J]. Journal of Computer Science and Technology, 2020, 35(3): 538-550. DOI: 10.1007/s11390-020-0405-6
    [3]Yan Zheng, Jian-Ye Hao, Zong-Zhang Zhang, Zhao-Peng Meng, Xiao-Tian Hao. Efficient Multiagent Policy Optimization Based on Weighted Estimators in Stochastic Cooperative Environments[J]. Journal of Computer Science and Technology, 2020, 35(2): 268-280. DOI: 10.1007/s11390-020-9967-6
    [4]Bo Ren, Jia-Cheng Wu, Ya-Lei Lv, Ming-Ming Cheng, Shao-Ping Lu. Geometry-Aware ICP for Scene Reconstruction from RGB-D Camera[J]. Journal of Computer Science and Technology, 2019, 34(3): 581-593. DOI: 10.1007/s11390-019-1928-6
    [5]Nai-Ming Yao, Hui Chen, Qing-Pei Guo, Hong-An Wang. Non-Frontal Facial Expression Recognition Using a Depth-Patch Based Deep Neural Network[J]. Journal of Computer Science and Technology, 2017, 32(6): 1172-1185. DOI: 10.1007/s11390-017-1792-1
    [6]Shihong Xia, Lin Gao, Yu-Kun Lai, Ming-Ze Yuan, Jinxiang Chai. A Survey on Human Performance Capture and Animation[J]. Journal of Computer Science and Technology, 2017, 32(3): 536-554. DOI: 10.1007/s11390-017-1742-y
    [7]Xiong Lv, Shu-Qiang Jiang, Luis Herranz, Shuang Wang. RGB-D Hand-Held Object Recognition Based on Heterogeneous Feature Fusion[J]. Journal of Computer Science and Technology, 2015, 30(2): 340-352. DOI: 10.1007/s11390-015-1527-0
    [8]Wei Guoqing, Ma Songde. 3D Motion Estimation and Motion Fusion by Affine Region Matching[J]. Journal of Computer Science and Technology, 1993, 8(1): 17-25.
    [9]Xiang Dong, Wei Daozheng, Chen Shisong. Probabilistic Models for Estimation of Random and Pseudo-Random Test Length[J]. Journal of Computer Science and Technology, 1992, 7(2): 164-174.
    [10]Zhang Fuyan, Cai Shijie, Wang Shu, Ge Ruding. The Human-Computer Dialogue Management of FCAD System[J]. Journal of Computer Science and Technology, 1988, 3(3): 221-227.

Catalog

    Article views (51) PDF downloads (1685) Cited by()
    Related

    /

    DownLoad:  Full-Size Img  PowerPoint
    Return
    Return