Journal of Computer Science and Technology

   

Hardware Acceleration for SLAM in Mobile Systems

Zhe Fan1,2,3, Yi-Fan Hao1,3, Tian Zhi1,3, Member, CCF, Qi Guo1, Member, CCF, ACM, IEEE, and Zi-Dong Du1,3,4,*, Member, CCF, ACM, IEEE   

  1. 1State Key Laboratory of Computer Architecture, Institute of Computing Technology, Chinese Academy of Sciences, Beijing 100190, China
    2School of Computer Science and Technology, University of Chinese Academy of Sciences, Beijing 100049, China
    3Cambricon Technologies, Beijing 100191, China
    4Beijing Academy of Artificial Intelligence, Beijing 100084, China

The emerging mobile robot industry has spurred a flurry of interest in solving the simultaneous localization and mapping (SLAM) problem. However, existing SLAM platforms have difficulty in meeting the real-time and low-power requirements imposed by mobile systems. Though specialized hardware is promising with regard to achieving high performance and lowering the power, designing an efficient accelerator for SLAM is severely hindered by a wide variety of SLAM algorithms. Based on our detailed analysis of representative SLAM algorithms, we observe that SLAM algorithms advance two challenges for designing efficient hardware accelerators: the large number of computational primitives and irregular control flows. To address these two challenges, we propose a hardware accelerator that features composable computation units classified as the matrix, vector, scalar, and control units. Also, we design a hierarchical instruction set for coping with a broad range of SLAM algorithms with irregular control flows. Experimental results show that, compared against an Intel x86 processor, on average, our accelerator with the area of 7.41mm2 achieves 10.52x and 112.62x better performance and energy saving, respectively, across different datasets. Compared against a more energy efficient ARM Cortex processor, our accelerator still achieves 33.03x and 62.64x better performance and energy saving, respectively.


中文摘要

同步定位与建图(SLAM)是机器智能化道路上的一个重要问题,在移动智能系统中更是扮演着重要角色。各种SLAM算法被提出解决该问题,但现存硬件平台在运行SLAM算法时难以满足移动系统对实时性和低功耗的要求。虽然设计专门的硬件来解决性能和功耗问题是十分有前景的,但是SLAM算法的多样性对这样的硬件设计提出了巨大的挑战。目前相关硬件工作都仅仅针对特定的SLAM算法或SLAM算法中特定阶段进行加速,却无法涵盖多类SLAM算法。本文旨在设计出一种既能够满足实时性、低功耗要求,又能支持各种SLAM算法的通用SLAM硬件加速器。为了实现这一目的,我们分析了几类有代表性的SLAM算法,发现SLAM算法中存在大量不同的运算模式且具有不规整的控制流。于是我们设计了一个包含矩阵、向量、标量三种不同粒度运算单元的加速器以覆盖SLAM算法多样的运算模式;同时设计了一种层次化的指令集,简化SLAM算法的控制流并支持更多SLAM算法。我们以几种有代表性的SLAM算法作为测试用例,以Intel i7-3770和ARM Cortex A57作为比较对象进行实验。结果表明,相对于Intel处理器性能,本文加速器性能提升10.52倍,能耗减少112.62倍;相对于ARM处理器,本文加速器性能提升33.03倍,能耗减少62.64倍。


Key words: hardware accelerator, instruction set, mobile system, simultaneous localization and mapping (SLAM) algorithm

;

No related articles found!
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
[1] Zhou Di;. A Recovery Technique for Distributed Communicating Process Systems[J]. , 1986, 1(2): 34 -43 .
[2] Feng Yulin;. Recursive Implementation of VLSI Circuits[J]. , 1986, 1(2): 72 -82 .
[3] Jin Lan; Yang Yuanyuan;. A Modified Version of Chordal Ring[J]. , 1986, 1(3): 15 -32 .
[4] Wang Jianchao; Wei Daozheng;. An Effective Test Generation Algorithm for Combinational Circuits[J]. , 1986, 1(4): 1 -16 .
[5] Min Yinghua; Han Zhide;. A Built-in Test Pattern Generator[J]. , 1986, 1(4): 62 -74 .
[6] Wu Yunzeng;. On the Development of Applications of Logic in Programming[J]. , 1987, 2(1): 30 -34 .
[7] Zhong Renbao; Xing Lin; Ren Zhaoyang;. An Interactive System SDI on Microcomputer[J]. , 1987, 2(1): 64 -71 .
[8] Xia Peisu; Fang Xinwo; Wang Yuxiang; Yan Kaiming; Zhang Tingjun; Liu Yulan; Zhao Chunying; Sun Jizhong;. Design of Array Processor Systems[J]. , 1987, 2(3): 163 -173 .
[9] S. T. Chanson; L. Liang; A. Kumar. Throughput Models of CSMA Network with Stations Uniformly Distributed along the Bus[J]. , 1987, 2(4): 243 -264 .
[10] Qi Yulu;. A Systolic Approach for an Improvement of a Finite Field Multiplier[J]. , 1987, 2(4): 303 -309 .

ISSN 1000-9000(Print)

         1860-4749(Online)
CN 11-2296/TP

Home
Editorial Board
Author Guidelines
Subscription
Journal of Computer Science and Technology
Institute of Computing Technology, Chinese Academy of Sciences
P.O. Box 2704, Beijing 100190 P.R. China
Tel.:86-10-62610746
E-mail: jcst@ict.ac.cn
 
  Copyright ©2015 JCST, All Rights Reserved