A 55nm, 1.0V-0.4V, 1.25pJ/MAC Time-Domain Mixed-Signal Neuromorphic Accelerator with Stochastic Synapses for Reinforcement Learning in Autonomous Micro-Robots

  • Authors:
    Anvesha Amaravati (Georgia Tech), Saad Bin Nasir (Georgia Tech), Justin Ting (Georgia Tech), Insik Yoon (Georgia Tech), Arijit Raychowdhury (Georgia Tech)
    Publication ID:
    P093524
    Publication Type:
    Paper
    Received Date:
    8-May-2018
    Last Edit Date:
    11-Nov-2018
    Research:
    2777.006 (Purdue University)

Abstract

Reinforcement learning (RL) is a bio-mimetic learning approach where agents can learn about an environment by performing specific tasks, without any human supervision. RL is inspired by behavioral psychology, where agents take actions to maximize a cumulative reward. In this paper, we present an RL neuromorphic accelerator performing obstacle avoidance in a micro-robot at the edge of the cloud. We propose an energy-efficient time-domain mixed-signal (TD-MS) computational framework. In TD-MS computation, we demonstrate that the energy to compute is proportional to the importance of the computation. We leverage the unique properties of stochastic networks and recent advances in Q-learning in the proposed RL implementation. The 55nm test-chip implements RL using a three-layered fully-connected neural network and consumes a peak power of 690uW.

4819 Emperor Blvd, Suite 300 Durham, NC 27703 Voice: (919) 941-9400 Fax: (919) 941-9450

Important Information for the SRC website. This site uses cookies to store information on your computer. By continuing to use our site, you consent to our cookies. If you are not happy with the use of these cookies, please review our Cookie Policy to learn how they can be disabled. By disabling cookies, some features of the site will not work.