DACAPO: Accelerating Continuous Learning in Autonomous Systems for Video Analytics

  • Yoonsung Kim
  • , Changhun Oh
  • , Jinwoo Hwang
  • , Wonung Kim
  • , Seongryong Oh
  • , Yubin Lee
  • , Hardik Sharma
  • , Amir Yazdanbakhsh
  • , Jongse Park

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

9 Scopus citations

Abstract

Deep neural network (DNN) video analytics is crucial for autonomous systems such as self-driving vehicles, unmanned aerial vehicles (UAVs), and security robots. However, real-world deployment faces challenges due to their limited computational resources and battery power. To tackle these challenges, continuous learning exploits a lightweight 'student' model at deployment (inference), leverages a larger 'teacher' model for labeling sampled data (labeling), and continuously retrains the student model to adapt to changing scenarios (retraining). This paper highlights the limitations in state-of-theart continuous learning systems: (1) they focus on computations for retraining, while overlooking the compute needs for inference and labeling, (2) they rely on power-hungry GPUs, unsuitable for battery-operated autonomous systems, and (3) they are located on a remote centralized server, intended for multi-tenant scenarios, again unsuitable for autonomous systems due to privacy, network availability, and latency concerns. We propose a hardwarealgorithm co-designed solution for continuous learning, DACAPO, that enables autonomous systems to perform concurrent executions of inference, labeling, and retraining in a performant and energy-efficient manner. DACapo comprises (1) a spatiallypartitionable and precision-flexible accelerator enabling parallel execution of kernels on sub-accelerators at their respective precisions, and (2) a spatiotemporal resource allocation algorithm that strategically navigates the resource-accuracy tradeoff space, facilitating optimal decisions for resource allocation to achieve maximal accuracy. Our evaluation shows that DACAPO achieves 6. 5% and 5. 5% higher accuracy than a state-of-theart GPU-based continuous learning systems, Ekya and EOMU, respectively, while consuming 254 × less power.

Original languageEnglish
Title of host publicationProceeding - 2024 ACM/IEEE 51st Annual International Symposium on Computer Architecture, ISCA 2024
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages1246-1261
Number of pages16
ISBN (Electronic)9798350326581
DOIs
StatePublished - 2024
Event51st ACM/IEEE Annual International Symposium on Computer Architecture, ISCA 2024 - Buenos Aires, Argentina
Duration: 29 Jun 20243 Jul 2024

Publication series

NameProceedings - International Symposium on Computer Architecture
ISSN (Print)1063-6897
ISSN (Electronic)2575-713X

Conference

Conference51st ACM/IEEE Annual International Symposium on Computer Architecture, ISCA 2024
Country/TerritoryArgentina
CityBuenos Aires
Period29/06/243/07/24

Fingerprint

Dive into the research topics of 'DACAPO: Accelerating Continuous Learning in Autonomous Systems for Video Analytics'. Together they form a unique fingerprint.

Cite this