MODI: Mobile Deep Inference Made Efficient by Edge Computing

Samuel S. Ogden, Tian Guo

Research output: Contribution to journal › Article › peer-review

Abstract

In this paper, we propose a novel mobile deep inference platform, MODI, that delivers good inference performance. MODI improves the performance of deep learning-powered mobile applications with optimizations in three complementary aspects. First, MODI provides a number of models and dynamically selects the best one at runtime. Second, MODI extends the set of models each mobile application can use by storing high-quality models on edge servers. Third, MODI manages a centralized model repository and periodically updates models at edge locations, ensuring up-to-date models for mobile applications without incurring high network latency. Our evaluation demonstrates the feasibility of trading off inference accuracy for improved inference speed, as well as the acceptable performance of edge-based inference.
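The runtime model selection the abstract describes can be illustrated with a minimal sketch: choose the most accurate model whose expected latency (including a network round trip for edge-hosted models) fits a latency budget. The model names, accuracy/latency numbers, and the `pick_model` helper below are illustrative assumptions, not MODI's actual implementation.

```python
# Hypothetical sketch of runtime model selection: pick the most
# accurate model whose estimated end-to-end latency fits the budget.
# All model names and numbers are illustrative, not from the paper.

from dataclasses import dataclass

@dataclass
class ModelInfo:
    name: str
    accuracy: float         # e.g., top-1 accuracy on a validation set
    est_latency_ms: float   # measured inference latency
    location: str           # "device" (on-device) or "edge" (edge server)

CANDIDATES = [
    ModelInfo("mobilenet_quantized", 0.68, 40.0, "device"),
    ModelInfo("mobilenet_float", 0.71, 90.0, "device"),
    ModelInfo("inception_v3", 0.78, 300.0, "edge"),
]

def pick_model(latency_budget_ms: float, network_rtt_ms: float) -> ModelInfo:
    """Return the most accurate model that meets the latency budget.

    Edge-hosted models pay an extra network round trip; if nothing
    fits the budget, fall back to the fastest available model.
    """
    def total_latency(m: ModelInfo) -> float:
        extra = network_rtt_ms if m.location == "edge" else 0.0
        return m.est_latency_ms + extra

    feasible = [m for m in CANDIDATES if total_latency(m) <= latency_budget_ms]
    if feasible:
        return max(feasible, key=lambda m: m.accuracy)
    return min(CANDIDATES, key=total_latency)

# Example: with a 150 ms budget and an 80 ms RTT, the edge model is
# too slow, so the on-device float model wins on accuracy.
print(pick_model(latency_budget_ms=150.0, network_rtt_ms=80.0).name)
```

This captures the accuracy-versus-speed trade-off the evaluation explores: a tighter budget or slower network pushes selection toward smaller on-device models, while slack in the budget lets the edge-hosted, higher-accuracy model be used.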
Original language: American English
Journal: USENIX Annual Technical Conference HotEdge Workshop 2018 (HotEdge’18)
State: Published - 2018
Externally published: Yes

Disciplines

  • Computer Sciences
