Mittal, Sparsh and Vaishay, Shraiysh
(2019)
A Survey of Techniques for Optimizing Deep Learning on GPUs.
Journal of Systems Architecture.
p. 101635.
ISSN 13837621
Full text not available from this repository.
(
Request a copy)
Abstract
The rise of deep-learning (DL) has been fuelled by the improvements in accelerators. Due to its unique features, the GPU continues to remain the most widely used accelerator for DL applications. In this paper, we present a survey of architecture and system-level techniques for optimizing DL applications on GPUs. We review techniques for both inference and training and for both single GPU and distributed system with multiple GPUs. We bring out the similarities and differences of different works and highlight their key attributes. This survey will be useful for both novice and experts in the field of machine learning, processor architecture and high-performance computing.
Actions (login required)
|
View Item |