Scaling Deep Learning workloads: NVIDIA DGX-1/Pascal and Intel Knights Landing
- Pacific Northwest National Lab. (PNNL), Richland, WA (United States)
Deep Learning (DL) algorithms have become ubiquitous in data analytics. As a result, major computing vendors, including NVIDIA, Intel, AMD, and IBM, have architectural road maps influenced by DL workloads. Furthermore, several vendors have recently advertised new computing products as accelerating large DL workloads. Unfortunately, it is difficult for data scientists to quantify the potential of these different products. This article provides a performance and power analysis of important DL workloads on two major parallel architectures: NVIDIA DGX-1 (eight Pascal P100 GPUs interconnected with NVLink) and Intel Knights Landing (KNL) CPUs interconnected with Intel Omni-Path or Cray Aries. Our evaluation consists of a cross section of convolutional neural network workloads: the CifarNet, AlexNet, GoogLeNet, and ResNet50 topologies using the Cifar10 and ImageNet datasets. The workloads are vendor-optimized for each architecture. We use sequentially equivalent implementations to maintain iso-accuracy between parallel and sequential DL models. Our analysis indicates that although GPUs provide the highest overall performance, the gap can close for some convolutional networks, and the KNL can be competitive in performance per watt. We find that NVLink facilitates scaling efficiency on GPUs; however, its importance is heavily dependent on neural network architecture. Furthermore, for weak scaling, which is sometimes encouraged by restricted GPU memory, NVLink is less important.
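The abstract refers to scaling efficiency, weak scaling, and performance per watt without defining them. The sketch below gives the conventional textbook definitions in Python. The function names and the sample numbers are illustrative assumptions only; they are not code, measurements, or results from the article.

```python
def strong_scaling_efficiency(t_one: float, t_n: float, n: int) -> float:
    """Strong scaling: the total problem size is fixed.

    Ideal time on n devices is t_one / n, so efficiency is
    (t_one / n) / t_n = t_one / (n * t_n).
    """
    return t_one / (n * t_n)


def weak_scaling_efficiency(t_one: float, t_n: float) -> float:
    """Weak scaling: the per-device problem size is fixed.

    Ideal time on n devices equals the single-device time,
    so efficiency is t_one / t_n.
    """
    return t_one / t_n


def performance_per_watt(samples_per_second: float, watts: float) -> float:
    """Throughput normalized by average power draw."""
    return samples_per_second / watts


# Hypothetical inputs, chosen only to exercise the formulas.
print(strong_scaling_efficiency(t_one=100.0, t_n=25.0, n=8))
print(weak_scaling_efficiency(t_one=100.0, t_n=125.0))
print(performance_per_watt(samples_per_second=3000.0, watts=1500.0))
```

Under these definitions, an efficiency of 1.0 corresponds to ideal scaling, and values below 1.0 reflect communication or synchronization overhead, which is where an interconnect such as NVLink would be expected to matter.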
- Research Organization:
- Pacific Northwest National Laboratory (PNNL), Richland, WA (United States)
- Sponsoring Organization:
- USDOE Office of Science (SC), Advanced Scientific Computing Research (ASCR)
- Grant/Contract Number:
- AC05-76RL01830
- OSTI ID:
- 1617450
- Alternate ID(s):
- OSTI ID: 1778383
- Report Number(s):
- PNNL-SA-134513
- Journal Information:
- Future Generation Computer Systems, Vol. 108, Issue C; ISSN 0167-739X
- Publisher:
- Elsevier
- Country of Publication:
- United States
- Language:
- English