Thư viện trường đại học Phenikaa: Search

HOME

HOME BROWSE HELP CONTACT

Search

Author

Subject

NVIDIA Denver2 (1)

Date issued

2023 (1)

Has File(s)

true (1)

Search Results

Results 1-1 of 1 (Search time: 0.001 seconds).

Performance–energy trade-offs of deep learning convolution algorithms on ARM processors

Authors: Manuel F., Dolz; Sergio, Barrachina; Héctor, Martínez; Advisor: -; Co-Author: - (2023)

In this work, we assess the performance and energy efficiency of high-performance codes for the convolution operator, based on the direct, explicit/implicit lowering and Winograd algorithms used for deep learning (DL) inference on a series of ARM-based processor architectures. Specifically, we evaluate the NVIDIA Denver2 and Carmel processors, as well as the ARM Cortex-A57 and Cortex-A78AE CPUs as part of a recent set of NVIDIA Jetson platforms. The performance–energy evaluation is carried out using the ResNet-50 v1.5 convolutional neural network (CNN) on varying configurations of convolution algorithms, number of threads/cores, and operating frequencies on the tested processor cores. The results demonstrate that the best throughput is obtained on all platforms with the Winograd con...