Item Infomation

Full metadata record
DC FieldValueLanguage
dc.contributor.authorGuillermo, Alaejos-
dc.contributor.authorAdrián, Castelló-
dc.contributor.authorHéctor, Martínez-
dc.date.accessioned2023-03-30T04:06:20Z-
dc.date.available2023-03-30T04:06:20Z-
dc.date.issued2023-
dc.identifier.urihttps://link.springer.com/article/10.1007/s11227-022-05003-3-
dc.identifier.urihttps://dlib.phenikaa-uni.edu.vn/handle/PNK/7328-
dc.descriptionCC BYvi
dc.description.abstractOur work exposes the structure of the template-based micro-kernels for ARM Neon (128-bit SIMD), ARM SVE (variable-length SIMD) and Intel AVX512 (512-bit SIMD), showing considerable performance for an NVIDIA Carmel processor (ARM Neon), a Fujitsu A64FX processor (ARM SVE) and on an AMD EPYC 7282 processor (256-bit SIMD).vi
dc.language.isoenvi
dc.publisherSpringervi
dc.subjecttemplate-based micro-kernelsvi
dc.subjectAMD EPYCvi
dc.titleMicro-kernels for portable and efficient matrix multiplication in deep learningvi
dc.typeBookvi
Appears in CollectionsOER - Công nghệ thông tin

Files in This Item: