Heterogeneous gradient computing optimization for scalable deep neural networks

Moreno Álvarez, Sergio; Paoletti, Mercedes Eugenia; Rico Gallego, Juan Antonio; Haut, Juan M.


dc.contributor.author	Moreno Álvarez, Sergio
dc.contributor.author	Paoletti, Mercedes Eugenia
dc.contributor.author	Rico Gallego, Juan Antonio
dc.contributor.author	Haut, Juan M.
dc.contributor.orcid	https://orcid.org/0000-0003-1030-3729
dc.contributor.orcid	https://orcid.org/0000-0002-4264-7473
dc.contributor.orcid	https://orcid.org/0000-0001-6701-961X
dc.date.accessioned	2024-11-18T11:34:54Z
dc.date.available	2024-11-18T11:34:54Z
dc.date.issued	2022
dc.description	The registered version of this article, first published in "The Journal of Supercomputing, 78, 2022", is available online at the publisher's website: Springer, https://doi.org/10.1007/s11227-022-04399-2
dc.description.abstract	Nowadays, data processing applications based on neural networks must cope with growth in the amount of data to be processed and with increases in both the depth and complexity of neural network architectures, and hence in the number of parameters to be learned. High-performance computing platforms provide fast computing resources, including multi-core processors and graphical processing units, to manage the computational burden of deep neural network applications. A common optimization technique is to distribute the workload among the processes deployed on the resources of the platform. This approach is known as data parallelism. Each process, known as a replica, trains its own copy of the model on a disjoint data partition. Nevertheless, the heterogeneity of the computational resources composing the platform requires distributing the workload unevenly among the replicas according to their computational capabilities, in order to optimize overall execution performance. Since the amount of data to be processed differs between replicas, the influence of the gradients computed by each replica on the global parameter update should also differ. This work proposes a modification of the gradient computation method that takes into account the different speeds of the replicas, and hence the amount of data assigned to each. Experiments were conducted on heterogeneous high-performance computing platforms for a wide range of models and datasets, showing an improvement in final accuracy with respect to current techniques, with comparable performance.	en
dc.description.version	final version
dc.identifier.citation	Sergio Moreno-Álvarez, Mercedes E Paoletti, Juan A Rico-Gallego, Juan M Haut. "Heterogeneous gradient computing optimization for scalable deep neural networks". The Journal of Supercomputing, 78, 11, 19 March 2022, 13455-13469.
dc.identifier.doi	https://doi.org/10.1007/s11227-022-04399-2
dc.identifier.issn	0920-8542
dc.identifier.uri	https://hdl.handle.net/20.500.14468/24403
dc.journal.title	The Journal of Supercomputing
dc.journal.volume	78
dc.language.iso	en
dc.page.final	13469
dc.page.initial	13455
dc.publisher	Springer
dc.relation.center	E.T.S. de Ingeniería Informática
dc.relation.department	Lenguajes y Sistemas Informáticos
dc.rights	info:eu-repo/semantics/openAccess
dc.rights.uri	http://creativecommons.org/licenses/by-nc-nd/4.0/deed.es
dc.subject	12 Mathematics::1203 Computer Science::1203.17 Informatics
dc.subject.keywords	deep learning	en
dc.subject.keywords	deep neural networks	en
dc.subject.keywords	high-performance computing	en
dc.subject.keywords	heterogeneous platforms	en
dc.subject.keywords	distributed training	en
dc.title	Heterogeneous gradient computing optimization for scalable deep neural networks	en
dc.type	journal article	en
dspace.entity.type	Publication
relation.isAuthorOfPublication	3482d7bc-e120-48a3-812e-cc4b25a6d2fe
relation.isAuthorOfPublication.latestForDiscovery	3482d7bc-e120-48a3-812e-cc4b25a6d2fe
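
The abstract states that replicas holding different amounts of data should influence the global parameter update differently. The following is a minimal sketch of one natural reading of that idea, assuming each replica's gradient is weighted by its share of the samples. It is not taken from the article: the linear model, the 70/20/10 split, and the names `local_gradient` and `aggregate` are hypothetical and chosen only for illustration.

```python
import numpy as np

def local_gradient(w, X, y):
    """Mean-squared-error gradient of a linear model on one replica's shard."""
    residual = X @ w - y
    return 2.0 * X.T @ residual / len(y)

def aggregate(gradients, shard_sizes, weighted=True):
    """Combine per-replica gradients, optionally weighting by shard size."""
    sizes = np.asarray(shard_sizes, dtype=float)
    if weighted:
        weights = sizes / sizes.sum()          # share of samples per replica
    else:
        weights = np.full(len(sizes), 1.0 / len(sizes))  # plain average
    return sum(wt * g for wt, g in zip(weights, gradients))

rng = np.random.default_rng(0)
X = rng.normal(size=(100, 3))
y = rng.normal(size=100)
w = np.zeros(3)

# Assumed uneven split: a fast replica gets 70 samples, slower ones 20 and 10.
shard_sizes = [70, 20, 10]
bounds = np.cumsum(shard_sizes)[:-1]
shards = zip(np.split(X, bounds), np.split(y, bounds))
grads = [local_gradient(w, Xs, ys) for Xs, ys in shards]

weighted = aggregate(grads, shard_sizes, weighted=True)
uniform = aggregate(grads, shard_sizes, weighted=False)
full = local_gradient(w, X, y)  # gradient over the whole dataset at once
```

Under these assumptions the size-weighted combination reproduces the gradient computed over the whole dataset, whereas the plain average gives the 10-sample replica the same influence as the 70-sample one.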

Files

Original bundle

Name: MorenoAlvarez_Sergio_2022HeterogeneousGradien_SERGIO MORENO ALVARE.pdf
Size: 1.86 MB
Format: Adobe Portable Document Format

License bundle

Name: license.txt
Size: 3.62 KB
Format: Item-specific license agreed to upon submission

Collections

Articles and papers
