dc.contributor.author | López, Marta | |
dc.contributor.author | Stafford Fernández, Esteban | |
dc.contributor.author | Bosque Orero, José Luis | |
dc.contributor.other | Universidad de Cantabria | es_ES |
dc.date.accessioned | 2025-02-20T17:21:20Z | |
dc.date.available | 2025-02-20T17:21:20Z | |
dc.date.issued | 2025-01 | |
dc.identifier.issn | 0920-8542 | |
dc.identifier.issn | 1573-0484 | |
dc.identifier.other | PID2022-136454NB-C21 | es_ES |
dc.identifier.other | TED2021-131176B-I00 | es_ES |
dc.identifier.uri | https://hdl.handle.net/10902/35705 | |
dc.description.abstract | In recent years, energy consumption has become a limiting factor in the evolution of high-performance computing (HPC) clusters in terms of environmental concern and maintenance cost. The computing power of these clusters is increasing, together with the demands of the workloads they execute. A key component in HPC systems is the workload manager, whose operation has a substantial impact on the performance and energy consumption of the clusters. Recent research has employed machine learning techniques to optimise the operation of this component. However, these attempts have focused on homogeneous clusters where all the cores are pooled together and considered equal, disregarding the fact that they are contained in nodes and that they can have different performances. This work presents an intelligent job scheduler based on deep reinforcement learning that focuses on reducing energy consumption of heterogeneous HPC clusters. To this aim it leverages information provided by the users as well as the power consumption specifications of the compute resources of the cluster. The scheduler is evaluated against a set of heuristic algorithms showing that it has potential to give similar results, even in the face of the extra complexity of the heterogeneous cluster. | es_ES |
dc.description.sponsorship | This work has been supported by the Spanish Science and Technology Commission under contract
PID2022-136454NB-C21, the Ministerio de Ciencia e Innovación; Proyectos de Transición Ecológica y Digital 2021 under grant TED2021-131176B-I00 and the European HiPEAC Network of Excellence. | es_ES |
dc.format.extent | 23 p. | es_ES |
dc.language.iso | eng | es_ES |
dc.publisher | Kluwer Academic Publishers | es_ES |
dc.rights | This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. | es_ES |
dc.rights.uri | http://creativecommons.org/licenses/by/4.0/ | * |
dc.source | Journal of Supercomputing, 2025, 81(2), 427 | es_ES |
dc.subject.other | Task scheduling | es_ES |
dc.subject.other | Deep reinforcement learning | es_ES |
dc.subject.other | High-performance computing | es_ES |
dc.subject.other | Heterogeneous clusters | es_ES |
dc.subject.other | Energy consumption | es_ES |
dc.title | Intelligent energy pairing scheduler (InEPS) for heterogeneous HPC clusters | es_ES |
dc.type | info:eu-repo/semantics/article | es_ES |
dc.relation.publisherVersion | https://doi.org/10.1007/s11227-024-06907-y | es_ES |
dc.rights.accessRights | openAccess | es_ES |
dc.relation.projectID | info:eu-repo/grantAgreement/AEI/Plan Estatal de Investigación Científica y Técnica y de Innovación 2021-2023/PID2022-136454NB-C21/ES/ARQUITECTURA Y PROGRAMACION DE COMPUTADORES ESCALABLES DE ALTO RENDIMIENTO Y BAJO CONSUMO III-UC (TEAM-MATES UC)/ | es_ES |
dc.identifier.DOI | 10.1007/s11227-024-06907-y | |
dc.type.version | publishedVersion | es_ES |