dc.contributor.author | Nozal, Raúl | |
dc.contributor.author | Pérez Pavón, Borja | |
dc.contributor.author | Bosque Orero, José Luis | |
dc.contributor.author | Beivide Palacio, Ramón | |
dc.contributor.other | Universidad de Cantabria | es_ES |
dc.date.accessioned | 2025-01-20T14:30:13Z | |
dc.date.available | 2025-01-20T14:30:13Z | |
dc.date.issued | 2019-03 | |
dc.identifier.issn | 0920-8542 | |
dc.identifier.issn | 1573-0484 | |
dc.identifier.other | TIN2016-76635-C2-2-R | es_ES |
dc.identifier.other | TIN2016-81840-REDT | es_ES |
dc.identifier.uri | https://hdl.handle.net/10902/35076 | |
dc.description.abstract | Heterogeneous systems composed of a CPU and a set of different hardware accelerators are very compelling thanks to their excellent performance and energy consumption features. One of the most important problems of those systems is the workload distribution among their devices. This paper describes an extension of the Maat library to allow the co-execution of a data-parallel OpenCL kernel on a heterogeneous system composed of a CPU and an Intel Xeon Phi. Maat provides an abstract view of the heterogeneous system as well as a set of load balancing algorithms to squeeze the performance out of the node. It automatically performs the data partition and distribution among the devices, generates the kernels and efficiently merges the partial outputs together. Experimental results show that this approach always outperforms the baseline with only a Xeon Phi, giving excellent performance and energy efficiency. Furthermore, it is essential to select the right load balancing algorithm because it has a huge impact on system performance and energy consumption. | es_ES |
dc.description.sponsorship | This work has been supported by the Spanish Ministry of Education, FPU grant FPU16/03299, the University of Cantabria, grant CVE-2014-18166, the Spanish Science and Technology Commission under contracts TIN2016-76635-C2-2-R and TIN2016-81840-REDT (CAPAP-H6 network), the European Research Council (G.A. No. 321253) and the European HiPEAC Network of Excellence. The Mont-Blanc project has received funding from the European Union's Horizon 2020 research and innovation programme under Grant Agreement No. 671697. | es_ES |
dc.format.extent | 14 p. | es_ES |
dc.language.iso | eng | es_ES |
dc.publisher | Kluwer Academic Publishers | es_ES |
dc.rights | © Springer Science+Business Media, LLC, part of Springer Nature 2018. This version of the article has been accepted for publication, after peer review (when applicable) and is subject to Springer Nature's AM terms of use, but is not the Version of Record and does not reflect post-acceptance improvements, or any corrections. The Version of Record is available online at: http://dx.doi.org/10.1007/s11227-018-2318-5 | es_ES |
dc.source | Journal of Supercomputing. 2019, 75(3), 1123-1136 | es_ES |
dc.subject.other | Heterogeneous computing | es_ES |
dc.subject.other | Co-execution CPU-Xeon Phi | es_ES |
dc.subject.other | Load balancing | es_ES |
dc.subject.other | OpenCL | es_ES |
dc.subject.other | Performance portability | es_ES |
dc.subject.other | Energy efficiency | es_ES |
dc.title | Load balancing in a heterogeneous world: CPU-Xeon Phi co-execution of data-parallel kernels | es_ES |
dc.type | info:eu-repo/semantics/article | es_ES |
dc.relation.publisherVersion | https://doi.org/10.1007/s11227-018-2318-5 | es_ES |
dc.rights.accessRights | openAccess | es_ES |
dc.relation.projectID | info:eu-repo/grantAgreement/EC/H2020/671697/EU/MONT-BLANC 3, European scalable and power efficient HPC platform based on low-power embedded technology/MONT-BLANC 3/ | es_ES |
dc.identifier.DOI | 10.1007/s11227-018-2318-5 | |
dc.type.version | acceptedVersion | es_ES |