Vis enkel innførsel

dc.contributor.authorHolm, Håvard Heitlo
dc.contributor.authorBrodtkorb, André R.
dc.contributor.authorSætra, Martin Lilleeng
dc.date.accessioned2020-04-15T07:26:26Z
dc.date.available2020-04-15T07:26:26Z
dc.date.created2020-01-07T10:49:25Z
dc.date.issued2020
dc.identifier.issn2079-3197
dc.identifier.urihttps://hdl.handle.net/11250/2651068
dc.description.abstractIn this work, we examine the performance, energy efficiency, and usability when using Python for developing high-performance computing codes running on the graphics processing unit (GPU). We investigate the portability of performance and energy efficiency between Compute Unified Device Architecture (CUDA) and Open Compute Language (OpenCL); between GPU generations; and between low-end, mid-range, and high-end GPUs. Our findings showed that the impact of using Python is negligible for our applications, and furthermore, CUDA and OpenCL applications tuned to an equivalent level can in many cases obtain the same computational performance. Our experiments showed that performance in general varies more between different GPUs than between using CUDA and OpenCL. We also show that tuning for performance is a good way of tuning for energy efficiency, but that specific tuning is needed to obtain optimal energy efficiency.en_US
dc.language.isoengen_US
dc.publisherMDPIen_US
dc.rightsNavngivelse 4.0 Internasjonal*
dc.rights.urihttp://creativecommons.org/licenses/by/4.0/deed.no*
dc.subjectGPU computingen_US
dc.subjectCUDAen_US
dc.subjectOpenCLen_US
dc.subjectHigh performance computingen_US
dc.subjectShallow-water simulationen_US
dc.subjectPower efficiencyen_US
dc.titleGPU Computing with Python: Performance, Energy Efficiency and Usabilityen_US
dc.typeJournal articleen_US
dc.typePeer revieweden_US
dc.description.versionpublishedVersionen_US
dc.rights.holderThis is an open access article distributed under the Creative Commons Attribution License which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly citeden_US
dc.source.volume8en_US
dc.source.journalComputationen_US
dc.source.issue1en_US
dc.identifier.doi10.3390/computation8010004
dc.identifier.cristin1767477
dc.relation.projectNorges forskningsråd: 250935en_US
dc.relation.projectNotur/NorStore: NN9550Ken_US
cristin.unitcode7401,90,26,0
cristin.unitnameMathematics and Cybernetics
cristin.ispublishedfalse
cristin.fulltextoriginal
cristin.fulltextpostprint
cristin.fulltextpreprint
cristin.qualitycode1


Tilhørende fil(er)

Thumbnail

Denne innførselen finnes i følgende samling(er)

Vis enkel innførsel

Navngivelse 4.0 Internasjonal
Med mindre annet er angitt, så er denne innførselen lisensiert som Navngivelse 4.0 Internasjonal