Show simple item record

dc.contributor.authorLeu, Brian
dc.contributor.authorAseeri, Samar
dc.contributor.authorMuite, Benson
dc.date.accessioned2021-02-24T06:52:13Z
dc.date.available2021-02-24T06:52:13Z
dc.date.issued2021-01-06
dc.identifier.citationLeu, B., Aseeri, S., & Muite, B. (2021). A Comparison of Parallel Profiling Tools for Programs utilizing the FFT. The International Conference on High Performance Computing in Asia-Pacific Region Companion. doi:10.1145/3440722.3440881
dc.identifier.isbn9781450383035
dc.identifier.doi10.1145/3440722.3440881
dc.identifier.urihttp://hdl.handle.net/10754/667636
dc.description.abstractPerformance monitoring is an important component of code optimization. Performance monitoring is also important for the beginning user, but can be difficult to configure appropriately. The overhead of the performance monitoring tools Craypat, FPMP, mpiP, Scalasca and TAU, are measured using default configurations likely to be choosen by a novice user and shown to be small when profiling Fast Fourier Transform based solvers for the Klein Gordon equation based on 2decomp&FFT and on FFTE. Performance measurements help explain that despite FFTE having a more efficient parallel algorithm, it is not always faster than 2decom&FFT because the complied single core FFT is not as fast as that in FFTW which is used in 2decomp&FFT.
dc.description.sponsorshipWe thank José Gracia, Chirstoph Niethammer, Sameer Shende, Daisuke Takahashi and Brian Wylie for helpful discussions. B.L.’s work was primarily done while affiliated with the University of Michigan. B.K.M. was partially supported by HPC Europa 3 (INFRAIA-2016-1-730897) and the Estonian Center for Excellence in IT (TK148),and his work was mostly done while affiliated with the Institute of Computer Science at the University of Tartu. The computational resources used to build and test the programs were: •Shaheen and Shaheen II operated by the KAUST Supercomputing Laboratory •Hazelhen at HLRS. •K computer that was operated by RIKEN. •Rocket and Vedur operated by the University of Tartu HPCcenter. •Mira that was operated by the Argonne Leadership Computing Facility, which is a DOE Office of Science User Facility supported under Contract DE-AC02-06CH11357.
dc.publisherAssociation for Computing Machinery (ACM)
dc.relation.urlhttps://dl.acm.org/doi/10.1145/3440722.3440881
dc.rightsArchived with thanks to ACM. © 2021 Copyright held by the owner/author(s). Publication rights licensed to ACM.
dc.titleA Comparison of Parallel Profiling Tools for Programs utilizing the FFT
dc.typeConference Paper
dc.contributor.departmentExtreme Computing Research Center
dc.contributor.departmentComputer, Electrical and Mathematical Sciences and Engineering (CEMSE) Division
dc.conference.date2021-01-22
dc.conference.name2021 International Conference on High Performance Computing in Asia-Pacific Region Workshops, HPC Asia 2021
dc.conference.locationVirtual, Online, KOR
dc.eprint.versionPost-print
dc.contributor.institutionApplied Dynamics International, USA
dc.contributor.institutionKichakato Kizito, Kenya
dc.identifier.pages36-45
kaust.personAseeri, Samar
dc.identifier.eid2-s2.0-85099469676
refterms.dateFOA2021-02-24T10:18:01Z
kaust.acknowledged.supportUnitKAUST Supercomputing Laboratory
kaust.acknowledged.supportUnitShaheen II
kaust.acknowledged.supportUnitShaheen


Files in this item

Thumbnail
Name:
ixpug_paper_4.pdf
Size:
554.8Kb
Format:
PDF
Description:
Accepted Manuscript

This item appears in the following Collection(s)

Show simple item record