hwloc: A Generic Framework for Managing Hardware Affinities in HPC Applications
The increasing numbers of cores, shared caches and memory nodes within machines introduces a complex hardware topology. High-performance computing applications now have to carefully adapt their placement and behavior according to the underlying hierarchy of hardware resources and their software affi...
Saved in:
Published in: | 2010 18th Euromicro Conference on Parallel, Distributed and Network-based Processing pp. 180 - 186 |
---|---|
Main Authors: | , , , , , , , |
Format: | Conference Proceeding |
Language: | English |
Published: |
IEEE
01-02-2010
|
Subjects: | |
Online Access: | Get full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Abstract | The increasing numbers of cores, shared caches and memory nodes within machines introduces a complex hardware topology. High-performance computing applications now have to carefully adapt their placement and behavior according to the underlying hierarchy of hardware resources and their software affinities. We introduce the Hardware Locality (hwloc) software which gathers hardware information about processors, caches, memory nodes and more, and exposes it to applications and runtime systems in a abstracted and portable hierarchical manner. hwloc may significantly help performance by having runtime systems place their tasks or adapt their communication strategies depending on hardware affinities. We show that hwloc can already be used by popular high-performance OpenMP or MPI software. Indeed, scheduling OpenMP threads according to their affinities or placing MPI processes according to their communication patterns shows interesting performance improvement thanks to hwloc. An optimized MPI communication strategy may also be dynamically chosen according to the location of the communicating processes in the machine and its hardware characteristics. |
---|---|
AbstractList | The increasing numbers of cores, shared caches and memory nodes within machines introduces a complex hardware topology. High-performance computing applications now have to carefully adapt their placement and behavior according to the underlying hierarchy of hardware resources and their software affinities. We introduce the Hardware Locality (hwloc) software which gathers hardware information about processors, caches, memory nodes and more, and exposes it to applications and runtime systems in a abstracted and portable hierarchical manner. hwloc may significantly help performance by having runtime systems place their tasks or adapt their communication strategies depending on hardware affinities. We show that hwloc can already be used by popular high-performance OpenMP or MPI software. Indeed, scheduling OpenMP threads according to their affinities or placing MPI processes according to their communication patterns shows interesting performance improvement thanks to hwloc. An optimized MPI communication strategy may also be dynamically chosen according to the location of the communicating processes in the machine and its hardware characteristics. |
Author | Clet-Ortega, Jerome Goglin, Brice Thibault, Samuel Mercier, Guillaume Broquedis, Franois Moreaud, Stephanie Furmento, Nathalie Namyst, Raymond |
Author_xml | – sequence: 1 givenname: Franois surname: Broquedis fullname: Broquedis, Franois email: broquedi@labri.fr organization: LaBRI, Univ. of Bordeaux, Talence, France – sequence: 2 givenname: Jerome surname: Clet-Ortega fullname: Clet-Ortega, Jerome email: jcletort@labri.fr organization: LaBRI, Univ. of Bordeaux, Talence, France – sequence: 3 givenname: Stephanie surname: Moreaud fullname: Moreaud, Stephanie email: smoreaud@labri.fr organization: LaBRI, INRIA, Talence, France – sequence: 4 givenname: Nathalie surname: Furmento fullname: Furmento, Nathalie email: furmento@labri.fr organization: LaBRI, CNRS, Talence, France – sequence: 5 givenname: Brice surname: Goglin fullname: Goglin, Brice email: goglin@labri.fr organization: LaBRI, INRIA, Talence, France – sequence: 6 givenname: Guillaume surname: Mercier fullname: Mercier, Guillaume email: mercier@labri.fr organization: LaBRI, ENSEIRB, Talence, France – sequence: 7 givenname: Samuel surname: Thibault fullname: Thibault, Samuel email: thibault@labri.fr organization: LaBRI, Univ. of Bordeaux, Talence, France – sequence: 8 givenname: Raymond surname: Namyst fullname: Namyst, Raymond email: namyst@labri.fr organization: LaBRI, Univ. of Bordeaux, Talence, France |
BookMark | eNpVj01PAjEURatiIiArl276Bwbf68c86m6CAiYYWOialE4LVeiQDgnx34vRjYubm5Ob3OT0WCc1yTN2hzBEBPOwfFoOBZyppAs2MDRCJZTSJUl1ybpCEhWaNFz92wR1WBehLIsSjbhhvbb9AABSwnTZYnvaNe6RV3zqk8_R8Um2e39q8icPTeavNtlNTBs-s7k-2ex5FUJM8Rh9y2Pis-WYV4fDLjp7jE1qb9l1sLvWD_66z94nz2_jWTFfTF_G1bzYSlTHwmhB59QYZHABaU0_CMFIeZZAjcq5kSgBDFlZGxKwrj2CxKDRjdZG9tn972_03q8OOe5t_lpppX-c5Tf_YFG3 |
ContentType | Conference Proceeding |
DBID | 6IE 6IL CBEJK RIE RIL |
DOI | 10.1109/PDP.2010.67 |
DatabaseName | IEEE Electronic Library (IEL) Conference Proceedings IEEE Proceedings Order Plan All Online (POP All Online) 1998-present by volume IEEE Xplore All Conference Proceedings IEEE Electronic Library Online IEEE Proceedings Order Plans (POP All) 1998-Present |
DatabaseTitleList | |
Database_xml | – sequence: 1 dbid: RIE name: IEEE Electronic Library Online url: http://ieeexplore.ieee.org/Xplore/DynWel.jsp sourceTypes: Publisher |
DeliveryMethod | fulltext_linktorsrc |
Discipline | Computer Science |
EISBN | 9781424456734 1424456738 |
EISSN | 2377-5750 |
EndPage | 186 |
ExternalDocumentID | 5452445 |
Genre | orig-research |
GroupedDBID | 29N 29O 6IE 6IF 6IH 6IK 6IL 6IN AAJGR ABLEC ACGFS ADZIZ ALMA_UNASSIGNED_HOLDINGS BEFXN BFFAM BGNUA BKEBE BPEOZ CBEJK CHZPO IEGSK IPLJI JC5 M43 OCL RIE RIL RNS |
ID | FETCH-LOGICAL-h314t-9527952d1f3fcf17b779520f9333771514cc8260097a3d9720bde1031f51c8b93 |
IEDL.DBID | RIE |
ISBN | 9781424456727 142445672X |
ISSN | 1066-6192 |
IngestDate | Wed Jun 26 19:22:25 EDT 2024 |
IsDoiOpenAccess | false |
IsOpenAccess | true |
IsPeerReviewed | false |
IsScholarly | false |
Language | English |
LinkModel | DirectLink |
MergedId | FETCHMERGED-LOGICAL-h314t-9527952d1f3fcf17b779520f9333771514cc8260097a3d9720bde1031f51c8b93 |
OpenAccessLink | https://hal.inria.fr/file/index/docid/429889/filename/main.pdf |
PageCount | 7 |
ParticipantIDs | ieee_primary_5452445 |
PublicationCentury | 2000 |
PublicationDate | 2010-02 |
PublicationDateYYYYMMDD | 2010-02-01 |
PublicationDate_xml | – month: 02 year: 2010 text: 2010-02 |
PublicationDecade | 2010 |
PublicationTitle | 2010 18th Euromicro Conference on Parallel, Distributed and Network-based Processing |
PublicationTitleAbbrev | PDP |
PublicationYear | 2010 |
Publisher | IEEE |
Publisher_xml | – name: IEEE |
SSID | ssj0007429 ssj0000452741 |
Score | 1.9515082 |
Snippet | The increasing numbers of cores, shared caches and memory nodes within machines introduces a complex hardware topology. High-performance computing applications... |
SourceID | ieee |
SourceType | Publisher |
StartPage | 180 |
SubjectTerms | Application software Bandwidth Computer architecture Concurrent computing Hardware Topology Affinities Placement MPI OpenMP Memory management Multicore processing Software libraries |
Title | hwloc: A Generic Framework for Managing Hardware Affinities in HPC Applications |
URI | https://ieeexplore.ieee.org/document/5452445 |
hasFullText | 1 |
inHoldings | 1 |
isFullTextHit | |
isPrint | |
link | http://sdu.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV07T8MwED7RTkwFWsRbHhgxTWrHD7aqD3WCSIDUrYpfapcUlVb9-9hOUkBiYYgUZ4oucu47333fB3AvNM8YLST2UEFgj_81Fo4o7Hx2JzxRjEWXiNkrf56L8STI5DwcuDDW2jh8Zh_Dbezlm7XehaOyfvDDpjRrQYtLUXG1DucpQRo8usjUf2Ff8snY6WQMhyKhIXVljA_mjdZTveY1cS9NZD8f59XAF_ttuBLzzbTzvzc9gd43cQ_lh5R0Cke2PINO49yA6o3chZfl3iexJzREUXZ6pdG0GdJCHsWixr0Ihcb-vthYNHRuVUb1VbQq0SwfoeGP1ncP3qeTt9EM19YKeElSusXSB8tfJnXEaZdyxcMycZIQwrlHAVRrEbTrJS-IkXyQKGODI4TLUi2UJOfQLtelvQBEhXQexohE6IK6xPqS2jE1MIoakxBuL6Eb4rP4qNQzFnVorv5-fA3HVX8-DIzcQHu72dlbaH2a3V383l89NaFt |
link.rule.ids | 310,311,782,786,791,792,798,27936,54770 |
linkProvider | IEEE |
linkToHtml | http://sdu.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV27TsMwFLVoGWAq0CLeeGDENKkdP9iqPhREKZEoUrcq8UPtkqLSqr-P7SQFJBaGSHGm6EbOPdf3nnMAuOOSRZSkAlmowJHF_xJxgzNkbHbHLMgo9S4R8RsbT3l_4GRy7ndcGK21Hz7TD-7W9_LVUm7cUVnb-WETEtXAfkQYCwq21u5ExYmDex-Z8j9siz7he52UIlcmVLSuiLLOtFJ7KtespO6FgWgn_aQY-aK_LVd8xhk2_veuR6D1Td2DyS4pHYM9nZ-ARuXdAMut3ASv861NY4-wC73w9ELCYTWmBS2OhZV_EXSt_W260rBrzCL3-qtwkcM46cHuj-Z3C7wPB5NejEpzBTTHIVkjYYNlLxUabKQJWcbcMjACY8yYxQFESu7U6wVLsRKsE2RKO08IE4WSZwKfgnq-zPUZgIQLY4EMD7hMiQm0LaoNzToqI0oFmOlz0HTxmX0U-hmzMjQXfz--BQfx5GU0Gz2Nny_BYdGtd-MjV6C-Xm30Nah9qs2N__Zf6qakuA |
openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=proceeding&rft.title=2010+18th+Euromicro+Conference+on+Parallel%2C+Distributed+and+Network-based+Processing&rft.atitle=hwloc%3A+A+Generic+Framework+for+Managing+Hardware+Affinities+in+HPC+Applications&rft.au=Broquedis%2C+Franois&rft.au=Clet-Ortega%2C+Jerome&rft.au=Moreaud%2C+Stephanie&rft.au=Furmento%2C+Nathalie&rft.date=2010-02-01&rft.pub=IEEE&rft.isbn=9781424456727&rft.issn=1066-6192&rft.eissn=2377-5750&rft.spage=180&rft.epage=186&rft_id=info:doi/10.1109%2FPDP.2010.67&rft.externalDocID=5452445 |
thumbnail_l | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=1066-6192&client=summon |
thumbnail_m | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=1066-6192&client=summon |
thumbnail_s | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=1066-6192&client=summon |