hwloc: A Generic Framework for Managing Hardware Affinities in HPC Applications

Bibliographic Details
Published in: 2010 18th Euromicro Conference on Parallel, Distributed and Network-based Processing, pp. 180-186
Main Authors: Broquedis, François; Clet-Ortega, Jerome; Moreaud, Stephanie; Furmento, Nathalie; Goglin, Brice; Mercier, Guillaume; Thibault, Samuel; Namyst, Raymond
Format: Conference Proceeding
Language: English
Published: IEEE, 2010-02-01
Abstract The increasing number of cores, shared caches and memory nodes within machines introduces a complex hardware topology. High-performance computing applications now have to carefully adapt their placement and behavior to the underlying hierarchy of hardware resources and their software affinities. We introduce the Hardware Locality (hwloc) software, which gathers hardware information about processors, caches, memory nodes and more, and exposes it to applications and runtime systems in an abstracted and portable hierarchical manner. hwloc may significantly improve performance by letting runtime systems place their tasks or adapt their communication strategies depending on hardware affinities. We show that hwloc can already be used by popular high-performance OpenMP or MPI software. Indeed, scheduling OpenMP threads according to their affinities or placing MPI processes according to their communication patterns yields performance improvements thanks to hwloc. An optimized MPI communication strategy may also be chosen dynamically according to the location of the communicating processes in the machine and its hardware characteristics.
Author_xml – sequence: 1
  givenname: François
  surname: Broquedis
  fullname: Broquedis, François
  email: broquedi@labri.fr
  organization: LaBRI, Univ. of Bordeaux, Talence, France
– sequence: 2
  givenname: Jerome
  surname: Clet-Ortega
  fullname: Clet-Ortega, Jerome
  email: jcletort@labri.fr
  organization: LaBRI, Univ. of Bordeaux, Talence, France
– sequence: 3
  givenname: Stephanie
  surname: Moreaud
  fullname: Moreaud, Stephanie
  email: smoreaud@labri.fr
  organization: LaBRI, INRIA, Talence, France
– sequence: 4
  givenname: Nathalie
  surname: Furmento
  fullname: Furmento, Nathalie
  email: furmento@labri.fr
  organization: LaBRI, CNRS, Talence, France
– sequence: 5
  givenname: Brice
  surname: Goglin
  fullname: Goglin, Brice
  email: goglin@labri.fr
  organization: LaBRI, INRIA, Talence, France
– sequence: 6
  givenname: Guillaume
  surname: Mercier
  fullname: Mercier, Guillaume
  email: mercier@labri.fr
  organization: LaBRI, ENSEIRB, Talence, France
– sequence: 7
  givenname: Samuel
  surname: Thibault
  fullname: Thibault, Samuel
  email: thibault@labri.fr
  organization: LaBRI, Univ. of Bordeaux, Talence, France
– sequence: 8
  givenname: Raymond
  surname: Namyst
  fullname: Namyst, Raymond
  email: namyst@labri.fr
  organization: LaBRI, Univ. of Bordeaux, Talence, France
ContentType Conference Proceeding
DOI 10.1109/PDP.2010.67
Discipline Computer Science
EISBN 9781424456734
1424456738
EISSN 2377-5750
EndPage 186
ExternalDocumentID 5452445
Genre orig-research
ISBN 9781424456727
142445672X
ISSN 1066-6192
IngestDate Wed Jun 26 19:22:25 EDT 2024
IsDoiOpenAccess false
IsOpenAccess true
IsPeerReviewed false
IsScholarly false
OpenAccessLink https://hal.inria.fr/file/index/docid/429889/filename/main.pdf
PageCount 7
ParticipantIDs ieee_primary_5452445
PublicationCentury 2000
PublicationDate 2010-02
PublicationDateYYYYMMDD 2010-02-01
PublicationDate_xml – month: 02
  year: 2010
  text: 2010-02
PublicationDecade 2010
PublicationTitle 2010 18th Euromicro Conference on Parallel, Distributed and Network-based Processing
PublicationTitleAbbrev PDP
PublicationYear 2010
Publisher IEEE
Publisher_xml – name: IEEE
SourceID ieee
SourceType Publisher
StartPage 180
SubjectTerms Application software
Bandwidth
Computer architecture
Concurrent computing
Hardware Topology Affinities Placement MPI OpenMP
Memory management
Multicore processing
Software libraries
Title hwloc: A Generic Framework for Managing Hardware Affinities in HPC Applications
URI https://ieeexplore.ieee.org/document/5452445