Combining Data Reuse With Data-Level Parallelization for FPGA-Targeted Hardware Compilation: A Geometric Programming Framework

A nonlinear optimization framework is proposed in this paper to automate exploration of the design space consisting of data-reuse (buffering) decisions and loop-level parallelization, in the context of field-programmable-gate-array-targeted hardware compilation. Buffering frequently accessed data in...

Full description

Saved in:

Bibliographic Details
Published in:	IEEE transactions on computer-aided design of integrated circuits and systems Vol. 28; no. 3; pp. 305 - 315
Main Authors:	Qiang Liu, Constantinides, G.A., Masselos, K., Cheung, P.
Format:	Journal Article
Language:	English
Published:	New York IEEE 01-03-2009 The Institute of Electrical and Electronics Engineers, Inc. (IEEE)
Subjects:	Automatic programming Computation Computer architecture data reuse Data-level parallelization Design engineering Design optimization Field programmable gate arrays field-programmable gate-array (FPGA) hardware compilation geometric programming Hardware Memory management Nonlinearity Optimization Optimization methods Parallel processing Parallel programming Programming Random access memory Reuse Studies System-on-a-chip
Online Access:	Get full text
Tags:	Add Tag No Tags, Be the first to tag this record!

Abstract	A nonlinear optimization framework is proposed in this paper to automate exploration of the design space consisting of data-reuse (buffering) decisions and loop-level parallelization, in the context of field-programmable-gate-array-targeted hardware compilation. Buffering frequently accessed data in on-chip memories can reduce off-chip memory accesses and open avenues for parallelization. However, the exploitation of both data reuse and parallelization is limited by the memory resources available on-chip. As a result, considering these two problems separately, e.g., first exploring data reuse and then exploring data-level parallelization, based on the data-reuse options determined in the first step, may not yield the performance-optimal designs for limited on-chip memory resources. We consider both problems at the same time, exposing the dependence between the two. We show that this combined problem can be formulated as a nonlinear program and further show that efficient solution techniques exist for this problem, based on recent advances in optimization of so-called geometric programming problems. The results from applying this framework to several real benchmarks implemented on a Xilinx device demonstrate that given different constraints on on-chip memory utilization, the corresponding performance-optimal designs are automatically determined by the framework. We have also implemented designs determined by a two-stage optimization method that first explores data reuse and then explores parallelization on the same platform, and by comparison, the performance-optimal designs proposed by our framework are faster than the designs determined by the two-stage method by up to 5.7 times.
AbstractList	A nonlinear optimization framework is proposed in this paper to automate exploration of the design space consisting of data-reuse (buffering) decisions and loop-level parallelization, in the context of field-programmable-gate-array-targeted hardware compilation. Buffering frequently accessed data in on-chip memories can reduce off-chip memory accesses and open avenues for parallelization. However, the exploitation of both data reuse and parallelization is limited by the memory resources available on-chip. As a result, considering these two problems separately, e.g., first exploring data reuse and then exploring data-level parallelization, based on the data-reuse options determined in the first step, may not yield the performance-optimal designs for limited on-chip memory resources. We consider both problems at the same time, exposing the dependence between the two. We show that this combined problem can be formulated as a nonlinear program and further show that efficient solution techniques exist for this problem, based on recent advances in optimization of so-called geometric programming problems. The results from applying this framework to several real benchmarks implemented on a Xilinx device demonstrate that given different constraints on on-chip memory utilization, the corresponding performance-optimal designs are automatically determined by the framework. We have also implemented designs determined by a two-stage optimization method that first explores data reuse and then explores parallelization on the same platform, and by comparison, the performance-optimal designs proposed by our framework are faster than the designs determined by the two-stage method by up to 5.7 times. A nonlinear optimization framework is proposed in this paper to automate exploration of the design space consisting of data-reuse (buffering) decisions and loop-level parallelization, in the context of field-programmable-gate-array-targeted hardware compilation.
Author	Masselos, K. Cheung, P. Constantinides, G.A. Qiang Liu
Author_xml	– sequence: 1 surname: Qiang Liu fullname: Qiang Liu organization: Dept. of Electr. & Electron. Eng., Imperial Coll. London, London – sequence: 2 givenname: G.A. surname: Constantinides fullname: Constantinides, G.A. organization: Dept. of Electr. & Electron. Eng., Imperial Coll. London, London – sequence: 3 givenname: K. surname: Masselos fullname: Masselos, K. organization: Dept. of Comput. Sci. & Technol., Univ. of Peloponnese, Tripolis – sequence: 4 givenname: P. surname: Cheung fullname: Cheung, P. organization: Dept. of Electr. & Electron. Eng., Imperial Coll. London, London
BookMark	eNpdkUtrGzEUhUVJoE7aH1CyEd10NaledyR1Z5zYKRhqgkuXQp654yidGbnSOCFZ5Ld3HIcuurkP-M65F84ZOeljj4R84uySc2a_rmfTq0vBmB0Ll6D4OzLhVupCceAnZMKENgVjmr0nZznfM8YVCDshL7PYbUIf-i298oOnt7jPSH-F4e51L5b4gC1d-eTbFtvw7IcQe9rEROerxbRY-7TFAWt641P96BPS0W8X2lfsG53SBcYOhxQqukpxm3zXHU7NxwEfY_r9gZw2vs348a2fk5_z6_Xsplj-WHyfTZdFJUs9FMIaW4HllTSy3BjUGlWpUDEpvdDQVKWqwSolDMCmYd7omoGEjQVeMw8gz8mXo-8uxT97zIPrQq6wbX2PcZ-d0cCEMqYcyc__kfdxn_rxOWdAKwVKihHiR6hKMeeEjdul0Pn05DhzhzzcIQ93yMO95TFqLo6agIj_eKUNSCXkXxf5hzU
CODEN	ITCSDI
CitedBy_id	crossref_primary_10_1109_TSP_2016_2566608 crossref_primary_10_4316_AECE_2015_01014 crossref_primary_10_1109_TC_2011_205 crossref_primary_10_1145_2675359 crossref_primary_10_1109_TCE_2011_5955224 crossref_primary_10_1109_TVLSI_2014_2342213 crossref_primary_10_1007_s11554_017_0690_7 crossref_primary_10_1109_ACCESS_2016_2635378 crossref_primary_10_1109_TCAD_2016_2608861 crossref_primary_10_1109_TVLSI_2018_2820016 crossref_primary_10_1016_j_jpdc_2012_08_007
Cites_doi	10.1109/FCCM.2007.18 10.1145/951746.951747 10.1145/1278349.1278353 10.1007/978-1-4615-6199-6 10.1109/FPL.2006.311242 10.1049/ip-cdt:20045086 10.1109/71.127259 10.1145/1146909.1146925 10.1109/FPL.2006.311241 10.1142/3234 10.1109/43.739055 10.1145/193209.193217 10.1109/DATE.2005.35 10.1049/ip-cdt:20010514 10.1007/978-1-4757-2849-1 10.1109/FPT.2004.1393262 10.1017/CBO9780511804441 10.1145/998300.997199 10.1145/76263.76337 10.1007/978-1-4757-5676-0 10.1109/TCAD.2003.822123 10.1049/ip-cdt:20020468 10.1016/S0167-8191(98)00029-5
ContentType	Journal Article
Copyright	Copyright The Institute of Electrical and Electronics Engineers, Inc. (IEEE) 2009
Copyright_xml	– notice: Copyright The Institute of Electrical and Electronics Engineers, Inc. (IEEE) 2009
DBID	97E RIA RIE AAYXX CITATION 7SC 7SP 8FD JQ2 L7M L~C L~D F28 FR3
DOI	10.1109/TCAD.2009.2013541
DatabaseName	IEEE All-Society Periodicals Package (ASPP) 2005-present IEEE All-Society Periodicals Package (ASPP) 1998-Present IEEE Electronic Library Online CrossRef Computer and Information Systems Abstracts Electronics & Communications Abstracts Technology Research Database ProQuest Computer Science Collection Advanced Technologies Database with Aerospace Computer and Information Systems Abstracts Academic Computer and Information Systems Abstracts Professional ANTE: Abstracts in New Technology & Engineering Engineering Research Database
DatabaseTitle	CrossRef Technology Research Database Computer and Information Systems Abstracts – Academic Electronics & Communications Abstracts ProQuest Computer Science Collection Computer and Information Systems Abstracts Advanced Technologies Database with Aerospace Computer and Information Systems Abstracts Professional Engineering Research Database ANTE: Abstracts in New Technology & Engineering
DatabaseTitleList	Technology Research Database Technology Research Database
Database_xml	– sequence: 1 dbid: RIE name: IEEE Electronic Library Online url: http://ieeexplore.ieee.org/Xplore/DynWel.jsp sourceTypes: Publisher
DeliveryMethod	fulltext_linktorsrc
Discipline	Engineering
EISSN	1937-4151
EndPage	315
ExternalDocumentID	2294984351 10_1109_TCAD_2009_2013541 4785342
Genre	orig-research
GroupedDBID	--Z -~X 0R~ 29I 4.4 5GY 5VS 6IK 97E AAJGR AASAJ ABQJQ ABVLG ACGFS ACIWK ACNCT AENEX AETIX AI. AIBXA AKJIK ALLEH ALMA_UNASSIGNED_HOLDINGS ASUFR ATWAV BEFXN BFFAM BGNUA BKEBE BPEOZ CS3 DU5 EBS EJD HZ~ H~9 IBMZZ ICLAB IFIPE IFJZH IPLJI JAVBF LAI M43 O9- OCL P2P PZZ RIA RIE RIG RNS TN5 VH1 VJK XFK AAYXX CITATION 7SC 7SP 8FD JQ2 L7M L~C L~D F28 FR3
ID	FETCH-LOGICAL-c367t-2989c591c3836b8e77e464e4033a275fc64d59442855bf0a87d0535b951d0a553
IEDL.DBID	RIE
ISSN	0278-0070
IngestDate	Fri Aug 16 01:04:17 EDT 2024 Thu Oct 10 19:45:06 EDT 2024 Fri Aug 23 02:53:53 EDT 2024 Wed Jun 26 19:27:02 EDT 2024
IsDoiOpenAccess	false
IsOpenAccess	true
IsPeerReviewed	true
IsScholarly	true
Issue	3
Language	English
LinkModel	DirectLink
MergedId	FETCHMERGED-LOGICAL-c367t-2989c591c3836b8e77e464e4033a275fc64d59442855bf0a87d0535b951d0a553
Notes	ObjectType-Article-2 SourceType-Scholarly Journals-1 ObjectType-Feature-1 content type line 23
OpenAccessLink	http://users.uop.gr/%7Ekmas/J28%20IEEE%20TCAD%20data%20reuse%20parallelism%20combo%202009.pdf
PQID	857445432
PQPubID	85470
PageCount	11
ParticipantIDs	ieee_primary_4785342 proquest_miscellaneous_875024886 crossref_primary_10_1109_TCAD_2009_2013541 proquest_journals_857445432
PublicationCentury	2000
PublicationDate	2009-03-01
PublicationDateYYYYMMDD	2009-03-01
PublicationDate_xml	– month: 03 year: 2009 text: 2009-03-01 day: 01
PublicationDecade	2000
PublicationPlace	New York
PublicationPlace_xml	– name: New York
PublicationTitle	IEEE transactions on computer-aided design of integrated circuits and systems
PublicationTitleAbbrev	TCAD
PublicationYear	2009
Publisher	IEEE The Institute of Electrical and Electronics Engineers, Inc. (IEEE)
Publisher_xml	– name: IEEE – name: The Institute of Electrical and Electronics Engineers, Inc. (IEEE)
References	ref12 ref15 ref14 ref11 ref2 ref1 (ref30) 2006 ref17 ref16 ref19 ref18 kandemir (ref13) 2006 el-rewini (ref20) 2005 ref24 ref23 (ref25) 1987 lofberg (ref28) 2004 ref22 ref21 ref27 ref29 ref8 ref7 ref9 ref4 ref3 ref6 hall (ref31) 2003 wilson (ref10) 1994; 29 (ref26) 2006 liu (ref5) 2007
References_xml	– ident: ref4 doi: 10.1109/FCCM.2007.18 – ident: ref15 doi: 10.1145/951746.951747 – ident: ref27 doi: 10.1145/1278349.1278353 – ident: ref29 doi: 10.1007/978-1-4615-6199-6 – ident: ref3 doi: 10.1109/FPL.2006.311242 – ident: ref1 doi: 10.1049/ip-cdt:20045086 – ident: ref18 doi: 10.1109/71.127259 – ident: ref23 doi: 10.1145/1146909.1146925 – ident: ref22 doi: 10.1109/FPL.2006.311241 – ident: ref19 doi: 10.1142/3234 – ident: ref21 doi: 10.1109/43.739055 – year: 2003 ident: ref31 publication-title: Cache and bandwidth aware matrix multiplication on the GPU contributor: fullname: hall – volume: 29 start-page: 31 year: 1994 ident: ref10 article-title: suif: an infrastructure for research on parallelizing and optimizing compilers publication-title: SIGPLAN Not doi: 10.1145/193209.193217 contributor: fullname: wilson – ident: ref7 doi: 10.1109/DATE.2005.35 – ident: ref16 doi: 10.1049/ip-cdt:20010514 – ident: ref2 doi: 10.1007/978-1-4757-2849-1 – ident: ref8 doi: 10.1109/FPT.2004.1393262 – ident: ref12 doi: 10.1017/CBO9780511804441 – ident: ref9 doi: 10.1145/998300.997199 – ident: ref11 doi: 10.1145/76263.76337 – year: 1987 ident: ref25 publication-title: CFT77 Reference Manual – year: 2006 ident: ref30 – start-page: 808 year: 2006 ident: ref13 article-title: maximizing data reuse for minimizing memory space requirements and execution cycles publication-title: Proc ASPDAC contributor: fullname: kandemir – ident: ref6 doi: 10.1007/978-1-4757-5676-0 – ident: ref14 doi: 10.1109/TCAD.2003.822123 – year: 2007 ident: ref5 article-title: data reuse exploration under area constraints for low power reconfigurable systems publication-title: Proc WASPAA contributor: fullname: liu – year: 2006 ident: ref26 publication-title: Handel-C Language Reference Manual – start-page: 284 year: 2004 ident: ref28 article-title: yalmip: a toolbox for modeling and optimization in matlab publication-title: Proc IEEE Int Symp Comput Aided Control Syst Des contributor: fullname: lofberg – ident: ref24 doi: 10.1049/ip-cdt:20020468 – ident: ref17 doi: 10.1016/S0167-8191(98)00029-5 – year: 2005 ident: ref20 publication-title: Advanced Computer Architecture and Parallel Processing (Wiley Series on Parallel and Distributed Computing) contributor: fullname: el-rewini
SSID	ssj0014529
Score	2.0641322
Snippet	A nonlinear optimization framework is proposed in this paper to automate exploration of the design space consisting of data-reuse (buffering) decisions and...
SourceID	proquest crossref ieee
SourceType	Aggregation Database Publisher
StartPage	305
SubjectTerms	Automatic programming Computation Computer architecture data reuse Data-level parallelization Design engineering Design optimization Field programmable gate arrays field-programmable gate-array (FPGA) hardware compilation geometric programming Hardware Memory management Nonlinearity Optimization Optimization methods Parallel processing Parallel programming Programming Random access memory Reuse Studies System-on-a-chip
Title	Combining Data Reuse With Data-Level Parallelization for FPGA-Targeted Hardware Compilation: A Geometric Programming Framework
URI	https://ieeexplore.ieee.org/document/4785342 https://www.proquest.com/docview/857445432 https://search.proquest.com/docview/875024886
Volume	28
hasFullText	1
inHoldings	1
isFullTextHit
isPrint
link	http://sdu.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV07T8MwELaACQbeiPKSByZEICF2HLNVtIEBoQqKYIsc5yIqQYraRmz8du6cNALBwpYozsvn83f23X3H2LFBiIlFYFHFM-sJBCDPREHggY5yozQCdkH5zjcP6u457vWJJue0zYUBABd8Bmd06Hz5-dhWtFV2LhSCi8AJd1HpuM7Vaj0G5EB0-ynEGIvjuPFgBr4-H-JP1cyUiHahFMEPDHJFVX7NxA5ekrX_fdg6W23MSN6t5b7BFqDcZCvfyAW32CeqeubKP_CemRl-D9UU-NNo9uLOvVsKF-IDM6FqKq9NOiZHG5Yng-uuN3Qh4pBz8u1_mAlwmjpGdejcJe_yaxi_UTkuywd1jNcbvSqZB3tts8ekP7y68ZpqC54NIzXziIrdSh1YXLNGWQxKgYgECD8MzYWShY1ELrXA5YqUWeGbWOXEDZOhiZb7Rspwhy2V4xJ2GTcCMmLKD4siEFlMtTw0YjHeZP0QiqjDTub9n77XpBqpW4z4OiVhUW1MnTbC6rAt6vC2YdPXHbY_l1jaqN00jaUSQooQr_L2KuoLOUFMCeMKm6CJdIGzVrT393P32XLtMKIwswO2NJtUcMgWp3l15AbcF_fv0nw
link.rule.ids	315,782,786,798,27933,27934,54767
linkProvider	IEEE
linkToHtml	http://sdu.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV1LT8MwDI54HIADb8R45sAJUWjXPFpuEzCGGGiCIbhVaeoKJNjQtoobvx077SYQXLi1avqK43xObH9m7MAgxEQisKjiqfUEApBnVBB4EKvM6BgBO6d859a9vn2Kzi-IJudokgsDAC74DI7p0Pnys74taKvsRGgEF4ET7qwUWukyW2viMyAXottRIc5YHMmVDzPw45Mu_lbJTYl4F0oR_EAhV1bl11zsAKa59L9PW2aLlSHJG6XkV9gU9FbZwjd6wTX2icqeugIQ_NyMDL-DYgj88WX07M69NgUM8Y4ZUD2V1yohk6MVy5udy4bXdUHikHHy7n-YAXCaPF7K4LlT3uCX0H-jglyWd8oorzd6VXMc7rXOHpoX3bOWV9Vb8Gyo9MgjMnYr48DiqlWlEWgNQgkQfhiaupa5VSKTscAFi5Rp7ptIZ8QOk6KRlvlGynCDzfT6Pdhk3AhIiSs_zPNApBFV84gRjfEm64eQqxo7HPd_8l7SaiRuOeLHCQmLqmPGSSWsGlujDp80rPq6xrbHEksqxRsmkdRCSBHiVT65ihpDbhDTg36BTdBIquO8pbb-fu4-m2t1b9pJ--r2epvNl-4jCjrbYTOjQQG7bHqYFXtu8H0BMqDVzQ
openUrl	ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Combining+Data+Reuse+With+Data-Level+Parallelization+for+FPGA-Targeted+Hardware+Compilation%3A+A+Geometric+Programming+Framework&rft.jtitle=IEEE+transactions+on+computer-aided+design+of+integrated+circuits+and+systems&rft.au=Qiang+Liu&rft.au=Constantinides%2C+G.A.&rft.au=Masselos%2C+K.&rft.au=Cheung%2C+P.&rft.date=2009-03-01&rft.pub=IEEE&rft.issn=0278-0070&rft.eissn=1937-4151&rft.volume=28&rft.issue=3&rft.spage=305&rft.epage=315&rft_id=info:doi/10.1109%2FTCAD.2009.2013541&rft.externalDocID=4785342
thumbnail_l	http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=0278-0070&client=summon
thumbnail_m	http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=0278-0070&client=summon
thumbnail_s	http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=0278-0070&client=summon