Search Results - "Jablin, Thomas B." :: Katalog Arama

1
Ten Lessons From Three Generations Shaped Google's TPUv4i : Industrial Product by Jouppi, Norman P., Hyun Yoon, Doe, Ashcraft, Matthew, Gottscho, Mark, Jablin, Thomas B., Kurian, George, Laudon, James, Li, Sheng, Ma, Peter, Ma, Xiaoyu, Norrie, Thomas, Patil, Nishant, Prasad, Sushma, Young, Cliff, Zhou, Zongwei, Patterson, David

Published in 2021 ACM/IEEE 48th Annual International Symposium on Computer Architecture (ISCA) (01-06-2021)
“…Google deployed several TPU generations since 2015, teaching us lessons that changed our views: semi-conductor technology advances unequally; compiler…”

Get full text

Conference Proceeding
QR Code
Save to List

Saved in:
2
MLPerf Inference Benchmark by Reddi, Vijay Janapa, Cheng, Christine, Kanter, David, Mattson, Peter, Schmuelling, Guenther, Wu, Carole-Jean, Anderson, Brian, Breughe, Maximilien, Charlebois, Mark, Chou, William, Chukka, Ramesh, Coleman, Cody, Davis, Sam, Deng, Pan, Diamos, Greg, Duke, Jared, Fick, Dave, Gardner, J. Scott, Hubara, Itay, Idgunji, Sachin, Jablin, Thomas B., Jiao, Jeff, John, Tom St, Kanwar, Pankaj, Lee, David, Liao, Jeffery, Lokhmotov, Anton, Massa, Francisco, Meng, Peng, Micikevicius, Paulius, Osborne, Colin, Pekhimenko, Gennady, Rajan, Arun Tejusve Raghunath, Sequeira, Dilip, Sirasao, Ashish, Sun, Fei, Tang, Hanlin, Thomson, Michael, Wei, Frank, Wu, Ephrem, Xu, Lingjie, Yamada, Koichi, Yu, Bing, Yuan, George, Zhong, Aaron, Zhang, Peizhao, Zhou, Yuchen

Published in 2020 ACM/IEEE 47th Annual International Symposium on Computer Architecture (ISCA) (01-05-2020)
“…Machine-learning (ML) hardware and software system demand is burgeoning. Driven by ML applications, the number of different ML inference systems has exploded…”

Get full text

Conference Proceeding
QR Code
Save to List

Saved in:
3
Automatic execution of single-GPU computations across multiple GPUs by Cabezas, Javier, Vilanova, Lluis, Geladeno, Isaac, Jablin, Thomas B., Navarro, Nacho, Wen-mei Hwu

Published in 2014 23rd International Conference on Parallel Architecture and Compilation Techniques (PACT) (24-08-2014)
“…We present AMGE, a programming framework and runtime system to decompose data and GPU kernels and execute them on multiple GPUs concurrently. AMGE exploits the…”

Get full text

Conference Proceeding
QR Code
Save to List

Saved in:
4
Warp-aware trace scheduling for GPUs by Jablin, James A., Jablin, Thomas B., Mutlu, Onur, Herlihy, Maurice

Published in 2014 23rd International Conference on Parallel Architecture and Compilation Techniques (PACT) (01-08-2014)
“…GPU performance depends not only on thread/warp level parallelism (TLP) but also on instruction-level parallelism (ILP). It is not enough to schedule…”

Get full text

Conference Proceeding
QR Code
Save to List

Saved in:
5
Speculatively exploiting cross-invocation parallelism by Jialu Huang, Prabhu, Prakash, Jablin, Thomas B., Ghosh, Soumyadeep, Apostolakis, Sotiris, Lee, Jae W., August, David I.

Published in 2016 International Conference on Parallel Architecture and Compilation Techniques (PACT) (01-09-2016)
“…Automatic parallelization has shown promise in producing scalable multi-threaded programs for multi-core architectures. Most existing automatic techniques…”

Get full text

Conference Proceeding
QR Code
Save to List

Saved in:
6
Automatic Parallelization for GPUs by Jablin, Thomas B

Published 01-01-2013
“…GPUs are flexible parallel processors capable of accelerating real applications. To exploit them, programmers rewrite programs in new languages using intimate…”

Get full text

Dissertation
QR Code
Save to List

Saved in:
7
Chai: Collaborative heterogeneous applications for integrated-architectures by Gomez-Luna, Juan, Hajj, Izzat El, Chang, Li-Wen, Garcia-Flores, Victor, de Gonzalo, Simon Garcia, Jablin, Thomas B., Pena, Antonio J., Hwu, Wen-mei

Published in 2017 IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS) (01-04-2017)
“…Heterogeneous system architectures are evolving towards tighter integration among devices, with emerging features such as shared virtual memory, memory…”

Get full text

Conference Proceeding
QR Code
Save to List

Saved in:
8
A collaborative dependence analysis framework by Johnson, Nick P., Fix, Jordan, Beard, Stephen R., Taewook Oh, Jablin, Thomas B., August, David I.

Published in 2017 IEEE/ACM International Symposium on Code Generation and Optimization (CGO) (01-02-2017)
“…Compiler optimizations discover facts about program behavior by querying static analysis. However, developing or extending precise analysis is difficult. Some…”

Get full text

Conference Proceeding
QR Code
Save to List

Saved in:
9
Automatic Parallelization for GPUs by Jablin, Thomas B

“…GPUs are flexible parallel processors capable of accelerating real applications. To exploit them, programmers rewrite programs in new languages using intimate…”

Get full text

Dissertation
QR Code
Save to List

Saved in:
10
Automatically exploiting cross-invocation parallelism using runtime information by August, David I., Huang, Jialu, Beard, Stephen R., Johnson, Nick P., Jablin, Thomas B.

Published in Proceedings of the 2013 IEEE/ACM International Symposium on Code Generation and Optimization (CGO) (23-02-2013)
“…Automatic parallelization is a promising approach to producing scalable multi-threaded programs for multicore architectures. Many existing automatic techniques…”

Get full text

Conference Proceeding
QR Code
Save to List

Saved in:
11
MLPerf Inference Benchmark by Reddi, Vijay Janapa, Cheng, Christine, Kanter, David, Mattson, Peter, Schmuelling, Guenther, Wu, Carole-Jean, Anderson, Brian, Breughe, Maximilien, Charlebois, Mark, Chou, William, Chukka, Ramesh, Coleman, Cody, Davis, Sam, Deng, Pan, Diamos, Greg, Duke, Jared, Fick, Dave, Gardner, J. Scott, Hubara, Itay, Idgunji, Sachin, Jablin, Thomas B, Jiao, Jeff, John, Tom St, Kanwar, Pankaj, Lee, David, Liao, Jeffery, Lokhmotov, Anton, Massa, Francisco, Meng, Peng, Micikevicius, Paulius, Osborne, Colin, Pekhimenko, Gennady, Rajan, Arun Tejusve Raghunath, Sequeira, Dilip, Sirasao, Ashish, Sun, Fei, Tang, Hanlin, Thomson, Michael, Wei, Frank, Wu, Ephrem, Xu, Lingjie, Yamada, Koichi, Yu, Bing, Yuan, George, Zhong, Aaron, Zhang, Peizhao, Zhou, Yuchen

Published 06-11-2019
“…Machine-learning (ML) hardware and software system demand is burgeoning. Driven by ML applications, the number of different ML inference systems has exploded…”

Get full text

Journal Article
QR Code
Save to List

Saved in:
12
A survey of the practice of computational science by Prabhu, Prakash, Jablin, Thomas B., Raman, Arun, Yun Zhang, Jialu Huang, Hanjun Kim, Johnson, Nick P., Feng Liu, Ghosh, Soumyadeep, Beard, Stephen, Taewook Oh, Zoufaly, Matthew, Walker, David, August, David I.

Published in 2011 International Conference for High Performance Computing, Networking, Storage and Analysis (SC) (01-11-2011)
“…Computing plays an indispensable role in scientific research. Presently, researchers in science have different problems, needs, and beliefs about computation…”

Get full text

Conference Proceeding
QR Code
Save to List

Saved in:
13
Lingvo: a Modular and Scalable Framework for Sequence-to-Sequence Modeling by Shen, Jonathan, Nguyen, Patrick, Wu, Yonghui, Chen, Zhifeng, Chen, Mia X, Jia, Ye, Kannan, Anjuli, Sainath, Tara, Cao, Yuan, Chiu, Chung-Cheng, He, Yanzhang, Chorowski, Jan, Hinsu, Smit, Laurenzo, Stella, Qin, James, Firat, Orhan, Macherey, Wolfgang, Gupta, Suyog, Bapna, Ankur, Zhang, Shuyuan, Pang, Ruoming, Weiss, Ron J, Prabhavalkar, Rohit, Liang, Qiao, Jacob, Benoit, Liang, Bowen, Lee, HyoukJoong, Chelba, Ciprian, Jean, Sébastien, Li, Bo, Johnson, Melvin, Anil, Rohan, Tibrewal, Rajat, Liu, Xiaobing, Eriguchi, Akiko, Jaitly, Navdeep, Ari, Naveen, Cherry, Colin, Haghani, Parisa, Good, Otavio, Cheng, Youlong, Alvarez, Raziel, Caswell, Isaac, Hsu, Wei-Ning, Yang, Zongheng, Wang, Kuan-Chieh, Gonina, Ekaterina, Tomanek, Katrin, Vanik, Ben, Wu, Zelin, Jones, Llion, Schuster, Mike, Huang, Yanping, Chen, Dehao, Irie, Kazuki, Foster, George, Richardson, John, Macherey, Klaus, Bruguier, Antoine, Zen, Heiga, Raffel, Colin, Kumar, Shankar, Rao, Kanishka, Rybach, David, Murray, Matthew, Peddinti, Vijayaditya, Krikun, Maxim, Bacchiani, Michiel A. U, Jablin, Thomas B, Suderman, Rob, Williams, Ian, Lee, Benjamin, Bhatia, Deepti, Carlson, Justin, Yavuz, Semih, Zhang, Yu, McGraw, Ian, Galkin, Max, Ge, Qi, Pundak, Golan, Whipkey, Chad, Wang, Todd, Alon, Uri, Lepikhin, Dmitry, Tian, Ye, Sabour, Sara, Chan, William, Toshniwal, Shubham, Liao, Baohua, Nirschl, Michael, Rondon, Pat

Published 21-02-2019
“…Lingvo is a Tensorflow framework offering a complete solution for collaborative deep learning research, with a particular focus towards sequence-to-sequence…”

Get full text

Journal Article
QR Code
Save to List

Saved in: