Jee Whan Choi
Home
Teaching
Talks
Publications
Contact
Publications
Type
Conference paper
Journal article
Report
Book
Book section
Date
2023
2022
2021
2018
2017
2016
2015
2014
2013
2012
2011
2010
2008
2004
Joseph Mclaughlin
,
Jee Whan Choi
,
Ramakrishnan Durairajan
(2023).
×Grid: A Location-oriented Topology Design for LEO Satellites
.
LEO-NET’23: Proceedings of the 1st ACM Workshop on LEO Networking and Communication
.
PDF
Cite
DOI
Akash Dutta
,
Jee Whan Choi
,
Ali Jannesari
(2023).
Power Constrained Autotuning using Graph Neural Networks
.
37th IEEE International Parallel and Distributed Processing Symposium (IPDPS’23)
.
PDF
Cite
DOI
Yongseok Soh
,
Ahmed E. Helal
,
Fabio Checconi
,
Jan Laukemann
,
Jesmin Jahan Tithis
,
Teresa Ranadive
,
Fabrizio Petrini
,
Jee W. Choi
(2023).
Dynamic Tensor Linearization and Time Slicing for Efficient Factorization of Infinite Data Streams
.
37th IEEE International Parallel and Distributed Processing Symposium (IPDPS’23)
.
PDF
Cite
DOI
Andy Nguyen
,
Ahmed E. Helal
,
Fabio Checconi
,
Jan Laukemann
,
Jesmin Jahan Tithis
,
Yongseok Soh
,
Teresa Ranadive
,
Fabrizio Petrini
,
Jee W. Choi
(2022).
Efficient, Out-of-memory Sparse MTTKRP on Massively Parallel Architectures
.
Proceedings of the 36th ACM International Conference on Supercomputing
.
PDF
Cite
DOI
Sean Isaac Geronimo Anderson
,
Keita Teranishi
,
Daniel M. Dunlavy
,
Jee W. Choi
(2021).
Extended Abstract: Performance-Portable Sparse Tensor Decomposition Kernels on Emerging Parallel Architectures
.
The 25th Annual IEEE Conference on High Performance Extreme Computing (HPEC’21)
.
PDF
Cite
DOI
Ahmed E. Helal
,
Jan Laukemann
,
Fabio Checconi
,
Jesmin Jahan Tithi
,
Teresa Ranadive
,
Fabrizio Petrini
,
Jee W. Choi
(2021).
ALTO: Adaptive Linearized Storage of Sparse Tensors
.
The 35th ACM International Conference on Supercomputing (ICS’21)
.
PDF
Cite
DOI
Yongseok Soh
,
Patrick Flick
,
Xing Liu
,
Shaden Smith
,
Fabio Checconi
,
Fabrizio Petrini
,
Jee Choi
(2021).
High Performance Streaming Tensor Decomposition
.
35th IEEE International Parallel and Distributed Processing Symposium (IPDPS’21)
.
PDF
Cite
DOI
Venkatesan T. Chakaravarthy
,
Jee W. Choi
,
Douglas J. Joseph
,
Prakash Murali
,
Yogish Sabharwal
,
S. Shivmaran
,
Dheeraj Sreedhar
(2018).
On Optimizing Distributed Tucker Decomposition for Sparse Tensors
.
The 32nd ACM International Conference on Supercomputing (ICS’18)
.
PDF
Cite
DOI
Jee W. Choi
,
Xing Liu
,
Venkatesan T. Chakaravarthy
(2018).
High-performance Dense Tucker Decomposition on GPU Clusters
.
The International Conference for High Performance Computing, Networking, Storage, and Analysis (SC’18)
.
PDF
Cite
DOI
Jee W. Choi
,
Xing Liu
,
Shaden Smith
,
Tyler Simon
(2018).
Blocking Optimization Techniques for Sparse Tensor Computation
.
32nd IEEE International Parallel and Distributed Processing Symposium (IPDPS’18)
.
PDF
Cite
DOI
Venkatesan T. Chakaravarthy
,
Jee W. Choi
,
Xing Liu
,
Douglas J. Joseph
,
Prakash Murali
,
Yogish Sabharwal
,
Dheeraj Sreedhar
(2017).
On Optimizing Distributed Tucker Decomposition for Dense Tensors
.
31st IEEE International Parallel and Distributed Processing Symposium (IPDPS’17)
.
PDF
Cite
DOI
Jiajia Li
,
Jee W. Choi
,
Ioakeim Perros
,
Jimeng Sun
,
Richard Vuduc
(2017).
Model-Driven Sparse CP Decomposition for High-Order Tensors
.
31st IEEE International Parallel and Distributed Processing Symposium (IPDPS’17)
.
PDF
Cite
DOI
Daniele Buono
,
Fausto Artico
,
Fabio Checconi
,
Jee W. Choi
,
Xinyu Que
,
Lars Schneidenbach
(2017).
Data Analytics with NVLink: An SpMV Case Study
.
Proceedings of the Computing Frontiers Conference
.
PDF
Cite
DOI
Jee W. Choi
,
Richard Vuduc
(2016).
Analyzing the Energy Efficiency of the Fast Multipole Method Using a DVFS-Aware Energy Model
.
30th IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW)
.
PDF
Cite
DOI
Xing Liu
,
Daniele Buono
,
Fabio Checconi
,
Jee W. Choi
,
Xinyu Que
,
Fabrio Petrini
,
John A. Gunnels
,
Jeffrey A. Stuecheli
(2016).
An Early Performance Study of Large-Scale POWER8 SMP Systems
.
30th IEEE International Parallel and Distributed Processing Symposium (IPDPS’16)
.
PDF
Cite
DOI
Jee W. Choi
(2015).
Power and performance modeling for high-performance computing algorithms
.
PDF
Cite
Jee W. Choi
,
Marat Dukhan
,
Xing Liu
,
Richard Vuduc
(2014).
Algorithmic time, energy, and power on candidate HPC compute building blocks
.
28th IEEE International Parallel and Distributed Processing Symposium (IPDPS’14)
.
PDF
Cite
DOI
Jee W. Choi
,
Aparna Chandramowlishwaran
,
Kamesh Madduri
,
Richard Vuduc
(2014).
A CPU:GPU hybrid implementation and model-driven scheduling of the fast multipole method
.
Proceedings of Workshop on General Purpose Processing Using GPUs
.
PDF
Cite
DOI
Jee W. Choi
,
Richard Vuduc
(2013).
How much (execution) time and energy does my algorithm cost?
.
XRDS
.
PDF
Cite
DOI
Jee W. Choi
,
Richard Vuduc
,
Robert Fowler
,
Dan Bedard
(2013).
A roofline model of energy
.
27th IEEE International Parallel and Distributed Processing Symposium (IPDPS’13)
.
PDF
Cite
DOI
Richard Vuduc
,
Jee W. Choi
(2013).
A brief history and introduction to GPGPU
.
Modern Accelerator Technologies for Geographic Information Science
.
PDF
Cite
DOI
Jee W. Choi
,
Richard Vuduc
(2012).
A roofline model of energy
.
PDF
Cite
Hyesoon Kim
,
Richard Vuduc
,
Sara Baghsorkhisorkhi
,
Jee W. Choi
,
Wen-mei Hwu
(2012).
Performance analysis and tuning for general purpose graphics processing units (GPGPU)
.
PDF
Cite
DOI
Aparna Chandramowlishwaran
,
Jee W. Choi
,
Kamesh Madduri
,
Richard Vuduc
(2012).
Towards a communication optimal fast multipole method and its implications for exascale
.
Proc.~ACM Symp. Parallel Algorithms and Architectures (SPAA)
.
PDF
Cite
DOI
Jee W. Choi
,
Richard Vuduc
(2012).
Modeling and Analysis for Performance and Power
.
IEEE 26th International Parallel and Distributed Processing Symposium Workshops PhD Forum (IPDPSW)
.
PDF
Cite
DOI
Richard Vuduc
,
Kenneth Czechowski
,
A. Chandramowlishwaran
,
Jee W. Choi
(2012).
Courses in high-performance computing for scientists and engineers
.
IEEE 26th International Parallel and Distributed Processing Symposium Workshops and PhD Forum (IPDPSW)
.
PDF
Cite
DOI
M. Smelyanskiy
,
K. Vaidyanathan
,
Jee W. Choi
,
B. Joo
,
J. Chhugani
,
M.A. Clark
,
P. Dubey
(2011).
High-performance lattice QCD for multi-core based parallel systems using a cache-friendly hybrid threaded-MPI approach
.
High Performance Computing, Networking, Storage and Analysis (SC), 2011 International Conference for
.
PDF
Cite
DOI
Senyo Apewokin
,
Brian Valentine
,
Jee W. Choi
,
Linda Wills
,
Scott Wills
(2011).
Real-time adaptive background modeling for multicore embedded systems
.
Journal of Signal Processing Systems
.
PDF
Cite
DOI
Sam Williams
,
Nathan Bell
,
Jee W. Choi
,
Michael Garland
,
Leonid Oliker
,
Richard Vuduc
(2010).
Sparse matrix vector multiplication on multicore and accelerator systems
.
Scientific Computing with Multicore Processors and Accelerators
.
PDF
Cite
Richard Vuduc
,
Aparna Chandramowlishwaran
,
Jee W. Choi
,
Murat Guney
,
Aashay Shringarpure
(2010).
On the limits of GPU acceleration
.
Proceedings of the 2nd USENIX inproceedings on Hot topics in parallelism
.
PDF
Cite
Jee W. Choi
,
Amik Singh
,
Richard Vuduc
(2010).
Model-driven autotuning of sparse matrix-vector multiply on GPUs
.
15th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming (PPoPP’11)
.
PDF
Cite
DOI
B. Valentine
,
Jee W. Choi
,
S. Apewokin
,
L. Wills
,
S. Wills
(2008).
Bypassing BigBackground: An efficient hybrid background modeling algorithm for embedded video surveillance
.
Second ACM/IEEE International Conference on Distributed Smart Cameras, 2008 (ICDSC 2008)
.
PDF
Cite
DOI
Jee W. Choi
,
S. Apewokin
,
B. E. Valentine
,
D. S. Wills
,
L. M. Wills
(2008).
Edge noise removal in multimodal background modeling techniques
.
Electronic Imaging, 2008
.
PDF
Cite
DOI
Jee W. Choi
(2004).
Reducing communication through buffers on a SIMD architecture
.
PDF
Cite
Cite
×