Publications | Jee Whan Choi

Jan Laukemann, Ahmed E Helal, S Isaac Geronimo Anderson, Fabio Checconi, Yongseok Soh, Jesmin Jahan Tithi, Teresa Ranadive, Brian J Gravelle, Fabrizio Petrini, Jee Whan Choi (2025). Accelerating sparse tensor decomposition using adaptive linearized representation. IEEE Transactions on Parallel and Distributed Systems (TPDS).

Yongseok Soh, Ramakrishnan Kannan, Piyush Sao, Jee Whan Choi (2024). Accelerated Constrained Sparse Tensor Factorization on Massively Parallel Architectures. The 53rd International Conference on Parallel Processing (ICPP’24).

Alexandre Chen, Brittany A. Erickson, Jeremy E Kozdon, Jee Whan Choi (2024). Matrix-free SBP-SAT finite difference methods and the multigrid preconditioner on GPUs. The 38th ACM International Conference on Supercomputing (ICS’24).

Joseph Mclaughlin, Jee Whan Choi, Ramakrishnan Durairajan (2023). ×Grid: A Location-oriented Topology Design for LEO Satellites. LEO-NET’23: Proceedings of the 1st ACM Workshop on LEO Networking and Communication.

Akash Dutta, Jee Whan Choi, Ali Jannesari (2023). Power Constrained Autotuning using Graph Neural Networks. 37th IEEE International Parallel and Distributed Processing Symposium (IPDPS’23).

Yongseok Soh, Ahmed E. Helal, Fabio Checconi, Jan Laukemann, Jesmin Jahan Tithi, Teresa Ranadive, Fabrizio Petrini, Jee W. Choi (2023). Dynamic Tensor Linearization and Time Slicing for Efficient Factorization of Infinite Data Streams. 37th IEEE International Parallel and Distributed Processing Symposium (IPDPS’23).

Andy Nguyen, Ahmed E. Helal, Fabio Checconi, Jan Laukemann, Jesmin Jahan Tithi, Yongseok Soh, Teresa Ranadive, Fabrizio Petrini, Jee W. Choi (2022). Efficient, Out-of-memory Sparse MTTKRP on Massively Parallel Architectures. Proceedings of the 36th ACM International Conference on Supercomputing.

Sean Isaac Geronimo Anderson, Keita Teranishi, Daniel M. Dunlavy, Jee W. Choi (2021). Extended Abstract: Performance-Portable Sparse Tensor Decomposition Kernels on Emerging Parallel Architectures. The 25th Annual IEEE Conference on High Performance Extreme Computing (HPEC’21).

Ahmed E. Helal, Jan Laukemann, Fabio Checconi, Jesmin Jahan Tithi, Teresa Ranadive, Fabrizio Petrini, Jee W. Choi (2021). ALTO: Adaptive Linearized Storage of Sparse Tensors. The 35th ACM International Conference on Supercomputing (ICS’21).

Yongseok Soh, Patrick Flick, Xing Liu, Shaden Smith, Fabio Checconi, Fabrizio Petrini, Jee Choi (2021). High Performance Streaming Tensor Decomposition. 35th IEEE International Parallel and Distributed Processing Symposium (IPDPS’21).

Venkatesan T. Chakaravarthy, Jee W. Choi, Douglas J. Joseph, Prakash Murali, Yogish Sabharwal, S. Shivmaran, Dheeraj Sreedhar (2018). On Optimizing Distributed Tucker Decomposition for Sparse Tensors. The 32nd ACM International Conference on Supercomputing (ICS’18).

Jee W. Choi, Xing Liu, Venkatesan T. Chakaravarthy (2018). High-performance Dense Tucker Decomposition on GPU Clusters. The International Conference for High Performance Computing, Networking, Storage, and Analysis (SC’18).

Jee W. Choi, Xing Liu, Shaden Smith, Tyler Simon (2018). Blocking Optimization Techniques for Sparse Tensor Computation. 32nd IEEE International Parallel and Distributed Processing Symposium (IPDPS’18).

Venkatesan T. Chakaravarthy, Jee W. Choi, Xing Liu, Douglas J. Joseph, Prakash Murali, Yogish Sabharwal, Dheeraj Sreedhar (2017). On Optimizing Distributed Tucker Decomposition for Dense Tensors. 31st IEEE International Parallel and Distributed Processing Symposium (IPDPS’17).

Jiajia Li, Jee W. Choi, Ioakeim Perros, Jimeng Sun, Richard Vuduc (2017). Model-Driven Sparse CP Decomposition for High-Order Tensors. 31st IEEE International Parallel and Distributed Processing Symposium (IPDPS’17).

Daniele Buono, Fausto Artico, Fabio Checconi, Jee W. Choi, Xinyu Que, Lars Schneidenbach (2017). Data Analytics with NVLink: An SpMV Case Study. Proceedings of the Computing Frontiers Conference.

Jee W. Choi, Richard Vuduc (2016). Analyzing the Energy Efficiency of the Fast Multipole Method Using a DVFS-Aware Energy Model. 30th IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW).

Xing Liu, Daniele Buono, Fabio Checconi, Jee W. Choi, Xinyu Que, Fabrizio Petrini, John A. Gunnels, Jeffrey A. Stuecheli (2016). An Early Performance Study of Large-Scale POWER8 SMP Systems. 30th IEEE International Parallel and Distributed Processing Symposium (IPDPS’16).

Jee W. Choi (2015). Power and performance modeling for high-performance computing algorithms.

Jee W. Choi, Marat Dukhan, Xing Liu, Richard Vuduc (2014). Algorithmic time, energy, and power on candidate HPC compute building blocks. 28th IEEE International Parallel and Distributed Processing Symposium (IPDPS’14).

Jee W. Choi, Aparna Chandramowlishwaran, Kamesh Madduri, Richard Vuduc (2014). A CPU:GPU hybrid implementation and model-driven scheduling of the fast multipole method. Proceedings of Workshop on General Purpose Processing Using GPUs.

Jee W. Choi, Richard Vuduc (2013). How much (execution) time and energy does my algorithm cost? XRDS.

Jee W. Choi, Richard Vuduc, Robert Fowler, Dan Bedard (2013). A roofline model of energy. 27th IEEE International Parallel and Distributed Processing Symposium (IPDPS’13).

Richard Vuduc, Jee W. Choi (2013). A brief history and introduction to GPGPU. Modern Accelerator Technologies for Geographic Information Science.

Jee W. Choi, Richard Vuduc (2012). A roofline model of energy.

Hyesoon Kim, Richard Vuduc, Sara Baghsorkhi, Jee W. Choi, Wen-mei Hwu (2012). Performance analysis and tuning for general purpose graphics processing units (GPGPU).

Aparna Chandramowlishwaran, Jee W. Choi, Kamesh Madduri, Richard Vuduc (2012). Towards a communication optimal fast multipole method and its implications for exascale. Proc. ACM Symp. Parallel Algorithms and Architectures (SPAA).

Jee W. Choi, Richard Vuduc (2012). Modeling and Analysis for Performance and Power. IEEE 26th International Parallel and Distributed Processing Symposium Workshops and PhD Forum (IPDPSW).

Richard Vuduc, Kenneth Czechowski, A. Chandramowlishwaran, Jee W. Choi (2012). Courses in high-performance computing for scientists and engineers. IEEE 26th International Parallel and Distributed Processing Symposium Workshops and PhD Forum (IPDPSW).

M. Smelyanskiy, K. Vaidyanathan, Jee W. Choi, B. Joo, J. Chhugani, M.A. Clark, P. Dubey (2011). High-performance lattice QCD for multi-core based parallel systems using a cache-friendly hybrid threaded-MPI approach. 2011 International Conference for High Performance Computing, Networking, Storage and Analysis (SC’11).

Senyo Apewokin, Brian Valentine, Jee W. Choi, Linda Wills, Scott Wills (2011). Real-time adaptive background modeling for multicore embedded systems. Journal of Signal Processing Systems.

Sam Williams, Nathan Bell, Jee W. Choi, Michael Garland, Leonid Oliker, Richard Vuduc (2010). Sparse matrix vector multiplication on multicore and accelerator systems. Scientific Computing with Multicore Processors and Accelerators.

Richard Vuduc, Aparna Chandramowlishwaran, Jee W. Choi, Murat Guney, Aashay Shringarpure (2010). On the limits of GPU acceleration. Proceedings of the 2nd USENIX Conference on Hot Topics in Parallelism.

Jee W. Choi, Amik Singh, Richard Vuduc (2010). Model-driven autotuning of sparse matrix-vector multiply on GPUs. 15th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming (PPoPP’10).

B. Valentine, Jee W. Choi, S. Apewokin, L. Wills, S. Wills (2008). Bypassing BigBackground: An efficient hybrid background modeling algorithm for embedded video surveillance. Second ACM/IEEE International Conference on Distributed Smart Cameras, 2008 (ICDSC 2008).

Jee W. Choi, S. Apewokin, B. E. Valentine, D. S. Wills, L. M. Wills (2008). Edge noise removal in multimodal background modeling techniques. Electronic Imaging, 2008.

Jee W. Choi (2004). Reducing communication through buffers on a SIMD architecture.