Jee W. Choi | Jee Whan Choi

Latest

Dynamic Tensor Linearization and Time Slicing for Efficient Factorization of Infinite Data Streams
Efficient, Out-of-memory Sparse MTTKRP on Massively Parallel Architectures
Extended Abstract: Performance-Portable Sparse Tensor Decomposition Kernels on Emerging Parallel Architectures
ALTO: Adaptive Linearized Storage of Sparse Tensors
Blocking Optimization Techniques for Sparse Tensor Computation
High-performance Dense Tucker Decomposition on GPU Clusters
On Optimizing Distributed Tucker Decomposition for Sparse Tensors
Data Analytics with NVLink: An SpMV Case Study
Model-Driven Sparse CP Decomposition for High-Order Tensors
On Optimizing Distributed Tucker Decomposition for Dense Tensors
An Early Performance Study of Large-Scale POWER8 SMP Systems
Analyzing the Energy Efficiency of the Fast Multipole Method Using a DVFS-Aware Energy Model
Power and performance modeling for high-performance computing algorithms
A CPU:GPU hybrid implementation and model-driven scheduling of the fast multipole method
Algorithmic time, energy, and power on candidate HPC compute building blocks
How much (execution) time and energy does my algorithm cost?
A brief history and introduction to GPGPU
A roofline model of energy
A roofline model of energy
Performance analysis and tuning for general purpose graphics processing units (GPGPU)
Towards a communication optimal fast multipole method and its implications for exascale
Courses in high-performance computing for scientists and engineers
Modeling and Analysis for Performance and Power
High-performance lattice QCD for multi-core based parallel systems using a cache-friendly hybrid threaded-MPI approach
Real-time adaptive background modeling for multicore embedded systems
Model-driven autotuning of sparse matrix-vector multiply on GPUs
On the limits of GPU acceleration
Sparse matrix vector multiplication on multicore and accelerator systems
Bypassing BigBackground: An efficient hybrid background modeling algorithm for embedded video surveillance
Edge noise removal in multimodal background modeling techniques
Reducing communication through buffers on a SIMD architecture