HPC

Optimizing Tensor Decomposition on HPC Systems

We discuss our experience in optimizing the CP and Tucker decomposition algorithms for sparse datasets on a distributed system.

Distributed Tucker for Sparse Tensors

We discuss our experience in optimizing the Tucker decomposition for sparse datasets on a distributed system.

Blocking Optimization for Sparse MTTKRP

We discuss our experience in optimizing the sparse MTTKRP kernel using various blocking techniques.