Perfomance Portablity

Multithreaded sparse matrix-matrix multiplication for many-core and GPU architectures
Performance-portable sparse matrix-matrix multiplication for many-core architectures