Exploiting Maximal Overlap for Non-Contiguous Data Movement Processing on Modern GPU-enabled Systems
C. Chu, K. Hamidouche, A. Venkatesh, D. Banerjee, H. Subramoni, D. Panda
The 30th IEEE International Parallel & Distributed Processing Symposium (IPDPS '16),
May 2016.