High Performance Implementation of MPI Datatype Communication over InfiniBand
J. Wu, P. Wyckoff, D. Panda
International Parallel and Distributed Processing Symposium (IPDPS 04),
Apr 2004.