Title: Memory Debugging of MPI-Parallel Applications in Open MPI
Author(s): Rainer Keller, Shiqing Fan, and Michael Resch
Abstract:
In this paper we describe the implementation of memory
checking functionality based on instrumentation using valgrind. The
combination of valgrind based checking functions within the
MPI-implementation offers superior debugging functionality, for errors
that otherwise are not possible to detect with comparable
MPI-debugging tools. The functionality is integrated into Open MPI as
the so-called memchecker-framework. This allows other memory debuggers
that offer a similar API to be integrated. The tight control of the
user's memory passed to Open MPI, allows not only to find application
errors, but also helps track bugs within Open MPI itself. We describe
the actual checks, classes of errors being found, how memory buffers
internally are being handled, show errors actually found in user's
code and the performance implications of this instrumentation.
Presented: ParCo2007, September 4th - 7th, 2007, Julich, Germany.
Paper:
Bibtex reference:
@INPROCEEDINGS{keller07,
author = "Rainer Keller and Shiqing Fan and Michael Resch",
title = "{Memory debugging of MPI-parallel Applications in Open MPI}",
booktitle = "Proceedings of ParCo'07",
editor = "G.R.~Joubert and C.~Bischof and F.~Peters and T.~Lippert and M.~Bucker and P.~Gibbon and B.~Mohr",
address = "Julich, Germany",
month = "Sep",
year = "2007"
}
|