The fastest malloc we have seen; works particularly well with threads and STL
gperftools is a collection comprising a high-performance multi-threaded malloc() implementation and some pretty nifty performance analysis tools.