Some MPI collectives have non-blocking variants (added in MPI-3, e.g. MPI_Iallreduce), so subsequent computation can proceed while the collective is in flight on the network.
This might be particularly useful in the pressure solver.
Boost.MPI does not seem to provide non-blocking collectives?
If so, call the MPI C API directly for these?
NOTE: MPI_IBARRIER could be useful