Timing on NCCL tests must be done with MPI_Wtime
Timing on NCCL tests must be done with MPI_Wtime