Improving Communication Performance in GPU-Accelerated HPC Clusters