Failure avoidance techniques for HPC systems based on failure prediction