Empowering RCT with Multi-site Multi-source RWD: a Statistical Learning Perspective