Cross-Modal Learning From Visual Information For Activity Recognition On Inertial Sensors