Multi-Modal Deep Learning For Computer Vision And Its Application