Multi-modal deep network