Music 2 Video
Similar to Text-Based Media Retrieval, we map two modalities, music and video, into a common semantic space. This allows the recommendation of music tracks that would match well with a specifically identified video clip (e.g., a movie trailer), or to recommend video assets that would go well with a given music track (i.e., to create a music video).