Wednesday, October 6, 2021

Music video emotion classification using slow–fast audio–video network and unsupervised ...

Music and video information is processed through a multimodal architecture with audio–video information exchange and a boosting method. The general 2D and 3D ...

source https://www.nature.com/articles/s41598-021-98856-2