On detecting the playing/non-playing activity of musicians in symphonic music videos
Authors:
Highlights:
• We propose a semi-automatic annotation system for videos of large symphonic orchestras.
• We leverage video redundancy, image clustering, and human annotation.
• Our method successfully deals with several intra-class variability issues.
• Human annotation effort is reduced while maintaining a high level of output quality.
• We provide a comprehensive analysis of the impact of the different modules on overall performance.
Keywords: Cross-modal analysis, Music information retrieval, Human-object interaction, Diarization, Clustering
Review history: Received 20 December 2014, Revised 30 May 2015, Accepted 21 September 2015, Available online 1 April 2016, Version of Record 1 April 2016.
Official paper URL: https://doi.org/10.1016/j.cviu.2015.09.009