A similarity measure between videos using alignment, graphical and speech features

作者:

Highlights:

摘要

A novel video similarity measure is proposed by using visual features, alignment distances and speech transcripts. First, video files are represented by a sequence of segments each of which contains colour histograms, starting time, and a set of phonemes. After, textual, alignment and visual features are extracted of these segments. The following step, bipartite matching and statistical features are applied to find correspondences between segments. Finally, a similarity is calculated between videos. Experiments have been carried out and promising results have been obtained.

论文关键词:Information retrieval,Content segmentation,Bipartite matching

论文评审过程:Available online 6 March 2012.

论文官网地址:https://doi.org/10.1016/j.eswa.2012.02.169