SR-clustering: Semantic regularized clustering for egocentric photo streams segmentation

Authors:

Highlights:

Abstract

While wearable cameras are becoming increasingly popular, locating relevant information in large unstructured collections of egocentric images is still a tedious and time-consuming process. This paper addresses the problem of organizing egocentric photo streams acquired by a wearable camera into semantically meaningful segments, an important step towards automatically annotating these photos for browsing and retrieval. In the proposed method, contextual and semantic information is first extracted for each image using a Convolutional Neural Network approach. Next, a vocabulary of concepts is defined in a semantic space by relying on linguistic information. Finally, by exploiting the temporal coherence of concepts in photo streams, images that share contextual and semantic attributes are grouped together. The resulting temporal segmentation is particularly suited for further analysis, ranging from event recognition to semantic indexing and summarization. Experimental results on an egocentric dataset of nearly 31,000 images show the superiority of the proposed approach over state-of-the-art methods.
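
The final grouping step described in the abstract, where temporally adjacent images with similar attributes are merged into segments, can be illustrated with a minimal sketch. This is not the paper's SR-clustering method: it assumes each image is already represented by a small feature vector (here, made-up concept scores) and simply starts a new segment whenever the cosine similarity between consecutive frames falls below a threshold. The names `cosine_similarity`, `segment_stream`, and the threshold value are hypothetical and chosen only for illustration.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity between two equal-length feature vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0  # treat an all-zero vector as dissimilar to everything
    return dot / (norm_a * norm_b)


def segment_stream(features, threshold=0.8):
    """Split a time-ordered list of feature vectors into segments.

    Returns a list of (start_index, end_index) pairs, both inclusive.
    A boundary is placed wherever two consecutive frames are less
    similar than `threshold` (an assumed, illustrative value).
    """
    if not features:
        return []
    segments = []
    start = 0
    for i in range(1, len(features)):
        if cosine_similarity(features[i - 1], features[i]) < threshold:
            segments.append((start, i - 1))
            start = i
    segments.append((start, len(features) - 1))
    return segments


# Toy photo stream: each row is a made-up vector of concept scores.
stream = [
    [0.9, 0.1, 0.0],
    [0.8, 0.2, 0.0],
    [0.0, 0.1, 0.9],
    [0.1, 0.0, 0.9],
    [0.1, 0.9, 0.0],
]
segments = segment_stream(stream, threshold=0.8)
print(segments)
```

Comparing only consecutive frames keeps the sketch short, but it is sensitive to single noisy images; the approach summarized above instead combines contextual and semantic attributes over the whole stream.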

Keywords:

Review history: Received 14 January 2016, Revised 11 October 2016, Accepted 16 October 2016, Available online 19 October 2016, Version of Record 17 January 2017.

Official paper URL: https://doi.org/10.1016/j.cviu.2016.10.005