Fisher Kernel Temporal Variation-based Relevance Feedback for video retrieval

作者:

Highlights:

摘要

This paper proposes a novel framework for Relevance Feedback based on the Fisher Kernel (FK). Specifically, we train a Gaussian Mixture Model (GMM) on the top retrieval results (without supervision) and use this to create a FK representation, which is therefore specialized in modelling the most relevant examples. We use the FK representation to explicitly capture temporal variation in video via frame-based features taken at different time intervals. While the GMM is being trained, a user selects from the top examples those which he is looking for. This feedback is used to train a Support Vector Machine on the FK representation, which is then applied to re-rank the top retrieved results. We show that our approach outperforms other state-of-the-art relevance feedback methods. Experiments were carried out on the Blip10000, UCF50, UCF101 and ADL standard datasets using a broad range of multi-modal content descriptors (visual, audio, and text).

论文关键词:

论文评审过程:Received 29 January 2015, Revised 12 September 2015, Accepted 5 October 2015, Available online 22 October 2015, Version of Record 13 January 2016.

论文官网地址:https://doi.org/10.1016/j.cviu.2015.10.005