Multi-Sentence Video Captioning using Content-oriented Beam Searching and Multi-stage Refining Algorithm

Authors:

Highlights:

• A new multi-sentence video captioning algorithm is proposed.

• A new content-oriented beam search approach and a multi-stage refining method are used.

• A new neural network is proposed to measure the relevance between a sentence and a video.

• The structural dictionary of sentences is used to update the probabilities of words.

• An object detector is used to enhance the effectiveness of the algorithm.
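
The highlights name a content-oriented beam search but give no details, so the following is only a minimal sketch of the general idea, not the authors' algorithm. It assumes that candidates are ranked by language-model log-probability plus a weighted relevance score. The names `beam_search`, `toy_lm`, `toy_relevance`, and `weight`, as well as the toy probabilities and the "detected object" set, are hypothetical and chosen purely for illustration.

```python
import math
from typing import Callable, Dict, List, Sequence, Tuple

# Hypothetical interfaces (assumptions, not taken from the paper):
#   next_log_probs(prefix) -> {word: log-probability of word given prefix}
#   relevance(tokens)      -> how well the partial sentence matches the video
NextLogProbs = Callable[[Sequence[str]], Dict[str, float]]
Relevance = Callable[[Sequence[str]], float]


def beam_search(next_log_probs: NextLogProbs, relevance: Relevance,
                beam_width: int = 3, max_len: int = 10,
                weight: float = 1.0, eos: str = "<eos>") -> List[str]:
    """Beam search that ranks candidates by log-prob + weight * relevance."""
    def score(item: Tuple[List[str], float]) -> float:
        tokens, logp = item
        return logp + weight * relevance(tokens)

    beams: List[Tuple[List[str], float]] = [([], 0.0)]
    finished: List[Tuple[List[str], float]] = []
    for _ in range(max_len):
        # Expand every live beam by every possible next word.
        candidates = [(tokens + [word], logp + lp)
                      for tokens, logp in beams
                      for word, lp in next_log_probs(tokens).items()]
        if not candidates:
            break
        candidates.sort(key=score, reverse=True)
        beams = []
        for item in candidates[:beam_width]:
            # Sentences that emitted the end token leave the beam.
            (finished if item[0][-1] == eos else beams).append(item)
        if not beams:
            break
    finished.extend(beams)  # keep unfinished sentences as a fallback
    return max(finished, key=score)[0]


def toy_lm(prefix: Sequence[str]) -> Dict[str, float]:
    """Toy language model with made-up probabilities."""
    table = {
        (): {"a": math.log(0.6), "the": math.log(0.4)},
        ("a",): {"man": math.log(0.7), "dog": math.log(0.3)},
        ("the",): {"man": math.log(0.5), "dog": math.log(0.5)},
    }
    return table.get(tuple(prefix), {"<eos>": 0.0})


def toy_relevance(tokens: Sequence[str]) -> float:
    """Toy relevance: count words matching a made-up set of detected objects."""
    detected_objects = {"dog"}
    return float(sum(1 for t in tokens if t in detected_objects))


if __name__ == "__main__":
    print(beam_search(toy_lm, toy_relevance, beam_width=2))
    print(beam_search(toy_lm, toy_relevance, beam_width=2, weight=0.0))
```

With `weight=0.0` the search reduces to ordinary beam search and returns the most probable sentence under the toy model. With a positive weight, sentences that mention the "detected" object are preferred even when the language model alone ranks them lower.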

Keywords: Multi-sentence video captioning, Beam search algorithm, Multimodal relevance measure network

Review history: Received 19 December 2019, Revised 18 April 2020, Accepted 12 May 2020, Available online 16 June 2020, Version of Record 16 June 2020.

Paper URL: https://doi.org/10.1016/j.ipm.2020.102302