Multi-Sentence Video Captioning using Content-oriented Beam Searching and Multi-stage Refining Algorithm
作者:
Highlights:
• A new multi-sentence video captioning algorithm is proposed.
• A new content-oriented beam search approach and a multi-stage refining method are used.
• A new neural network is proposed to measure the relevance between a sentence and a video.
• The structural dictionary of sentences is used to update the probabilities of words.
• An object detector is used to enhance the effectiveness of the algorithm.
摘要
•A new multi-sentence video captioning algorithm is proposed.•A new content-oriented beam search approach and a multi-stage refining method are used.•A new neural network is proposed to measure the relevance between a sentence and a video.•The structural dictionary of sentences is used to update the probabilities of words.•An object detector is used to enhance the effectiveness of the algorithm.
论文关键词:Multi-sentence video captioning,Beam search algorithm,Multimodal relevance measure network
论文评审过程:Received 19 December 2019, Revised 18 April 2020, Accepted 12 May 2020, Available online 16 June 2020, Version of Record 16 June 2020.
论文官网地址:https://doi.org/10.1016/j.ipm.2020.102302