VQA as a factoid question answering problem: A novel approach for knowledge-aware and explainable visual question answering
Authors:
Highlights:
• The proposed model is a free-form, open-ended, knowledge-aware VQA model.
• VQA is modeled as an explainable, end-to-end factoid question answering problem.
• The model can leverage granular details and correlate inter-related details in scenes.
• The model can leverage external world knowledge to answer questions.
• The model can predict likely explanations to justify its predicted answers.
Keywords: Visual question answering, Factoid question answering, Knowledge based reasoning, Explainable VQA
Review history: Received 2 September 2020, Revised 6 October 2021, Accepted 7 October 2021, Available online 24 October 2021, Version of Record 1 November 2021.
Official paper URL: https://doi.org/10.1016/j.imavis.2021.104328