Science Cast

Zero-shot Visual Question Answering using Knowledge Graph

zhuo-chenFebruary 26, 2024 11:36am

Views (69)
Comments (0)

Export Citation

Voice is AI-generated

Connected to paperThis paper is a preprint and has not been certified by peer review

Zero-shot Visual Question Answering using Knowledge Graph

arXivPDFJuly 12, 2021 12:00am

Authors

Zhuo Chen, Jiaoyan Chen, Yuxia Geng, Jeff Z. Pan, Zonggang Yuan, Huajun Chen

Abstract

Incorporating external knowledge to Visual Question Answering (VQA) has become a vital practical need. Existing methods mostly adopt pipeline approaches with different components for knowledge matching and extraction, feature learning, etc.However, such pipeline approaches suffer when some component does not perform well, which leads to error propagation and poor overall performance. Furthermore, the majority of existing approaches ignore the answer bias issue -- many answers may have never appeared during training (i.e., unseen answers) in real-word application. To bridge these gaps, in this paper, we propose a Zero-shot VQA algorithm using knowledge graphs and a mask-based learning mechanism for better incorporating external knowledge, and present new answer-based Zero-shot VQA splits for the F-VQA dataset. Experiments show that our method can achieve state-of-the-art performance in Zero-shot VQA with unseen answers, meanwhile dramatically augment existing end-to-end models on the normal F-VQA task.

TwitterandLinkedIn

0 comments

Add comment

Zero-shot Visual Question Answering using Knowledge Graph

Zero-shot Visual Question Answering using Knowledge Graph

AI-powered Paper ChatBeta

AI-powered Paper ChatBeta

0 comments