Knowledge-Based Visual Question Answering Using Multi-Modal Semantic Graph
by Meng, Zuqiang; Jiang, Lei
in Accuracy / Algorithms / Analysis / Artificial intelligence / Cognition & reasoning / Computational linguistics / Datasets / Embedding / Graph representations / Graphical representations / Graphs / Knowledge representation / Knowledge-based systems / Language / Language processing / Natural language interfaces / Neural networks / Nodes / Object recognition / Question-answering systems / Questions / Reasoning / Semantics / Visual fields
2023
Journal Article
Overview
The field of visual question answering (VQA) has seen a growing trend of integrating external knowledge sources to improve performance. However, because external knowledge sources may be incomplete and different forms of data are inherently mismatched, current knowledge-based visual question answering (KBVQA) techniques still struggle to integrate and use multiple heterogeneous data sources effectively. To address this issue, a novel approach centered on a multi-modal semantic graph (MSG) is proposed. The MSG unifies the representation of heterogeneous data and diverse types of knowledge. Additionally, a multi-modal semantic graph knowledge reasoning model (MSG-KRM) is introduced to perform reasoning and deep fusion of image–text information and external knowledge sources. To build the semantic graph, keywords are extracted from the image object detection information, the question text, and the external knowledge texts, and are represented as symbol nodes. Three semantic graphs are then constructed based on the knowledge graph, one each for vision, the question, and the external knowledge text; non-symbol nodes are added to connect these three independent graphs, and nodes and edges are marked with their respective types. During the inference stage, the multi-modal semantic graph and image–text information are embedded into the feature semantic graph through three embedding methods, and a type-aware graph attention module is employed for deep reasoning. The final answer prediction combines the output of the pre-trained model, the graph pooling results, and the features of the non-symbol nodes. Experimental results on the OK-VQA dataset show that MSG-KRM outperforms existing methods in overall accuracy, achieving a score of 43.58, with improved accuracy on most question subclasses, which supports the effectiveness of the proposed method.
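The graph construction described in the overview can be illustrated with a minimal sketch. It is not the authors' implementation: the class `SemanticGraph`, the function `build_multimodal_graph`, the hub-node layout, and all node and edge type names are assumptions made for illustration. The sketch only shows the general idea of typed symbol nodes per source (vision, question, external knowledge) joined by typed non-symbol nodes; keyword extraction, embedding, and the type-aware graph attention module are out of scope.

```python
from dataclasses import dataclass, field


@dataclass
class SemanticGraph:
    """Tiny typed graph: every node and every edge carries a type label."""
    nodes: dict = field(default_factory=dict)  # node id -> node type
    edges: list = field(default_factory=list)  # (source, target, edge type)

    def add_node(self, node_id, node_type):
        self.nodes[node_id] = node_type

    def add_edge(self, src, dst, edge_type):
        self.edges.append((src, dst, edge_type))


def build_multimodal_graph(keywords_by_source):
    """Build one graph from per-source keyword lists.

    Assumption (not from the paper): each source gets a single non-symbol
    "hub" node linked to its symbol nodes, and the hubs are linked pairwise
    so the three otherwise independent subgraphs become connected.
    """
    graph = SemanticGraph()
    hubs = []
    for source, keywords in keywords_by_source.items():
        hub = f"<{source}>"
        graph.add_node(hub, f"{source}_hub")  # non-symbol node
        hubs.append(hub)
        # dict.fromkeys drops repeated keywords while keeping their order
        for keyword in dict.fromkeys(keywords):
            node_id = f"{source}:{keyword}"
            graph.add_node(node_id, f"{source}_symbol")  # symbol node
            graph.add_edge(hub, node_id, f"{source}_contains")
    # Connect the per-source subgraphs through their hub nodes.
    for i, first in enumerate(hubs):
        for second in hubs[i + 1:]:
            graph.add_edge(first, second, "cross_modal")
    return graph


# Hypothetical keywords, standing in for detector output, the question
# text, and retrieved knowledge text.
example = build_multimodal_graph({
    "vision": ["dog", "frisbee"],
    "question": ["dog", "breed"],
    "knowledge": ["retriever"],
})
```

Keeping the type label on every node and edge is what would let a downstream attention module treat, for example, a vision symbol node differently from a knowledge symbol node, which is the role the overview assigns to type-aware graph attention.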