Multitask Learning for Visual Question Answering in Python

Multitask Learning for Visual Question Answering in Python

admin

admin

Feb 3, 2024 - 14:24

0 21

Abstract:

Visual Question Answering (VQA) concerns providing answers to Natural Language questions about images. Several deep neural network approaches have been proposed to model the task in an end-to-end fashion. Whereas the task is grounded in visual processing, if the question focuses on events described by verbs, the language understanding component becomes crucial. Our hypothesis is that models should be aware of verb semantics, as expressed via semantic role labels, argument types, and/or frame elements. Unfortunately, no VQA dataset exists that includes verb semantic information. Our first contribution is a new VQA dataset (imSituVQA) that we built by taking advantage of the imSitu annotations. The imSitu dataset consists of images manually labeled with semantic frame elements, mostly taken from FrameNet. Second, we propose a multitask CNN-LSTM VQA model that learns to classify the answers as well as the semantic frame elements. Our experiments show that semantic frame element classification helps the VQA system avoid inconsistent responses and improves performance.

Click Here To See More

Tags:

Previous Article

Nation State Threat Actor Attribution Using Fuzzy Hashing

Gradient Encryption Aided Privacy Preserved Federated Learning for Autonomous Ve...

What's Your Reaction?

0

Like

0

Dislike

0

Love

0

Funny

0

Angry

0

Sad

0

Wow

Related Posts

Development of Magnetic Probe for Sentinel Lymph Node D...

admin Dec 23, 2021 0 25

Enabling Visual Action Planning for Object Manipulation...

admin Feb 1, 2024 0 15

Open Set Fault Diagnosis via Supervised Contrastive Lea...

admin Jan 14, 2024 0 187

Clinic Mangement Drug Process in Django

admin Jan 27, 2024 0 15

Imbalanced Sample Fault Diagnosis of Rolling Bearing Us...

admin Jan 18, 2024 0 19

Artificial Intelligence of Things for Smarter Healthcar...

admin Jan 20, 2024 0 25

Comments