research-article

Graph-Based Visual-Semantic Entanglement Network for Zero-Shot Image Recognition

Authors:
Yang Hu

South China University of Technology, Guangzhou, China

South China University of Technology, Guangzhou, China

0000-0002-4856-5014
Search about this author

,
Guihua Wen

South China University of Technology, Guangzhou, China

South China University of Technology, Guangzhou, China

0000-0002-9709-1126
Search about this author

,
Adriane Chapman

University of Southampton, Southampton, U.K.

University of Southampton, Southampton, U.K.

0000-0002-3814-2587
Search about this author

,
Pei Yang

South China University of Technology, Guangzhou, China

South China University of Technology, Guangzhou, China
Search about this author

,
Mingnan Luo

South China University of Technology, Guangzhou, China

South China University of Technology, Guangzhou, China
Search about this author

,
Yingxue Xu

South China University of Technology, Guangzhou, China

South China University of Technology, Guangzhou, China
Search about this author

,
Dan Dai

South China University of Technology, Guangzhou, China

South China University of Technology, Guangzhou, China

0000-0002-1287-7569
Search about this author

,
Wendy Hall

University of Southampton, Southampton, U.K.

University of Southampton, Southampton, U.K.

0000-0003-4327-7811
Search about this author

IEEE Transactions on Multimedia Volume 242022 pp 2473–2487https://doi.org/10.1109/TMM.2021.3082292

Published:01 January 2022Publication History

IEEE Transactions on Multimedia

Abstract

Zero-shot learning uses semantic attributes to connect the search space of unseen objects. In recent years, although the deep convolutional network brings powerful visual modeling capabilities to the ZSL task, its visual features have severe pattern inertia and lack of representation of semantic relationships, which leads to severe bias and ambiguity. In response to this, we propose the Graph-based Visual-Semantic Entanglement Network to conduct graph modeling of visual features, which is mapped to semantic attributes by using a knowledge graph, it contains several novel designs: 1. it establishes a multi-path entangled network with the convolutional neural network (CNN) and the graph convolutional network (GCN), which input the visual features from CNN to GCN to model the implicit semantic relations, then GCN feedback the graph modeled information to CNN features; 2. it uses attribute word vectors as the target for the graph semantic modeling of GCN, which forms a self-consistent regression for graph modeling and supervise GCN to learn more personalized attribute relations; 3. it fuses and supplements the hierarchical visual-semantic features refined by graph modeling into visual embedding. Our method outperforms state-of-the-art approaches on multiple representative ZSL datasets: AwA2, CUB, and SUN by promoting the semantic linkage modelling of visual features.

Index Terms

(auto-classified)

Graph-Based Visual-Semantic Entanglement Network for Zero-Shot Image Recognition
1. Computing methodologies
  1. Artificial intelligence
    1. Computer vision
  2. Machine learning
    1. Machine learning approaches
      1. Neural networks

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Article

Published in

IEEE Transactions on Multimedia Volume 24, Issue
2022
2475 pages
ISSN:1520-9210
Issue’s Table of Contents

1520-9210 © 2021 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission. See https://www.ieee.org/publications/rights/index.html for more information.
Sponsors
In-Cooperation
Publisher
IEEE Press
Publication History
- Published: 1 January 2022
Qualifiers
- research-article
Conference
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 0
  Total Citations
  View Citations
- 0
  Total Downloads
- Downloads (Last 12 months)0
- Downloads (Last 6 weeks)0
Other Metrics
View Author Metrics
Cited By
This publication has not been cited yet

Graph-Based Visual-Semantic Entanglement Network for Zero-Shot Image Recognition

Save to Binder

IEEE Transactions on Multimedia

Abstract

Cited By

Index Terms

Graph-Based Visual-Semantic Entanglement Network for Zero-Shot Image Recognition

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Qualifiers

Conference

Funding Sources

Other Metrics

Article Metrics

Other Metrics

Cited By

Digital Edition

Caption

Graph-Based Visual-Semantic Entanglement Network for Zero-Shot Image Recognition

Save to Binder

IEEE Transactions on Multimedia

Abstract

Cited By

Index Terms

Graph-Based Visual-Semantic Entanglement Network for Zero-Shot Image Recognition

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Qualifiers

Conference

Funding Sources

Article Metrics

Other Metrics

Digital Edition

Share this Publication link

Share on Social Media