TOMM: Vol 19, No 4

Volume 19, Issue 4July 2023Issue-in-Progress

Volume 19, Issue 4

July 2023

Publisher:

Association for Computing Machinery
New York
NY
United States

ISSN:1551-6857

EISSN:1551-6865

Tags:

Subscribe to Journal Recommend ACM DL

ALREADY A SUBSCRIBER?SIGN IN

Bibliometrics

Select All

Export Citations Save to Binder

research-article

Fake and Dishonest Participant Immune Secret Image Sharing

Article No.: 139, pp 1–26https://doi.org/10.1145/3572842

Secret image sharing (SIS) has received increased attention from the research community because of its usefulness in multiparty secure computing, access control, blockchain distributive storage and other security-oriented applications. Prevention of fake ...

research-article

Semantic Completion and Filtration for Image–Text Retrieval

Article No.: 140, pp 1–20https://doi.org/10.1145/3572844

Image–text retrieval is a vital task in computer vision and has received growing attention, since it connects cross-modality data. It comes with the critical challenges of learning unified representations and eliminating the large gap between visual and ...

research-article

Multi-Source Knowledge Reasoning Graph Network for Multi-Modal Commonsense Inference

Article No.: 141, pp 1–17https://doi.org/10.1145/3573201

As a crucial part of natural language processing, event-centered commonsense inference task has attracted increasing attention. With a given observed event, the intention and reaction of the people involved in the event are required to be inferred with ...

research-article

Attention, Please! Adversarial Defense via Activation Rectification and Preservation

Article No.: 142, pp 1–18https://doi.org/10.1145/3572843

This study provides a new understanding of the adversarial attack problem by examining the correlation between adversarial attack and visual attention change. In particular, we observed that: (1) images with incomplete attention regions are more ...

research-article

Context Sensing Attention Network for Video-based Person Re-identification

Article No.: 143, pp 1–20https://doi.org/10.1145/3573203

Video-based person re-identification (ReID) is challenging due to the presence of various interferences in video frames. Recent approaches handle this problem using temporal aggregation strategies. In this work, we propose a novel Context Sensing ...

research-article

Semi-supervised Learning for Mars Imagery Classification and Segmentation

Article No.: 144, pp 1–23https://doi.org/10.1145/3572916

With the progress of Mars exploration, numerous Mars image data are being collected and need to be analyzed. However, due to the severe train-test gap and quality distortion of Martian data, the performance of existing computer vision models is ...

research-article

DDIFN: A Dual-discriminator Multi-modal Medical Image Fusion Network

Article No.: 145, pp 1–17https://doi.org/10.1145/3574136

Multi-modal medical image fusion is a long-standing important research topic that can obtain informative medical images and assist doctors diagnose and treat diseases more efficiently. However, most fusion methods extract and fuse features by subjectively ...

research-article

D³T-GAN: Data-Dependent Domain Transfer GANs for Image Generation with Limited Data

Article No.: 146, pp 1–20https://doi.org/10.1145/3576858

As an important and challenging problem, image generation with limited data aims at generating realistic images through training a GAN model given few samples. A typical solution is to transfer a well-trained GAN model from a data-rich source domain to ...

research-article

A Novel Lightweight Audio-visual Saliency Model for Videos

Article No.: 147, pp 1–22https://doi.org/10.1145/3576857

Audio information has not been considered an important factor in visual attention models regardless of many psychological studies that have shown the importance of audio information in the human visual perception system. Since existing visual attention ...

research-article

NumCap: A Number-controlled Multi-caption Image Captioning Network

Article No.: 148, pp 1–24https://doi.org/10.1145/3576927

Image captioning is a promising task that attracted researchers in the last few years. Existing image captioning models are primarily trained to generate one caption per image. However, an image may contain rich contents, and one caption cannot express ...

research-article

Distilled Meta-learning for Multi-Class Incremental Learning

Article No.: 149, pp 1–16https://doi.org/10.1145/3576045

Meta-learning approaches have recently achieved promising performance in multi-class incremental learning. However, meta-learners still suffer from catastrophic forgetting, i.e., they tend to forget the learned knowledge from the old tasks when they focus ...

research-article

Graph Attention Transformer Network for Multi-label Image Classification

Article No.: 150, pp 1–16https://doi.org/10.1145/3578518

Multi-label classification aims to recognize multiple objects or attributes from images. The key to solving this issue relies on effectively characterizing the inter-label correlations or dependencies, which bring the prevailing graph neural network. ...

research-article

UID2021: An Underwater Image Dataset for Evaluation of No-Reference Quality Assessment Metrics

Article No.: 151, pp 1–24https://doi.org/10.1145/3578584

Achieving subjective and objective quality assessment of underwater images is of high significance in underwater visual perception and image/video processing. However, the development of underwater image quality assessment (UIQA) is limited for the lack ...

ACM Transactions on Multimedia Computing, Communications, and Applications

Sections

Fake and Dishonest Participant Immune Secret Image Sharing

Semantic Completion and Filtration for Image–Text Retrieval

Multi-Source Knowledge Reasoning Graph Network for Multi-Modal Commonsense Inference

Attention, Please! Adversarial Defense via Activation Rectification and Preservation

Context Sensing Attention Network for Video-based Person Re-identification

Semi-supervised Learning for Mars Imagery Classification and Segmentation

DDIFN: A Dual-discriminator Multi-modal Medical Image Fusion Network

D³T-GAN: Data-Dependent Domain Transfer GANs for Image Generation with Limited Data

A Novel Lightweight Audio-visual Saliency Model for Videos

NumCap: A Number-controlled Multi-caption Image Captioning Network

Distilled Meta-learning for Multi-Class Incremental Learning

Graph Attention Transformer Network for Multi-label Image Classification

UID2021: An Underwater Image Dataset for Evaluation of No-Reference Quality Assessment Metrics

Subjects

Comments