Bibliometrics
Skip Table Of Content Section
research-article
PAV-SOD: A New Task towards Panoramic Audiovisual Saliency Detection
Article No.: 101, pp 1–26https://doi.org/10.1145/3565267

Object-level audiovisual saliency detection in 360° panoramic real-life dynamic scenes is important for exploring and modeling human perception in immersive environments, also for aiding the development of virtual, augmented, and mixed reality ...

research-article
Temporal Dropout for Weakly Supervised Action Localization
Article No.: 102, pp 1–24https://doi.org/10.1145/3567827

Weakly supervised action localization is a challenging problem in video understanding and action recognition. Existing models usually formulate the training process as direct classification using video-level supervision. They tend to only locate the most ...

research-article
On Modality Bias Recognition and Reduction
Article No.: 103, pp 1–22https://doi.org/10.1145/3565266

Making each modality in multi-modal data contribute is of vital importance to learning a versatile multi-modal model. Existing methods, however, are often dominated by one or few of modalities during model training, resulting in sub-optimal performance. ...

research-article
CUR Transformer: A Convolutional Unbiased Regional Transformer for Image Denoising
Article No.: 104, pp 1–22https://doi.org/10.1145/3566125

Image denoising is a fundamental problem in computer vision and multimedia computation. Non-local filters are effective for image denoising. But existing deep learning methods that use non-local computation structures are mostly designed for high-level ...

research-article
Beyond the Parts: Learning Coarse-to-Fine Adaptive Alignment Representation for Person Search
Article No.: 105, pp 1–19https://doi.org/10.1145/3565886

Person search is a time-consuming computer vision task that entails locating and recognizing query people in scenic pictures. Body components are commonly mismatched during matching due to position variation, occlusions, and partially absent body parts, ...

research-article
Domain Adaptation Problem in Sketch Based Image Retrieval
Article No.: 106, pp 1–17https://doi.org/10.1145/3565368

In this article, we present two algorithms that discover the discriminative structures of sketches, given pairs of sketches and photos in sketch-based image retrieval (SBIR) scenarios. Unlike the existing approaches, we aim at the few-shot and domain ...

research-article
Toward Intelligent Fashion Design: A Texture and Shape Disentangled Generative Adversarial Network
Article No.: 107, pp 1–23https://doi.org/10.1145/3567596

Texture and shape in fashion, constituting essential elements of garments, characterize the body and surface of the fabric and outline the silhouette of clothing, respectively. The selection of texture and shape plays a critical role in the design process,...

research-article
Multiple Temporal Pooling Mechanisms for Weakly Supervised Temporal Action Localization
Article No.: 108, pp 1–19https://doi.org/10.1145/3567828

Recent action localization works learn in a weakly supervised manner to avoid the expensive cost of human labeling. Those works are mostly based on the Multiple Instance Learning framework, where temporal pooling is an indispensable part that usually ...

research-article
Multi-scale Edge-guided Learning for 3D Reconstruction
Article No.: 109, pp 1–24https://doi.org/10.1145/3568678

Single-view three-dimensional (3D) object reconstruction has always been a long-term challenging task. Objects with complex topologies are hard to accurately reconstruct, which makes existing methods suffer from blurring of shape boundaries between ...

research-article
Lightweight Feature De-redundancy and Self-calibration Network for Efficient Image Super-resolution
Article No.: 110, pp 1–15https://doi.org/10.1145/3569900

In recent years, thanks to the inherent powerful feature representation and learning abilities of the convolutional neural network (CNN), deep CNN-steered single image super-resolution approaches have achieved remarkable performance improvements. However, ...

research-article
FastCNN: Towards Fast and Accurate Spatiotemporal Network for HEVC Compressed Video Enhancement
Article No.: 111, pp 1–22https://doi.org/10.1145/3569583

Deep neural networks have achieved remarkable success in HEVC compressed video quality enhancement. However, most existing multiframe-based methods either deliver unsatisfactory results or consume a significant amount of resources to leverage temporal ...

research-article
A Differentiable Parallel Sampler for Efficient Video Classification
Article No.: 112, pp 1–18https://doi.org/10.1145/3569584

It is crucial to sample a small portion of relevant frames for efficient video classification. The existing methods mainly develop hand-designed sampling strategies or learn sequential selection policies. However, there are two challenges to be solved. ...

research-article
TP-FER: An Effective Three-phase Noise-tolerant Recognizer for Facial Expression Recognition
Article No.: 113, pp 1–17https://doi.org/10.1145/3570329

Single-label facial expression recognition (FER), which aims to classify single expression for facial images, usually suffers from the label noisy and incomplete problem, where manual annotations for partial training images exist wrong or incomplete ...

research-article
Local Eyebrow Feature Attention Network for Masked Face Recognition
Article No.: 114, pp 1–19https://doi.org/10.1145/3569943

During the COVID-19 coronavirus epidemic, wearing masks has become increasingly popular. Traditional occlusion face recognition algorithms are almost ineffective for such heavy mask occlusion. Therefore, it is urgent to improve the recognition performance ...

research-article
Efficient Single-image Super-resolution Using Dual path Connections with Multiple scale Learning
Article No.: 115, pp 1–21https://doi.org/10.1145/3570164

Deep convolutional neural networks have been demonstrated to be effective for single-image super-resolution in recent years. On the one hand, residual connections and dense connections have been used widely to ease forward information and backward ...

research-article
Attention-Augmented Memory Network for Image Multi-Label Classification
Article No.: 116, pp 1–24https://doi.org/10.1145/3570166

The purpose of image multi-label classification is to predict all the object categories presented in an image. Some recent works exploit graph convolution network to capture the correlation between labels. Although promising results have been reported, ...

research-article
Open Access
Multi-Guidance CNNs for Salient Object Detection
Article No.: 117, pp 1–19https://doi.org/10.1145/3570507

Feature refinement and feature fusion are two key steps in convolutional neural networks–based salient object detection (SOD). In this article, we investigate how to utilize multiple guidance mechanisms to better refine and fuse extracted multi-level ...

research-article
ProposalVLAD with Proposal-Intra Exploring for Temporal Action Proposal Generation
Article No.: 118, pp 1–18https://doi.org/10.1145/3571747

Temporal action proposal generation aims to localize temporal segments of human activities in videos. Current boundary-based proposal generation methods can generate proposals with precise boundary but often suffer from the inferior quality of confidence ...

research-article
Deep Unsupervised Key Frame Extraction for Efficient Video Classification
Article No.: 119, pp 1–17https://doi.org/10.1145/3571735

Video processing and analysis have become an urgent task, as a huge amount of videos (e.g., YouTube, Hulu) are uploaded online every day. The extraction of representative key frames from videos is important in video processing and analysis since it ...

research-article
Exploiting Residual and Illumination with GANs for Shadow Detection and Shadow Removal
Article No.: 120, pp 1–22https://doi.org/10.1145/3571745

Residual image and illumination estimation have been proven to be helpful for image enhancement. In this article, we propose a general framework, called RI-GAN, that exploits residual and illumination using generative adversarial networks (GANs). The ...

research-article
Detection of Recolored Image by Texture Features in Chrominance Components
Article No.: 121, pp 1–23https://doi.org/10.1145/3571076

Image recoloring is an emerging editing technique that can change the color style of an image by modifying pixel values without altering the original image content. With the rapid proliferation of social network and image editing techniques, recolored ...

research-article
High-Fidelity Face Reenactment Via Identity-Matched Correspondence Learning
Article No.: 122, pp 1–23https://doi.org/10.1145/3571857

Face reenactment aims to generate an animation of a source face using the poses and expressions from a target face. Although recent methods have made remarkable progress by exploiting generative adversarial networks, they are limited in generating high-...

research-article
Perceptual Hashing of Deep Convolutional Neural Networks for Model Copy Detection
Article No.: 123, pp 1–20https://doi.org/10.1145/3572777

In recent years, many model intellectual property (IP) proof methods for IP protection have been proposed, such as model watermarking and model fingerprinting. However, with the increasing number of models transmitted and deployed on the Internet, quickly ...

research-article
Talking Face Generation via Facial Anatomy
Article No.: 125, pp 1–19https://doi.org/10.1145/3571746

To generate the corresponding talking face from a speech audio and a face image, it is essential to match the variations in the facial appearance with the speech audio in subtle movements of different face regions. Nevertheless, the facial movements ...

Subjects

Comments

About Cookies On This Site

We use cookies to ensure that we give you the best experience on our website.

Learn more

Got it!