Volume 19, Issue 5September 2023Current IssueIssue-in-Progress
Bibliometrics
Skip Table Of Content Section
research-article
Open Access
Cross-User Similarities in Viewing Behavior for 360° Video and Caching Implications
Article No.: 152, pp 1–24https://doi.org/10.1145/3507917

The demand and usage of 360° video services are expected to increase. However, despite these services being highly bandwidth intensive, not much is known about the potential value that basic bandwidth saving techniques such as server or edge-network on-...

research-article
Exploring the Effect of High-frequency Components in GANs Training
Article No.: 153, pp 1–22https://doi.org/10.1145/3578585

Generative Adversarial Networks (GANs) have the ability to generate images that are visually indistinguishable from real images. However, recent studies have revealed that generated and real images share significant differences in the frequency domain. In ...

research-article
Feedforward and Feedback Modulations Based Foveated JND Estimation for Images
Article No.: 154, pp 1–23https://doi.org/10.1145/3579094

The just noticeable difference (JND) reveals the key characteristic of visual perception, which has been widely used in many perception-based image and video applications. Nevertheless, the modulatory mechanism of the human visual system (HVS) has not ...

research-article
MixOOD: Improving Out-of-distribution Detection with Enhanced Data Mixup
Article No.: 155, pp 1–18https://doi.org/10.1145/3578935

Detecting out-of-distribution (OOD) inputs for deep learning models is a critical task when models are deployed in real-world environments. Recently, a large number of works have been dedicated to tackling the OOD detection problem. One of the most ...

research-article
A Multi-Level Consistency Network for High-Fidelity Virtual Try-On
Article No.: 156, pp 1–18https://doi.org/10.1145/3580500

The 2D virtual try-on task aims to transfer a target clothing image to the corresponding region of a person image. Although an extensive amount of research has been conducted due to its immense applications, this task still remains a great challenge to ...

research-article
Fine-Grained Text-to-Video Temporal Grounding from Coarse Boundary
Article No.: 157, pp 1–21https://doi.org/10.1145/3579825

Text-to-video temporal grounding aims to locate a target video moment that semantically corresponds to the given sentence query in an untrimmed video. In this task, fully supervised works require text descriptions for each event along with its temporal ...

research-article
Dual-Lens HDR using Guided 3D Exposure CNN and Guided Denoising Transformer
Article No.: 158, pp 1–20https://doi.org/10.1145/3579167

We study the high dynamic range (HDR) imaging problem in dual-lens systems. Existing methods usually treat the HDR imaging problem as an image fusion problem and the HDR result is estimated by fusing the aligned short exposure image and long exposure ...

research-article
Detection of Moving Object Using Superpixel Fusion Network
Article No.: 160, pp 1–15https://doi.org/10.1145/3579998

Moving object detection is still a challenging task in complex scenes. The existing methods based on deep learning mainly use U-Nets and have achieved amazing results. However, they ignore the local continuity between pixels. In order to solve this ...

research-article
Bottom-up and Top-down Object Inference Networks for Image Captioning
Article No.: 161, pp 1–18https://doi.org/10.1145/3580366

A bottom-up and top-down attention mechanism has led to the revolutionizing of image captioning techniques, which enables object-level attention for multi-step reasoning over all the detected objects. However, when humans describe an image, they often ...

research-article
MKVSE: Multimodal Knowledge Enhanced Visual-semantic Embedding for Image-text Retrieval
Article No.: 162, pp 1–21https://doi.org/10.1145/3580501

Image-text retrieval aims to take the text (image) query to retrieve the semantically relevant images (texts), which is fundamental and critical in the search system, online shopping, and social network. Existing works have shown the effectiveness of ...

research-article
Robust Video Stabilization based on Motion Decomposition
Article No.: 164, pp 1–24https://doi.org/10.1145/3580498

Video stabilization aims to eliminate camera jitter and improve the visual experience of shaky videos. Video stabilization methods often ignore the active movement of the foreground objects and the camera, and may result in distortion and over-smoothing ...

Subjects

Comments

About Cookies On This Site

We use cookies to ensure that we give you the best experience on our website.

Learn more

Got it!