TOMM: Vol 19, No 5

Volume 19, Issue 5September 2023Current IssueIssue-in-Progress

Latest Issue

Volume 19, Issue 5

September 2023

Publisher:

Association for Computing Machinery
New York
NY
United States

ISSN:1551-6857

EISSN:1551-6865

Tags:

Subscribe to Journal Recommend ACM DL

ALREADY A SUBSCRIBER?SIGN IN

Bibliometrics

Select All

Export Citations Save to Binder

research-article

Open Access

Cross-User Similarities in Viewing Behavior for 360° Video and Caching Implications

Article No.: 152, pp 1–24https://doi.org/10.1145/3507917

The demand and usage of 360° video services are expected to increase. However, despite these services being highly bandwidth intensive, not much is known about the potential value that basic bandwidth saving techniques such as server or edge-network on-...

research-article

Exploring the Effect of High-frequency Components in GANs Training

Article No.: 153, pp 1–22https://doi.org/10.1145/3578585

Generative Adversarial Networks (GANs) have the ability to generate images that are visually indistinguishable from real images. However, recent studies have revealed that generated and real images share significant differences in the frequency domain. In ...

research-article

Feedforward and Feedback Modulations Based Foveated JND Estimation for Images

Article No.: 154, pp 1–23https://doi.org/10.1145/3579094

The just noticeable difference (JND) reveals the key characteristic of visual perception, which has been widely used in many perception-based image and video applications. Nevertheless, the modulatory mechanism of the human visual system (HVS) has not ...

research-article

MixOOD: Improving Out-of-distribution Detection with Enhanced Data Mixup

Article No.: 155, pp 1–18https://doi.org/10.1145/3578935

Detecting out-of-distribution (OOD) inputs for deep learning models is a critical task when models are deployed in real-world environments. Recently, a large number of works have been dedicated to tackling the OOD detection problem. One of the most ...

research-article

A Multi-Level Consistency Network for High-Fidelity Virtual Try-On

Article No.: 156, pp 1–18https://doi.org/10.1145/3580500

The 2D virtual try-on task aims to transfer a target clothing image to the corresponding region of a person image. Although an extensive amount of research has been conducted due to its immense applications, this task still remains a great challenge to ...

research-article

Fine-Grained Text-to-Video Temporal Grounding from Coarse Boundary

Article No.: 157, pp 1–21https://doi.org/10.1145/3579825

Text-to-video temporal grounding aims to locate a target video moment that semantically corresponds to the given sentence query in an untrimmed video. In this task, fully supervised works require text descriptions for each event along with its temporal ...

research-article

Dual-Lens HDR using Guided 3D Exposure CNN and Guided Denoising Transformer

Article No.: 158, pp 1–20https://doi.org/10.1145/3579167

We study the high dynamic range (HDR) imaging problem in dual-lens systems. Existing methods usually treat the HDR imaging problem as an image fusion problem and the HDR result is estimated by fusing the aligned short exposure image and long exposure ...

research-article

Detection of Moving Object Using Superpixel Fusion Network

Yang Li

Article No.: 160, pp 1–15https://doi.org/10.1145/3579998

Moving object detection is still a challenging task in complex scenes. The existing methods based on deep learning mainly use U-Nets and have achieved amazing results. However, they ignore the local continuity between pixels. In order to solve this ...

research-article

Bottom-up and Top-down Object Inference Networks for Image Captioning

Article No.: 161, pp 1–18https://doi.org/10.1145/3580366

A bottom-up and top-down attention mechanism has led to the revolutionizing of image captioning techniques, which enables object-level attention for multi-step reasoning over all the detected objects. However, when humans describe an image, they often ...

research-article

MKVSE: Multimodal Knowledge Enhanced Visual-semantic Embedding for Image-text Retrieval

Article No.: 162, pp 1–21https://doi.org/10.1145/3580501

Image-text retrieval aims to take the text (image) query to retrieve the semantically relevant images (texts), which is fundamental and critical in the search system, online shopping, and social network. Existing works have shown the effectiveness of ...

research-article

Robust Video Stabilization based on Motion Decomposition

Article No.: 164, pp 1–24https://doi.org/10.1145/3580498

Video stabilization aims to eliminate camera jitter and improve the visual experience of shaky videos. Video stabilization methods often ignore the active movement of the foreground objects and the camera, and may result in distortion and over-smoothing ...

ACM Transactions on Multimedia Computing, Communications, and Applications

Sections

Cross-User Similarities in Viewing Behavior for 360° Video and Caching Implications

Exploring the Effect of High-frequency Components in GANs Training

Feedforward and Feedback Modulations Based Foveated JND Estimation for Images

MixOOD: Improving Out-of-distribution Detection with Enhanced Data Mixup

A Multi-Level Consistency Network for High-Fidelity Virtual Try-On

Fine-Grained Text-to-Video Temporal Grounding from Coarse Boundary

Dual-Lens HDR using Guided 3D Exposure CNN and Guided Denoising Transformer

Detection of Moving Object Using Superpixel Fusion Network

Bottom-up and Top-down Object Inference Networks for Image Captioning

MKVSE: Multimodal Knowledge Enhanced Visual-semantic Embedding for Image-text Retrieval

Robust Video Stabilization based on Motion Decomposition

Subjects

Comments