Search by Subject - Artificial Intelligence, Machine Learning, Computer Vision, Natural language processing

research-article

Free

JUST ACCEPTED

Published By ACM

LFR-GAN: Local Feature Refinement based Generative Adversarial Network for Text-to-Image Generation

ACM Transactions on Multimedia Computing, Communications, and Applications (TOMM), Just Accepted Accepted on March 2023https://doi.org/10.1145/3589002

Text-to-image generation aims to generate images from text descriptions. Its main challenge lies in two aspects: (1) Semantic consistency, i.e., the generated images should be semantically consistent with the input text; (2) Visual reality, i.e., the ...

research-article

Published By ACM

MKVSE: Multimodal Knowledge Enhanced Visual-semantic Embedding for Image-text Retrieval

ACM Transactions on Multimedia Computing, Communications, and Applications (TOMM), Volume 19, Issue 5September 2023, Article No.: 162, pp 1–21https://doi.org/10.1145/3580501

Image-text retrieval aims to take the text (image) query to retrieve the semantically relevant images (texts), which is fundamental and critical in the search system, online shopping, and social network. Existing works have shown the effectiveness of ...

research-article

Published By ACM

Entangled Representation Learning: A Bidirectional Encoder Decoder Model

ACAI '22: Proceedings of the 2022 5th International Conference on Algorithms, Computing and Artificial IntelligenceDecember 2022, Article No.: 70, pp 1–7https://doi.org/10.1145/3579654.3579728

Encoder-decoder model encodes input sentences to hidden representations and decodes the representations to the output in unidirectional way. We introduce a bidirectional encoder decoder model that adds a reverse decoder-encoder for the feedback from ...

research-article

Published By ACM

Towards Automated Analysis of Rhetorical Categories in Students Essay Writings using Bloom’s Taxonomy

LAK2023: LAK23: 13th International Learning Analytics and Knowledge ConferenceMarch 2023, pp 418–429https://doi.org/10.1145/3576050.3576112

Essay writing has become one of the most common learning tasks assigned to students enrolled in various courses at different educational levels, owing to the growing demand for future professionals to effectively communicate information to an audience ...

research-article

Published By ACM

EZInterviewer: To Improve Job Interview Performance with Mock Interview Generator

WSDM '23: Proceedings of the Sixteenth ACM International Conference on Web Search and Data MiningFebruary 2023, pp 1102–1110https://doi.org/10.1145/3539597.3570476

Interview has been regarded as one of the most crucial step for recruitment. To fully prepare for the interview with the recruiters, job seekers usually practice with mock interviews between each other. However, such a mock interview with peers is ...

research-article

Published By ACM

S2TUL: A Semi-Supervised Framework for Trajectory-User Linking

WSDM '23: Proceedings of the Sixteenth ACM International Conference on Web Search and Data MiningFebruary 2023, pp 375–383https://doi.org/10.1145/3539597.3570410

Trajectory-User Linking (TUL) aiming to identify users of anonymous trajectories, has recently received increasing attention due to its wide range of applications, such as criminal investigation and personalized recommendation systems. In this paper, we ...

research-article

Published By ACM

Semi-supervised Learning for Mars Imagery Classification and Segmentation

ACM Transactions on Multimedia Computing, Communications, and Applications (TOMM), Volume 19, Issue 4July 2023, Article No.: 144, pp 1–23https://doi.org/10.1145/3572916

With the progress of Mars exploration, numerous Mars image data are being collected and need to be analyzed. However, due to the severe train-test gap and quality distortion of Martian data, the performance of existing computer vision models is ...

research-article

Published By ACM

FastCNN: Towards Fast and Accurate Spatiotemporal Network for HEVC Compressed Video Enhancement

ACM Transactions on Multimedia Computing, Communications, and Applications (TOMM), Volume 19, Issue 3May 2023, Article No.: 111, pp 1–22https://doi.org/10.1145/3569583

Deep neural networks have achieved remarkable success in HEVC compressed video quality enhancement. However, most existing multiframe-based methods either deliver unsatisfactory results or consume a significant amount of resources to leverage temporal ...

research-article

Published By ACM

When Visible Light (Backscatter) Communication Meets Neuromorphic Cameras in V2X

HotMobile '23: Proceedings of the 24th International Workshop on Mobile Computing Systems and ApplicationsFebruary 2023, pp 42–48https://doi.org/10.1145/3572864.3580333

Intelligent transportation systems are predicted to change the way people live in the foreseeable future. Vehicular networks are one of the key enablers for such systems, yet no status-quo solutions of vehicular networks make practical deployments ...

research-article

Published By ACM

A Decoupled Kernel Prediction Network Guided by Soft Mask for Single Image HDR Reconstruction

ACM Transactions on Multimedia Computing, Communications, and Applications (TOMM), Volume 19, Issue 2sJune 2023, Article No.: 79, pp 1–23https://doi.org/10.1145/3550277

Recent works on single image high dynamic range (HDR) reconstruction fail to hallucinate plausible textures, resulting in information missing and artifacts in large-scale under/over-exposed regions. In this article, a decoupled kernel prediction network ...

research-article

Free

JUST ACCEPTED

Published By ACM

Learning Multi-turn Response Selection in Grounded Dialogues with Reinforced Knowledge and Context Distillation

ACM Transactions on Information Systems (TOIS), Just Accepted Accepted on February 2023https://doi.org/10.1145/3584701

Recently, knowledge-grounded dialogue systems have gained increasing attention. Great efforts have been made to build response matching models where all dialogue content and knowledge sentences are leveraged. However, knowledge redundancy and distraction ...

research-article

Published By ACM

Image Quality Assessment–driven Reinforcement Learning for Mixed Distorted Image Restoration

ACM Transactions on Multimedia Computing, Communications, and Applications (TOMM), Volume 19, Issue 1sFebruary 2023, Article No.: 42, pp 1–23https://doi.org/10.1145/3532625

Due to the diversity of the degradation process that is difficult to model, the recovery of mixed distorted images is still a challenging problem. The deep learning model trained under certain degradation declines significantly in other degradation ...

research-article

Open Access

Published By ACM

Mortar: Morphing the Bit Level Sparsity for General Purpose Deep Learning Acceleration

ASPDAC '23: Proceedings of the 28th Asia and South Pacific Design Automation ConferenceJanuary 2023, pp 739–744https://doi.org/10.1145/3566097.3567868

Vanilla Deep Neural Networks (DNN) after training are represented with native floating-point 32 (fp32) weights. We observe that the bit-level sparsity of these weights is very abundant in the mantissa and can be directly exploited to speed up model ...

demonstration

Published By ACM

Ubiquitous Deployed Meta-Material Sensors for Structural Monitoring of Buildings

SenSys '22: Proceedings of the 20th ACM Conference on Embedded Networked Sensor SystemsNovember 2022, pp 788–789https://doi.org/10.1145/3560905.3568093

Obtaining fine-grained structural information about building through ubiquitous sensors is crucial for assessing their aging and damage. However, due to the energy requirements, traditional sensors deployed in the building structure need frequent ...

research-article

Published By ACM

Distinguished Paper

Learning to Construct Better Mutation Faults

ASE '22: Proceedings of the 37th IEEE/ACM International Conference on Automated Software EngineeringOctober 2022, Article No.: 64, pp 1–13https://doi.org/10.1145/3551349.3556949

Mutation faults are the core of mutation testing and have been widely used in many other software testing and debugging tasks. Hence, constructing high-quality mutation faults is critical. There are many traditional mutation techniques that construct ...

research-article

Published By ACM

Safety and Performance, Why not Both? Bi-Objective Optimized Model Compression toward AI Software Deployment

ASE '22: Proceedings of the 37th IEEE/ACM International Conference on Automated Software EngineeringOctober 2022, Article No.: 88, pp 1–13https://doi.org/10.1145/3551349.3556906

The size of deep learning models in artificial intelligence (AI) software is increasing rapidly, which hinders the large-scale deployment on resource-restricted devices (e.g., smartphones). To mitigate this issue, AI software compression plays a ...

research-article

Published By ACM

Optimizing communication in deep reinforcement learning with XingTian

Middleware '22: Proceedings of the 23rd ACM/IFIP International Middleware ConferenceNovember 2022, pp 255–268https://doi.org/10.1145/3528535.3565249

Deep Reinforcement Learning (DRL) achieves great success in various domains. Communication in today's DRL algorithms takes non-negligible time compared to the computation. However, prior DRL frameworks usually focus on computation management while ...

research-article

Stable Contrastive Learning for Self-Supervised Sentence Embeddings With Pseudo-Siamese Mutual Learning

IEEE/ACM Transactions on Audio, Speech and Language Processing (TASLP), Volume 302022, pp 3046–3059https://doi.org/10.1109/TASLP.2022.3203209

Learning semantic sentence embeddings is beneficial to a variety of natural language processing tasks. Recently, methods using the contrastive learning framework to fine-tune pre-trained language models have been proposed and have achieved significant ...

research-article

Generating Rational Commonsense Knowledge-Aware Dialogue Responses With Channel-Aware Knowledge Fusing Network

IEEE/ACM Transactions on Audio, Speech and Language Processing (TASLP), Volume 302022, pp 3230–3239https://doi.org/10.1109/TASLP.2022.3199649

Dialogues systems endow machines with the ability to converse with humans using natural language. Nonetheless, previous Seq2Seq-based generative dialogue systems often generate safe but meaningless responses, such as ‘I don't know’ or ...

research-article

Integrating Lattice-Free MMI Into End-to-End Speech Recognition

IEEE/ACM Transactions on Audio, Speech and Language Processing (TASLP), Volume 312023, pp 25–38https://doi.org/10.1109/TASLP.2022.3198555

In automatic speech recognition (ASR) research, discriminative criteria have achieved superior performance in DNN-HMM systems. Given this success, the adoption of discriminative criteria is promising to boost the performance of end-to-end (E2E) ASR ...

Artificial Intelligence, Machine Learning, Computer Vision, Natural language processing

Applied Filters

People

Names

Affiliations

Authors

Publications

Journal/Magazine Names

Proceedings/Book Names

All Publications

Content Type

Media Formats

Paper Award

Publisher

Conferences

Sponsors

Conference Event

Proceedings Series

Reproducibility Badges

Publication Date

Save to Binder