Search by Subject

Artificial Intelligence, Machine Learning, Computer Vision, Natural language processing

Searched The ACM Full-Text Collection (691,749 records) | Expand your search to The ACM Guide to Computing Literature (3,482,418 records)

Showing 1 - 20 of 38 Results

research-article
Free
March 2023
Published By ACM
Results Reproduced / v1.1
Artifacts Evaluated & Functional / v1.1
Artifacts Available / v1.1
Homunculus: Auto-Generating Efficient Data-Plane ML Pipelines for Datacenter Networks
ASPLOS 2023: Proceedings of the 28th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Volume 3, March 2023, pp 329–342, https://doi.org/10.1145/3582016.3582022

Support for Machine Learning (ML) applications in networking has significantly improved over the last decade. The availability of public datasets and programmable switching fabrics (including low-level languages to program them) presents a full-stack ...
Total Citations: 0
Total Downloads: 77
Last 12 Months: 77
Last 6 weeks: 77
research-article
February 2023
Published By ACM
Artifacts Evaluated & Functional / v1.1
Artifacts Available / v1.1
Results Reproduced / v1.1
DSP: Efficient GNN Training with Multiple GPUs
PPoPP '23: Proceedings of the 28th ACM SIGPLAN Annual Symposium on Principles and Practice of Parallel Programming, February 2023, pp 392–404, https://doi.org/10.1145/3572848.3577528

Jointly utilizing multiple GPUs to train graph neural networks (GNNs) is crucial for handling large graphs and achieving high efficiency. However, we find that existing systems suffer from high communication costs and low GPU utilization due to improper ...
Total Citations: 0
Total Downloads: 250
Last 12 Months: 250
Last 6 weeks: 250
research-article
February 2023
Published By ACM
Artifacts Available / v1.1
Artifacts Evaluated & Reusable / v1.1
Results Reproduced / v1.1
TGOpt: Redundancy-Aware Optimizations for Temporal Graph Attention Networks
- Yufeng Wang,
- Charith Mendis
PPoPP '23: Proceedings of the 28th ACM SIGPLAN Annual Symposium on Principles and Practice of Parallel Programming, February 2023, pp 354–368, https://doi.org/10.1145/3572848.3577490

Temporal Graph Neural Networks are gaining popularity in modeling interactions on dynamic graphs. Among them, Temporal Graph Attention Networks (TGAT) have gained adoption in predictive tasks, such as link prediction, in a range of application domains. ...
Total Citations: 0
Total Downloads: 108
Last 12 Months: 108
Last 6 weeks: 108
research-article
February 2023
Published By ACM
Results Reproduced / v1.1
Artifacts Available / v1.1
Artifacts Evaluated & Reusable / v1.1
TDC: Towards Extremely Efficient CNNs on GPUs via Hardware-Aware Tucker Decomposition
PPoPP '23: Proceedings of the 28th ACM SIGPLAN Annual Symposium on Principles and Practice of Parallel Programming, February 2023, pp 260–273, https://doi.org/10.1145/3572848.3577478

Tucker decomposition is one of the SOTA CNN model compression techniques. However, unlike the FLOPs reduction, we observe very limited inference time reduction with Tucker-compressed models using existing GPU software such as cuDNN. To this end, we ...
Total Citations: 0
Total Downloads: 86
Last 12 Months: 86
Last 6 weeks: 86
research-article
February 2023
Published By ACM
Results Reproduced / v1.1
Artifacts Available / v1.1
Artifacts Evaluated & Reusable / v1.1
RL4ReAl: Reinforcement Learning for Register Allocation
CC 2023: Proceedings of the 32nd ACM SIGPLAN International Conference on Compiler Construction, February 2023, pp 133–144, https://doi.org/10.1145/3578360.3580273

We aim to automate decades of research and experience in register allocation, leveraging machine learning. We tackle this problem by embedding a multi-agent reinforcement learning algorithm within LLVM, training it with the state of the art ...
Total Citations: 0
Total Downloads: 95
Last 12 Months: 95
Last 6 weeks: 72
research-article
January 2023
Published By ACM
Artifacts Available / v1.1
Artifacts Evaluated & Reusable / v1.1
Results Reproduced / v1.1
Transfer-Tuning: Reusing Auto-Schedules for Efficient Tensor Program Code Generation
- Perry Gibson,
- José Cano
PACT '22: Proceedings of the International Conference on Parallel Architectures and Compilation Techniques, October 2022, pp 28–39, https://doi.org/10.1145/3559009.3569682

Auto-scheduling for tensor programs is a process where a search algorithm automatically explores candidate schedules (program transformations) for a given program on a target hardware platform to improve its performance. However this can be a very time ...
Total Citations: 1
Total Downloads: 60
Last 12 Months: 60
Last 6 weeks: 11
research-article
Open Access
January 2023
Published By ACM
Results Reproduced / v1.1
Artifacts Evaluated & Functional / v1.1
Artifacts Available / v1.1
Effective Performance Modeling and Domain-Specific Compiler Optimization of CNNs for GPUs
PACT '22: Proceedings of the International Conference on Parallel Architectures and Compilation Techniques, October 2022, pp 252–264, https://doi.org/10.1145/3559009.3569674

The Convolutional Neural Network (CNN) kernel is a fundamental building block for deep learning, which dominates the computational cost of deep learning pipelines for image analysis. The synthesis of high-performance GPU kernels for CNNs is thus of ...
Total Citations: 0
Total Downloads: 106
Last 12 Months: 106
Last 6 weeks: 61
research-article
November 2022
Published By ACM
Artifacts Evaluated & Functional / v1.1
Artifacts Available / v1.1
Results Reproduced / v1.1
Atlas: automate online service configuration in network slicing
CoNEXT '22: Proceedings of the 18th International Conference on emerging Networking EXperiments and Technologies, November 2022, pp 140–155, https://doi.org/10.1145/3555050.3569115

Network slicing achieves cost-efficient slice customization to support heterogeneous applications and services. Configuring cross-domain resources to end-to-end slices based on service-level agreements, however, is challenging, due to the complicated ...
Total Citations: 0
Total Downloads: 146
Last 12 Months: 146
Last 6 weeks: 36
Supplementary Material: p140-liu.pdf
research-article
Open Access
March 2022
Published By ACM
Results Reproduced / v1.1
Artifacts Available / v1.1
Artifacts Evaluated & Reusable / v1.1
QGTC: accelerating quantized graph neural networks via GPU tensor core
PPoPP '22: Proceedings of the 27th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, April 2022, pp 107–119, https://doi.org/10.1145/3503221.3508408

Over the most recent years, quantized graph neural network (QGNN) attracts lots of research and industry attention due to its high robustness and low computation and memory overhead. Unfortunately, the performance gains of QGNN have never been realized ...
Total Citations: 8
Total Downloads: 550
Last 12 Months: 550
Last 6 weeks: 45
research-article
Open Access
March 2022
Published By ACM
Results Reproduced / v1.1
Artifacts Evaluated & Functional / v1.1
Artifacts Available / v1.1
LiteReconfig: cost and content aware reconfiguration of video object detection systems for mobile GPUs
EuroSys '22: Proceedings of the Seventeenth European Conference on Computer Systems, March 2022, pp 334–351, https://doi.org/10.1145/3492321.3519577

An adaptive video object detection system selects different execution paths at runtime, based on video content and available resources, so as to maximize accuracy under a target latency objective (e.g., 30 frames per second). Such a system is well ...
Total Citations: 1
Total Downloads: 749
Last 12 Months: 744
Last 6 weeks: 77
research-article
March 2022
Published By ACM
Artifacts Available / v1.1
Artifacts Evaluated & Reusable / v1.1
Results Reproduced / v1.1
Automating reinforcement learning architecture design for code optimization
CC 2022: Proceedings of the 31st ACM SIGPLAN International Conference on Compiler Construction, March 2022, pp 129–143, https://doi.org/10.1145/3497776.3517769

Reinforcement learning (RL) is emerging as a powerful technique for solving complex code optimization tasks with an ample search space. While promising, existing solutions require a painstaking manual process to tune the right task-specific RL ...
Total Citations: 3
Total Downloads: 573
Last 12 Months: 473
Last 6 weeks: 27
research-article
Open Access
March 2022
Published By ACM
Results Reproduced / v1.1
Artifacts Available / v1.1
Artifacts Evaluated & Reusable / v1.1
Training of deep learning pipelines on memory-constrained GPUs via segmented fused-tiled execution
CC 2022: Proceedings of the 31st ACM SIGPLAN International Conference on Compiler Construction, March 2022, pp 104–116, https://doi.org/10.1145/3497776.3517766

Training models with massive inputs is a significant challenge in the development of Deep Learning pipelines to process very large digital image datasets as required by Whole Slide Imaging (WSI) in computational pathology and analysis of brain fMRI ...
Total Citations: 0
Total Downloads: 333
Last 12 Months: 306
Last 6 weeks: 28
research-article
February 2022
Published By ACM
Artifacts Evaluated & Functional / v1.1
Artifacts Available / v1.1
Results Reproduced / v1.1
AStitch: enabling a new multi-dimensional optimization space for memory-intensive ML training and inference on modern SIMT architectures
ASPLOS '22: Proceedings of the 27th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, February 2022, pp 359–373, https://doi.org/10.1145/3503222.3507723

This work reveals that memory-intensive computation is a rising performance-critical factor in recent machine learning models. Due to a unique set of new challenges, existing ML optimizing compilers cannot perform efficient fusion under complex two-...
Total Citations: 5
Total Downloads: 1,733
Last 12 Months: 1,119
Last 6 weeks: 97
research-article
Open Access
July 2021
Published By ACM
Results Reproduced / v1.1
Orienting point clouds with dipole propagation
ACM Transactions on Graphics (TOG), Volume 40, Issue 4, August 2021, Article No.: 165, pp 1–14, https://doi.org/10.1145/3450626.3459835

Establishing a consistent normal orientation for point clouds is a notoriously difficult problem in geometry processing, requiring attention to both local and global shape characteristics. The normal direction of a point is a function of the local ...
Total Citations: 6
Total Downloads: 431
Last 12 Months: 263
Last 6 weeks: 17
Supplementary Material: a165-metzer.zip, a165-metzer.mp4, 3450626.3459835.mp4, 3450626.3459835.vtt
research-article
Open Access
July 2021
Published By ACM
Results Reproduced / v1.1
General virtual sketching framework for vector line art
ACM Transactions on Graphics (TOG), Volume 40, Issue 4, August 2021, Article No.: 51, pp 1–14, https://doi.org/10.1145/3450626.3459833

Vector line art plays an important role in graphic design, however, it is tedious to manually create. We introduce a general framework to produce line drawings from a wide variety of images, by learning a mapping from raster image space to vector image ...
Total Citations: 7
Total Downloads: 757
Last 12 Months: 416
Last 6 weeks: 37
Supplementary Material: 3450626.3459833.vtt, a51-mo.zip, a51-mo.mp4, 3450626.3459833.mp4
research-article
Open Access
August 2020
Published By ACM
Results Reproduced / v1.1
MichiGAN: multi-input-conditioned hair image generation for portrait editing
ACM Transactions on Graphics (TOG), Volume 39, Issue 4, August 2020, Article No.: 95, pp 95:1–95:13, https://doi.org/10.1145/3386569.3392488

Despite the recent success of face image generation with GANs, conditional hair editing remains challenging due to the under-explored complexity of its geometry and appearance. In this paper, we present MichiGAN (Multi-Input-Conditioned Hair Image GAN), ...
Total Citations: 35
Total Downloads: 513
Last 12 Months: 167
Last 6 weeks: 18
Supplementary Material: 3386569.3392488.mp4, a95-tan.mp4, 3386569.3392488.vtt
research-article
Open Access
August 2020
Published By ACM
Results Reproduced / v1.1
Deep geometric texture synthesis
ACM Transactions on Graphics (TOG), Volume 39, Issue 4, August 2020, Article No.: 108, pp 108:1–108:11, https://doi.org/10.1145/3386569.3392471

Recently, deep generative adversarial networks for image generation have advanced rapidly; yet, only a small amount of research has focused on generative models for irregular structures, particularly meshes. Nonetheless, mesh generation and synthesis ...
Total Citations: 16
Total Downloads: 521
Last 12 Months: 173
Last 6 weeks: 9
Supplementary Material: a108-hertz.mp4, 3386569.3392471.vtt, 3386569.3392471.mp4
research-article
Open Access
August 2020
Published By ACM
Results Reproduced / v1.1
Skeleton-aware networks for deep motion retargeting
ACM Transactions on Graphics (TOG), Volume 39, Issue 4, August 2020, Article No.: 62, pp 62:1–62:14, https://doi.org/10.1145/3386569.3392462

We introduce a novel deep learning framework for data-driven motion retargeting between skeletons, which may have different structure, yet corresponding to homeomorphic graphs. Importantly, our approach learns how to retarget without requiring any ...
Total Citations: 39
Total Downloads: 1,419
Last 12 Months: 801
Last 6 weeks: 74
Supplementary Material: 3386569.3392462.vtt, 3386569.3392462.mp4, a62-aberman.mp4
research-article
Open Access
August 2020
Published By ACM
Results Reproduced / v1.1
CARL: controllable agent with reinforcement learning for quadruped locomotion
ACM Transactions on Graphics (TOG), Volume 39, Issue 4, August 2020, Article No.: 38, pp 38:1–38:10, https://doi.org/10.1145/3386569.3392433

Motion synthesis in a dynamic environment has been a long-standing problem for character animation. Methods using motion capture data tend to scale poorly in complex environments because of their larger capturing and labeling requirement. Physics-based ...
Total Citations: 25
Total Downloads: 1,263
Last 12 Months: 416
Last 6 weeks: 71
Supplementary Material: 3386569.3392433.vtt, 3386569.3392433.mp4, a38-luo.mp4
research-article
Open Access
August 2020
Published By ACM
Results Reproduced / v1.1
Point2Mesh: a self-prior for deformable meshes
ACM Transactions on Graphics (TOG), Volume 39, Issue 4, August 2020, Article No.: 126, pp 126:1–126:12, https://doi.org/10.1145/3386569.3392415

In this paper, we introduce Point2Mesh, a technique for reconstructing a surface mesh from an input point cloud. Instead of explicitly specifying a prior that encodes the expected shape properties, the prior is defined automatically using the input ...
Total Citations: 63
Total Downloads: 909
Last 12 Months: 397
Last 6 weeks: 39
Supplementary Material: 3386569.3392415.vtt, 3386569.3392415.mp4, a126-hanocka.mp4, a126-hanocka.zip
