Search by Subject

keynote

Published By ACM

Scaling the Metaverse: An AI Perspective

Tania Lorido-Botran

ICPE '23 Companion: Companion of the 2023 ACM/SPEC International Conference on Performance Engineering, April 2023, pp 259–260, https://doi.org/10.1145/3578245.3584920

When one hears the word Metaverse, it is automatically associated with millions of users, immersive experiences and its potential to change our lives. But, what enables the Metaverse to function at such a scale? This talk will present the different ...

keynote

Free

Published By ACM

Robots in Real Life: Putting HRI to Work

Andrea Thomaz

HRI '23: Proceedings of the 2023 ACM/IEEE International Conference on Human-Robot Interaction, March 2023, pp 3, https://doi.org/10.1145/3568162.3578810

This talk will be focused on the unique challenges in deploying a mobile manipulation robot into an environment where the robot is working closely with people on a daily basis. Diligent Robotics' first product, Moxi, is a mobile manipulation service ...

keynote

Published By ACM

Learning to Understand Audio and Multimodal Content

Rosie Jones

WSDM '23: Proceedings of the Sixteenth ACM International Conference on Web Search and Data Mining, February 2023, pp 4–5, https://doi.org/10.1145/3539597.3572333

Music, podcasts and audiobooks are rich audio content types with strong listener engagement. Search and recommendation across these content types can be greatly enhanced with a deep understanding of their content; across audio, text, and other ...

keynote

Published By ACM

Towards Autonomous Driving

Ya-Qin Zhang

WSDM '23: Proceedings of the Sixteenth ACM International Conference on Web Search and Data Mining, February 2023, pp 1, https://doi.org/10.1145/3539597.3572331

The automotive and transportation industry is going through a tectonic shift in the next decade with the advent of Connectivity, Automation, Sharing, and Electrification (CASE). Autonomous driving presents a historical opportunity to transform the ...

keynote

Published By ACM

Why Bother Enabling Biomedical Literature Analysis with Semantics?

Karin Verspoor

WWW '22: Companion Proceedings of the Web Conference 2022, April 2022, pp 822, https://doi.org/10.1145/3487553.3527164

These days, ELMo [3], BERT [1], BART [2] and other similarly cutely-named models appear to have dramatically advanced the state of the art in basically every problem in natural language processing and information retrieval. It can leave a researcher ...

keynote

Published By ACM

Stylistic Control for Neural Natural Language Generation

Shereen Oraby

WWW '22: Companion Proceedings of the Web Conference 2022, April 2022, pp 1179, https://doi.org/10.1145/3487553.3527149

With the rise of conversational assistants, it has become more critical for dialog systems to keep users engaged by responding in a natural, interesting, and often personalized way, even in a task-oriented setting. Recent work has thus focused on ...

keynote

Published By ACM

Adaptively Offloading the Software for Mobile Edge Computing

Xing Chen

WWW '22: Companion Proceedings of the Web Conference 2022, April 2022, pp 940, https://doi.org/10.1145/3487553.3527145

In mobile edge computing (MEC), computation offloading is a promising way to support those resource-constrained mobile devices, since it moves some time-consuming computation activities to nearby edge servers. Owing to the geographical distribution of ...

keynote

Published By ACM

Accurate and Explainable Misinformation Detection: Too Good to be True?

Jose Manuel Gomez-Perez

WWW '22: Companion Proceedings of the Web Conference 2022, April 2022, pp 426–427, https://doi.org/10.1145/3487553.3526943

Many of the challenges entailed in detecting online misinformation are related to our own cognitive limitations as human beings: We can only see a small part of the world at once, so we need to rely on others to pre-process part of that information for ...

keynote

Published By ACM

Graphs in Computer Vision then and now: how Deep Learning has reinvigorated Structural Pattern Recognition

Donatello Conte

WWW '22: Companion Proceedings of the Web Conference 2022, April 2022, pp 1007–1008, https://doi.org/10.1145/3487553.3526096

Computer Vision Problems, such as object detection, object tracking, action recognition and so on, have been, in the past, usually addressed through Statistical Pattern Recognition techniques. SVM, Regression or Neural Networks, are some examples of ...

keynote

Published By ACM

Provably Beneficial Artificial Intelligence

Stuart Russell

IUI '22: 27th International Conference on Intelligent User Interfaces, March 2022, pp 3, https://doi.org/10.1145/3490099.3519388

As AI advances in capabilities and moves into the real world, its potential to benefit humanity seems limitless. Yet we see serious problems including racial and gender bias, manipulation by social media, and an arms race in lethal autonomous weapons. ...

keynote

Published By ACM

Knowledge is Power: Symbolic Knowledge Distillation, Commonsense Morality, & Multimodal Script Knowledge

Yejin Choi

WSDM '22: Proceedings of the Fifteenth ACM International Conference on Web Search and Data Mining, February 2022, pp 3, https://doi.org/10.1145/3488560.3500242

Scale appears to be the winning recipe in today's AI leaderboards. And yet, extreme-scale neural models are still brittle to make errors that are often nonsensical and even counterintuitive. In this talk, I will argue for the importance of knowledge, ...

keynote

Published By ACM

Ethical Challenges in AI

Ricardo Baeza-Yates

WSDM '22: Proceedings of the Fifteenth ACM International Conference on Web Search and Data Mining, February 2022, pp 1–2, https://doi.org/10.1145/3488560.3498370

In the first part we address four current specific challenges through examples: (1) discrimination (e.g., facial recognition, justice, sharing economy, language models); (2) stupid models (e.g., lack of semantic and context understanding); (3) ...

keynote

Published By ACM

The Virtuous Cycles of Determinism: Programming Groq's Tensor Streaming Processor

Satnam Singh

FPGA '22: Proceedings of the 2022 ACM/SIGDA International Symposium on Field-Programmable Gate Arrays, February 2022, pp 153, https://doi.org/10.1145/3490422.3510453

FPGAs and other 2D and 3D spatial computing fabrics share several common characteristics e.g. a deterministic computing model with distributed memories, but also differ along important dimensions e.g. granularity and communication infrastructure. This ...

keynote

Published By ACM

"Deepfake" Portrait Image Generation

Jianfei Cai

ADGD '21: Proceedings of the 1st Workshop on Synthetic Multimedia - Audiovisual Deepfake Generation and Detection, October 2021, pp 5, https://doi.org/10.1145/3476099.3480396

With the prevailing of deep learning technology, especially generative adversarial networks (GAN), generating photo-realistic facial images has made huge progress. Image generation techniques have many good applications such as data augmentation, ...

keynote

Published By ACM

Deep Learning for Historical Data Analysis

Mathieu Aubry

SUMAC'21: Proceedings of the 3rd Workshop on Structuring and Understanding of Multimedia heritAge Contents, October 2021, pp 1, https://doi.org/10.1145/3475720.3476877

This presentation will give an overview of projects on leveraging deep learning for historical data analysis my group did in the last 3 years, partly in the context of the ANR EnHerit project. I will first discuss how deep learning can be used to ...

keynote

Published By ACM

WenLan: Efficient Large-Scale Multi-Modal Pre-Training on Real World Data

Ruihua Song

MMPT '21: Proceedings of the 2021 Workshop on Multi-Modal Pre-Training for Multimedia Understanding, August 2021, pp 3, https://doi.org/10.1145/3463945.3468170

Multi-modal pre-training models have been intensively explored to bridge vision and language in recent years. However, most of them explicitly model the cross-modal interaction between image-text pairs, by assuming that there exists strong semantic ...

keynote

Published By ACM

Cross-modal Pretraining and Matching for Video Understanding

Limin Wang

MMPT '21: Proceedings of the 2021 Workshop on Multi-Modal Pre-Training for Multimedia Understanding, August 2021, pp 1–2, https://doi.org/10.1145/3463945.3468169

Videos are generally accompanied with multi-modal information such as audio, text, and motion. The multi-modal information is becoming an important cue for understanding video content. How to model the correlation between multi-modalities in videos is ...

keynote

Published By ACM

Lightweight Short-term Photovoltaic Power Prediction for Mobile Edge Computing

Albert Y. Zomaya

MSWiM '20: Proceedings of the 23rd International ACM Conference on Modeling, Analysis and Simulation of Wireless and Mobile Systems, November 2020, pp 129, https://doi.org/10.1145/3416010.3431218

To meet the needs for energy savings in Internet of Things (IoT) and mobile systems, solar energy has been increasingly exploited to serve as a green and renewable source to allow systems to better operate in an energy-efficient way. In this respect, ...

keynote

Published By ACM

What Kind of Human-centric Robotics do We Need?: Investigations from Human-robot Interactions in Socially Assistive Scenarios

Ginevra Castellano

HAI '20: Proceedings of the 8th International Conference on Human-Agent Interaction, November 2020, pp 1–2, https://doi.org/10.1145/3406499.3422313

Today we are witnessing an increased robotisation in all areas of society, from manufacturing to assistive technology, from healthcare to education. These application areas require robots to be able to interact with humans in an efficient and socially ...

keynote

Published By ACM

Deep Image Features for Instance-level Recognition and Matching

André Araujo

SUMAC'20: Proceedings of the 2nd Workshop on Structuring and Understanding of Multimedia heritAge Contents, October 2020, pp 1, https://doi.org/10.1145/3423323.3423414

In this talk, I will discuss recent work from our team at Google Research, covering novel methods and datasets. Instance-level recognition, retrieval and matching are key computer vision problems which generally depend on effective image representations, ...

Artificial Intelligence, Machine Learning, Computer Vision, Natural language processing

