Please login to be able to save your searches and receive alerts for new content matching your search criteria.
When one hears the word Metaverse, it is automatically associated with millions of users, immersive experiences and its potential to change our lives. But, what enables the Metaverse to function at such a scale? This talk will present the different ...
This talk will be focused on the unique challenges in deploying a mobile manipulation robot into an environment where the robot is working closely with people on a daily basis. Diligent Robotics' first product, Moxi, is a mobile manipulation service ...
Music, podcasts and audiobooks are rich audio content types with strong listener engagement. Search and recommendation across these content types can be greatly enhanced with a deep understanding of their content; across audio, text, and other ...
The automotive and transportation industry is going through a tectonic shift in the next decade with the advent of Connectivity, Automation, Sharing, and Electrification (CASE). Autonomous driving presents a historical opportunity to transform the ...
These days, ELMo [3], BERT [1], BART [2] and other similarly cutely-named models appear to have dramatically advanced the state of the art in basically every problem in natural language processing and information retrieval. It can leave a researcher ...
With the rise of conversational assistants, it has become more critical for dialog systems to keep users engaged by responding in a natural, interesting, and often personalized way, even in a task-oriented setting. Recent work has thus focused on ...
In mobile edge computing (MEC), computation offloading is a promising way to support those resource-constrained mobile devices, since it moves some time-consuming computation activities to nearby edge servers. Owing to the geographical distribution of ...
Many of the challenges entailed in detecting online misinformation are related to our own cognitive limitations as human beings: We can only see a small part of the world at once, so we need to rely on others to pre-process part of that information for ...
Computer Vision Problems, such as object detection, object tracking, action recognition and so on, have been, in the past, usually addressed through Statistical Pattern Recognition techniques. SVM, Regression or Neural Networks, are some examples of ...
As AI advances in capabilities and moves into the real world, its potential to benefit humanity seems limitless. Yet we see serious problems including racial and gender bias, manipulation by social media, and an arms race in lethal autonomous weapons. ...
Scale appears to be the winning recipe in today's AI leaderboards. And yet, extreme-scale neural models are still brittle to make errors that are often nonsensical and even counterintuitive. In this talk, I will argue for the importance of knowledge, ...
In the first part we address four current specific challenges through examples: (1) discrimination (e.g., facial recognition, justice, sharing economy, language models); (2) stupid models (e.g., lack of semantic and context understanding); (3) ...
FPGAs and other 2D and 3D spatial computing fabrics share several common characteristics e.g. a deterministic computing model with distributed memories, but also differ along important dimensions e.g. granularity and communication infrastructure. This ...
With the prevailing of deep learning technology, especially generative adversarial networks (GAN), generating photo-realistic facial images has made huge progress. Image generation techniques have many good applications such as data augmentation, ...
This presentation will give an overview of projects on leveraging deep learning for historical data analysis my group did in the last 3 years, partly in the context of the ANR EnHerit project. I will first discuss how deep learning can be used to ...
Multi-modal pre-training models have been intensively explored to bridge vision and language in recent years. However, most of them explicitly model the cross-modal interaction between image-text pairs, by assuming that there exists strong semantic ...
Videos are generally accompanied with multi-modal information such as audio, text, and motion. The multi-modal information is becoming an important cue for understanding video content. How to model the correlation between multi-modalities in videos is ...
To meet the needs for energy savings in Internet of Things (IoT) and mobile systems, solar energy has been increasingly exploited to serve as a green and renewable source to allow systems to better operate in an energy-efficient way. In this respect, ...
Today we are witnessing an increased robotisation in all areas of society, from manufacturing to assistive technology, from healthcare to education. These application areas require robots to be able to interact with humans in an efficient and socially ...
In this talk, I will discuss recent work from our team at Google Research, covering novel methods and datasets. Instance-level recognition, retrieval and matching are key computer vision problems which generally depend on effective image representations,...