1:30pm - 2:30pm

Thursday 24 November 2022

Vision-and-Language Navigation: From Ground to Sky

External Seminar by Dr Qi Wu, University of Adelaide

All Welcome


University of Surrey

This event has passed


  • Dr Qi Wu

Dr Qi Wu


Vision-and-Language Navigation (VLN) is a recently emerged research direction that has attracted much attention from the computer vision, natural language processing and robotics communities. We opened up this direction four years ago by proposing the first benchmarked VLN task and dataset, Room-to-Room (R2R). Since then, many new models and datasets have been proposed, including our REVERIE (Remote Embodied Visual Referring Expression in Real Indoor Environments). In this talk, I will first present the original VLN task and dataset and then discuss some of our recently proposed methods based on it. At the end of the talk, I will also present our recently proposed AerialVLN, which brings VLN into the sky for the first time.


Short bio:

Dr Qi Wu is an Associate Professor at the University of Adelaide and was an ARC Discovery Early Career Researcher Award (DECRA) Fellow from 2019 to 2021. He is the Director of Vision-and-Language at the Australian Institute for Machine Learning. The Australian Academy of Science awarded him a J G Russell Award in 2019. He obtained his PhD in 2015 and his MSc in 2011, both in Computer Science from the University of Bath, United Kingdom. His research interests are mainly in computer vision and machine learning. He currently works on vision-and-language problems, with particular expertise in image captioning and visual question answering (VQA). He has published more than 100 papers in prestigious conferences and journals, such as TPAMI, CVPR, ICCV and ECCV. He also serves as an Area Chair for CVPR and ICCV.


Attend the Event

This is a free hybrid event, open to everyone.