11:30am - 12:30pm

Thursday 11 July 2024

The many meanings with image pairs

CVSSP External Seminar

Hybrid event - All Welcome!


21BA02 - Arthur C Clarke Building
Stag Hill Campus
University of Surrey
back to all events

This event has passed

In Person only. Room 21BA02 - PAI Seminar room


  • Dr. Liang Zheng

The many meanings with image pairs

Dr. Liang Zheng


Training AI models with image pairs has been studied for a long time and proven very useful. In this talk, I will first revisit popular practices of using data pairs in various computer vision tasks: from face recognition, person re-identification, to contrastive learning in foundation models. I will then discuss human preference data: between a pair of images, people may generally prefer one over the other. This type of data pair can be used to align diffusion models with human preference, so that diffusion models are more likely to generate images that people like. I will describe how we address this problem by aligning human preference at different denoising steps. This method effectively improves stable diffusion (SD) and SDXL models while accelerating the fine-tuning process by 10 times compared with existing methods.


Short bio:

Dr Liang Zheng is an Associate Professor (tenured) at the Australian National University, specialising in computer vision and machine learning. He obtained his Bachelor (2010) and PhD (2015) degrees from Tsinghua University. He is best known for his contributions to the field of object re-identification through useful datasets and algorithms, including Market-1501 (ICCV 2015) and part-based convolutional baseline (ECCV 2018). He also developed a few widely used methods in image classification and multi-object tracking such as random erasing (AAAI 2020) and joint detection and embedding (ECCV 2020). He regularly serves as an area chair for conferences like CVPR and NeurIPS and co-organises the AI CITY Challenges and Vision Datasets Understanding workshops. He is a program chair for ACM Multimedia 2024