CVSSP projects

We have a collaborative research and innovation portfolio of over £30m in current projects supported by government, industry and third sector organisations. Our centre leads multiple national and international flagship programmes in AI and machine learning, together with a large portfolio of collaborative research, development and technology transfer projects.

Primary funders include UKRI (EPSRC, InnovateUK, BBSRC, MRC, AHRC) with a current EPSRC portfolio of £20m, Royal Society, Royal Academy of Engineering, Wellcome Trust, Cancer Research UK, Dementia Research UK, Alzheimer’s Society, BBC, NPL, MoD, Dstl, EU and SNSF. Direct industry funding of research and licensing of CVSSP technology is over £4m with substantial additional in-kind support for research and facilities.

National flagship programmes in AI

CVSSP leads national and international flagship research programmes in AI including: the DECaDE UKRI Digital Economy Centre in AI and Blockchain; a UKRI-EPSRC Prosperity Partnership in AI for Creative Industries; and two UKRI-EPSRC programme grants in face recognition and spatial audio. International partnerships include the MURI/EPSRC programme leading fundamental AI advances for multimodal semantic information.

The centre has also received strategic UKRI-EPSRC Platform Grant support since 2003 to underpin continuity of leading UK expertise in audio-visual AI.

EPSRC Prosperity Partnership with the BBC, AI4ME: AI for personalised media experiences - £14.7m, incl. £8.7m industry funding

This partnership addresses key challenges for personalised content creation and delivery at scale using AI and Object-Based Media (OBM), focused on enabling media experiences which adapt to individual preferences, accessibility requirements, devices and location. This will position the UK at the forefront of the ‘Personalised Media’ revolution, enabling the creation and delivery of new services and opening new horizontal markets for the creative industry.

EPSRC DECaDE: National Centre for Decentralised Digital Economy bringing together AI and blockchain technology - £10m, incl. £6m industry funding and 33 partners

This 5-year National Research Centre explores how emerging data technologies such as Distributed Ledger Technology (Blockchain) and AI could transform the digital economy through decentralised platforms. DECaDE is part of a programme of Next Stage Digital Economy Centres that will take forward previously funded inter/multidisciplinary applied digital economy research to “the next stage”, easing the pathway to commercialisation.

EPSRC Fellowship: AI for Sound - £2.5m

Focused on four application use cases: (i) monitoring sounds of human activity in the home for assisted living; (ii) measuring sounds in non-domestic buildings to improve the office and workplace environment; (iii) measuring sounds in smart cities to improve the urban environment; and (iv) developing tools to use sounds to help producers and consumers of broadcast creative content. The aim is to deliver a step-change in research bringing "AI for Sound" technology out of the lab to benefit society and the economy.

MURI: Semantic information pursuit for multimodal data analysis (2018-2023)

The goal of the research is to advance machine perception technology, allowing cameras and microphones to extract useful information from an environment and separate it from ‘nuisance’ factors such as illumination, blur, noise and object pose.

FACER2VM: Face Matching for Automatic Identity Retrieval, Recognition, Verification and Management (2016-2021)

The project will develop unconstrained face recognition technology, which is robust to a range of degradation factors, for applications in the Digital Economy and in a world facing global security issues, as well as demographic changes.

S3A: Future Immersive Spatial Audio (2013-2019)

The goal of S3A is to deliver a step-change in the quality of audio consumed by the general public, using novel audio-visual signal processing to enable immersive audio to work outside the research laboratory in everyone’s homes.

Audio-Visual Media Platform Grant (2003-2022)

Platform Grant support for audio-visual media processing is supporting a critical mass of joint research expertise in multi-sensory machine perception. This is critical to achieving machines which can hear and see to understand and interact with real-world dynamic scenes.

Fellowships

CVSSP hosts a number of prestigious personal fellowships for established and early career academics to support national and international research leadership. The Centre is keen to host and support individual fellows as they become independent research leaders and grow impactful areas of research and collaboration across multiple disciplines related to AI.

Please get in touch if you are interested in applying for a fellowship to be hosted in CVSSP.

Meet our current fellows:

Dr Lucia Florescu - Wellcome Trust Fellowship 'Optical Characterisation of Epithelial Tissue Function and Metabolism for Early Cancer Diagnosis and Treatment Monitoring'

The aim of this project is to develop and evaluate an optical imaging technology that will enable clinicians to see accurate 3D images of precancerous and cancerous changes in the epithelial tissue, the extent of disease, and early changes associated with treatment.

Prof. Adrian Hilton - Royal Society Wolfson Research Merit Award '4D Computer Vision Modelling'

Pioneering research into 4D computer vision modelling – technology that is changing the way we analyse sport, enjoy digital entertainment and diagnose medical conditions.

Dr Charles Malleson - Leverhulme/Royal Society Fellowship 'Animo – Tracking and Understanding Animal Motion in the Wild'

The aim of this project is to investigate non-contact tracking and understanding of animal motion in unconstrained environments.

Dr Armin Mustafa - The Royal Academy of Engineering Fellowship '4D Vision for Perceptive machines'

The aim of this project is to better understand complex scenes so that machines can efficiently model and interpret the real world for a range of socially beneficial applications including autonomous systems, augmented reality and healthcare.

Prof. Mark Plumbley - EPSRC Fellowship ‘AI for Sound’

The aim of this project is to transform how AI understands everyday sounds – from our homes, outdoor environments, to the workplace – tackling key issues that have prevented computational analysis of sound from reaching its potential.

Prof. Wenwu Wang - DUO-India Fellowship Programme on Audio Scene Analysis and Source Separation

The project aims to develop new deep embedding techniques for audio scene analysis and source separation, mostly oriented towards problems in audio event detection in environmental sounds and for music-related applications.

Project portfolio

Our research has pioneered new technologies for the benefit of society and the economy, with applications spanning healthcare, security, entertainment, robotics, autonomous vehicles, communication and audio-visual data analysis.

Creative vision and sound

Creative Vision focuses on machine perception for creative technologies, specialising in 4D immersive VR content production, performance capture and video-based animation for film and games.

Creative Sound works on spatial audio and machine audition, developing audio signal processing technology related to sound recognition and immersive audio experiences.

Healthcare

Healthcare focuses on medical imaging technologies for cancer detection and machine learning in personalised care for better living and healthy ageing.

PROTEIN: PeRsOnalized nutriTion for hEalthy livINg

Contact: Dr Kevin Wells

Funder: European Commission

Dates: 2018 - 2022

Proper nutrition is essential for good health, well-being and the prevention, mitigation or treatment of a number of non-communicable diseases (NCDs). Food is not only a source of calories, but also a complex mixture of dietary chemicals, some of which are directly related to cardiovascular diseases, diabetes, allergies and some types of cancer.

Foods, diet and nutritional status, including overweight and obesity, are also associated with elevated blood pressure and blood cholesterol or even resistance to the action of insulin. These conditions are not only risk factors for non-communicable diseases, but major causes of illness themselves. However, today's diet is characterized by irregular and poorly balanced meals.

Unhealthy eating habits in our daily life are not only risk factors for non-communicable diseases, but also major causes of stress and tiredness, i.e., lack of energy. Knowledge about our dietary habits based on the analysis of diverse types of information, including individual parameters, can contribute greatly towards answering key questions to respond to societal challenges regarding food and health. Find out more on the PROTEIN project page.

Collaborators: University of Surrey (United Kingdom), Intrasoft International Sa (Luxembourg), Ocado Group Plc (United Kingdom), Biosense Institute - Research And Development Institute For Information Technologies In Biosystems (Serbia), Aristotelio Panepistimio Thessalonikis (Greece), Katholieke Universiteit Leuven (Belgium), Datawizard Srl (Italy), Charite - Universitaetsmedizin Berlin (Germany), Cognicase Management Consulting Sl (Spain), The European Association For The Study Of Obesity - Ireland Company Limited By Guarantee (Ireland), Plux - Wireless Biosignals S.A. (Portugal), Diethnes Panepistimio Ellados (Greece), Istituto Comprensivo Di Boscochiesanuova (Italy), Fluviale - Societa A Responsabilita Limitata (Italy), Healthium - Healthcare Software Solutions, Sa (Portugal), Agrifood Capital Bv (Netherlands), Sport Lisboa E Benfica - Futebol Sad (Portugal), Istituto Comprensivo Statale B. Lorenzi Fumane Vr (Italy), Virtuagym Bv (Netherlands).

Robotics

Robotics works on autonomous systems, covering a broad range of technologies related to visual human-machine interaction. These include sign language technology and autonomous vehicles.

ROSSINI: Reconstructing 3D structure from single images: a perceptual reconstruction approach

Principal investigator: Prof. Richard Bowden.

Funder: EPSRC

Dates: 2019 - 2022

Consumers enjoy the immersive experience of 3D content in cinema, TV and virtual reality (VR), but it is expensive to produce. Filming a 3D movie requires two cameras to simulate the two eyes of the viewer. A common but expensive alternative is to film a single view, then use video artists to create the left and right eyes' views in post-production. What if a computer could automatically produce a 3D model (and binocular images) from 2D content: 'lifting images into 3D'? This is the overarching aim of this project. Lifting into 3D has multiple uses, such as route planning for robots, obstacle avoidance for autonomous vehicles, alongside applications in VR and cinema.

ROSSINI will develop a new machine vision system for 3D reconstruction that is more flexible and robust than previous methods. Focussing on static images, we will identify key structural features that are important to humans. We will combine neural networks with computer vision methods to form human-like descriptions of scenes and 3D scene models. Our aims are to (i) produce 3D representations that look correct to humans even if they are not strictly geometrically correct, (ii) do so for all types of scene, and (iii) express the uncertainty inherent in each reconstruction. To this end we will collect data on human interpretation of images and incorporate this information into our network. Our novel training method will learn from humans and existing ground truth datasets, with the training algorithm selecting the most useful human tasks (e.g. judging depth within a particular image) to maximise learning. Importantly, the inclusion of human perceptual data should reduce the overall quantity of training data required, while mitigating the risk of over-reliance on a specific dataset. Moreover, when fully trained, our system will produce 3D reconstructions alongside information about the reliability of the depth estimates. Find out more on the project page.

Collaborators: University of Surrey, Aston University, CrossWing, Double Negative Ltd, Microsoft.

Security and data

The Security theme works on biometrics-related technologies, specialising in facial recognition and natural language interfaces for human-AI collaboration.

The Data research theme addresses the application of AI for audio-visual information search, understanding and preservation, including visual recognition, distributed ledger technologies and the understanding of AI systems.

DLT

We are investigating alternative uses for distributed ledger technology (DLT), including safe online identity, healthcare, and secure digital archives. The new approach, fusing DLT (trusted data) and AI (making sense of that data), is a common thread across all of our projects in DLT and a unique perspective on this emerging technology pioneered by the University of Surrey.

Strategic funding

Research at CVSSP

Can Machines Think? Past, present and future of AI

Over the past thirty years, we have become an international centre of excellence for training and research in audio and visual machine perception in collaboration with industry.

Past projects

Further details about these projects can be obtained either by visiting the relevant websites or by contacting those involved in the research. The list is non-exhaustive.

AI4ME: AI for Media Experiences (2021-2026)

AI for Sound - EPSRC Senior Fellowship (2020-2025)

InHEAR: Intelligent hearables with environment-aware rendering (2020-2024)

4D Vision for Perceptive machines - The Royal Academy of Engineering Fellowship (2018-2024)

DUO-India Fellowship Programme (Professors) on Audio Scene Analysis and Source Separation (2020-2022)

MAchine learNing acousTIc Surveillance MANTIS Phase 2 (2020-2021)

Automated Captioning of Image and Audio for Visually and Hearing Impaired (2021-2023)

SIGNetS: signal and information gathering for networked surveillance (2020-2023)

Animo – Tracking and Understanding Animal Motion in the Wild - The Leverhulme Trust Fellowship (2019 - 2022)

Polymersive: Immersive Video Production Tools for Studio and Live Events (2019-2021)

Radiomics and Data Science in Medical Imaging for Cancer

Phantoms for Audit of the MR-Linac

Engineering Integrated Dementia cAre (EIDA) UK Dementia Research Institute (DRI) Care Research & Technology Centre (2019-2025)

PROTEIN: PeRsOnalized nutriTion for hEalthy livINg (2018-2022)

Optical Characterisation of Epithelial Tissue Function and Metabolism for Early Cancer Diagnosis and Treatment Monitoring - Wellcome Trust Fellowship, Lucia Florescu (2017-2022)

RetinaUWF: AI Detection of Diabetic Retinopathy in Ultra-Wide-Field Retinal Images (2019-2021)

LMDP: Low-cost Portable Molecular Diagnostic Platform for Rapid Detection of Poultry Infectious Pathogens (2018-2021)

Covid-19 IF - Smart Rapid COVID-19 Testing and Tracing system (2020-2021)

Scalable Multimodal sign language technology for sIgn language Learning and assessment Phase-II (2021-2024)

Reflexive robotics using asynchronous perception (2020-2023)

ROSSINI: Reconstructing 3D structure from single images: a perceptual reconstruction approach (2019-2022)

ExTOL: End to End Translation of British Sign Language (2018-2021)

MVSE: Multimodal Video Search by Examples (2021-2024)

FACER2VM: Face Matching for Automatic Identity Retrieval, Recognition, Verification and Management (2016-2021)

MURI: Semantic information pursuit for multimodal data analysis (2017-2023)

IoT-Crawler: A Distributed Framework for Massive Multi-modal Data Stream Discovery and Predictive Analysis in Internet of Things (2018-2021)

DECaDE: Centre for the Decentralised Digital Economy (2020 - 2025)

Blockstart: Blockchain-based applications for SME competitiveness (2019-2022)

Digital Inspiration and Search in the National Archives (2020-2021)

iFlyTek-Surrey Joint Research Centre on Artificial Intelligence (2019 - 2025)

Audio-Visual Media Research Platform (2017-2022)

EPSRC Capital award for core equipment (2019-2021)

Experimental Equipment Call (2015-2021)

Past projects

Video retrieval

Artistic rendering of consumer video (HP Labs)

Spot the difference (JISC)

Visual Media

Digital Doubles for Film Production - Royal Society Industry Fellowship with Framestore (2008-2012)

4D computer vision modelling - Royal Society Wolfson Research Merit Award

Digital doubles: From real actors to computer generated digital doubles (Framestore, Royal Society)