Towards User-intent aware Multimodal Retrieval
Start date
April 2025End date
March 2027Overview
With this project, we aim to create a framework to support vision-language (V-L) research within eBay. We will develop a framework leveraging existing pre-trained vision-language models to improve the performance of matching listings with existing products, and for tasks like product listing moderation, attribute extraction, explained relevance prediction. This framework can later also serve as a testbed for customer support chatbots with image understanding and image-based product search.
People
Principal investigator
Dr Diptesh Kanojia
Senior Lecturer in People-Centred AI
Biography
Researcher working on problems within areas of Natural Language Processing (NLP) and Machine Learning at the Institute for People-Centred AI (PAI) and School of Computer Science and Electronic Engineering. As a research lead, I manage the NLP subgroup within the Nature Inspired Computing and Engineering group (NICE) @ Computer Science Research Centre.
I also lead teaching for the NLP module offered to undergraduate (COM3029) and postgraduate (COMM061) students.
My research focuses on developing scalable and safe human-machine interaction using foundation models. Guided by the principles of Responsible and Inclusive AI, my work emphasises cross-lingual and multimodal learning to address challenges like online toxicity, misinformation, and digital accessibility for low-resource languages. All our research outcomes- code, data, and models- are publicly available on the SurreyNLP GitHub and HuggingFace.
Prior roles include a Postdoctoral Fellowship at the Centre for Translation Studies, a joint PhD from IIT Bombay and Monash University, and Research Engineer at the CFILT Lab.
Co-Investigator
Professor Constantin Orasan
Professor of Language and Translation Technologies
Biography
I am Professor of Language and Translation Technologies at the Centre of Translation Studies, University of Surrey, UK, and a Fellow of the Surrey Institute for People-Centred Artificial Intelligence. Before starting this role, I was Reader (Associate Professor) in Computational Linguistics at the University of Wolverhampton, UK, and the deputy head of the Research Group in Computational Linguistics at the same university. I hold a PhD in computational linguistics and a BSc in computer science.
With over 25 years of experience in the fields of Natural Language Processing, Artificial Intelligence, and Linguistics, I have established myself as a leading researcher in the development of technologies that facilitate access to information. My PhD was in automatic summarisation, and I have led projects on question answering, text simplification, and translation technologies. Notable projects that I have led are EmpASR, an AHRC-funded project focused on training interpreters on how to benefit from the latest developments in artificial intelligence; HarnessingNLP4Court, a UKRI-funded project focused on facilitating access to legal information; the EXPERT project, an Initial Training Network (ITN) funded under the EU’s FP7 to train the next generation of world-class researchers in the field of data-driven translation technology; and the FIRST project, which developed language technologies for making texts more accessible to people with autism.
My current research is interdisciplinary, focusing on the intersection of AI, NLP, and translation studies. In recent years, I have increasingly focused on the practical application of NLP to support translators and interpreters. My recent publications explore reference-less translation evaluation, the processing of multilingual content in low-resource settings, the use of automatic speech recognition to support interpreters, and the use of large language models in text accessibility. My research is well known as a result of over 150 peer-reviewed articles in journals, books, and international conferences.
I am currently leading an EPSRC-funded project focused on making science accessible, and I am Co-Director of the ADA Leverhulme Doctoral Scholarships Network. More information about my work can be found at https://dinel.org.uk/.
Funding amount
£120,000
Funder
eBay
Contact
For enquiries or potential collaboration on this topic please contact Dr Diptesh Kanojia, the Principal Investigator of the project.
See other research projects carried out at the Centre for Translation Studies.
Related sustainable development goals
Research themes
Find out more about our research at Surrey: