Minsak J Nanang

Postgraduate Research Student

MSc, CEng, MIET

jn00767@surrey.ac.uk

Academic and research departments

School of Computer Science and Electronic Engineering.

About

My research project

Inclusive video editing and description for personalisation of media.

Through this research, we aim to develop AI-powered media editing solutions that enhance accessibility, representation, and personalisation, ensuring that digital content is reflective of diverse identities and experiences.

Supervisors

Armin Mustafa

Adrian Hilton

Affiliations and memberships

Institute of Engineering and Technology (IET)

Member of the Institution of Engineering and Technology (IET)

Publications

Minsak Nanang, Adrian Hilton, Armin Mustafa Catalogue Grounded Multimodal Attribution for Museum Video under Resource and Regulatory Constraints Arxiv

DOI: 10.48550/arxiv.2603.11147

Audiovisual (AV) archives in museums and galleries are growing rapidly, but much of this material remains effectively locked away because it lacks consistent, searchable metadata. Existing method for archiving requires extensive manual effort. We address this by automating the most labour intensive part of the workflow: catalogue style metadata curation for in gallery video, grounded in an existing collection database. Concretely, we propose catalogue-grounded multimodal attribution for museum AV content using an open, locally deployable video language model. We design a multi pass pipeline that (i) summarises artworks in a video, (ii) generates catalogue style descriptions and genre labels, and (iii) attempts to attribute title and artist via conservative similarity matching to the structured catalogue. Early deployments on a painting catalogue suggest that this framework can improve AV archive discoverability while respecting resource constraints, data sovereignty, and emerging regulation, offering a transferable template for application-driven machine learning in other high-stakes domains.