profile photo

Tetyana Perchyk


Research Fellow in Health Data Science

Academic and research departments

School of Health Sciences.

Publications

Tetyana Perchyk, Isabella de Vere Hunt, Brian D. Nicholson, Luke Mounce, Kate Sykes, Georgios Lyratzopoulos, Agnieszka Lemanska, Katriina L. Whitaker, Robert S. Kerrison (2025)How well are marginalised groups represented in electronic records? A codelist development project and cross-sectional analysis of UK electronic health records, In: BMJ OpenIn Press(In Press) BMJ Publishing Group

Objectives. Primary care electronic health records provide a rich source of information for inequalities research. However, the reliability and validity of the research derived from these records depends on the completeness and resolution of the codelists (i.e. collection of medical terms/codes) used to identify populations of interest. The aim of this project was to develop comprehensive codelists for identifying people from ethnic minority groups, people with learning disabilities (LD), people with severe mental illness (SMI) and people who are transgender.Design. We followed a three-stage process to define and extract relevant codelists. First, groups of interest were defined a priori. Next, relevant clinical codes, relating to the groups, were identified by searching Clinical Practice Research Datalink (CPRD) publications, codelist repositories and the CPRD Code Browser. Relevant codelists were extracted and merged according to group, and duplicates were removed. Finally, remaining codes were reviewed by two general practitioners.Setting. The curated codelists were compared using a representative sample in the United Kingdom. The frequencies of individuals identified using the curated codelists were assessed and compared to widely used alternative codelists.Participants. Comprehensiveness was assessed in a representative CPRD population of 10,966,759 people.Results. After removal of duplicates and GP review, codelists were finalized with 325 unique codes for ethnicity, 558 for LD, 499 for SMI, and 38 for transgender. Compared with comparator codelists, an additional 48,017 (76.6%), 52,953 (68.9%) and 508 (36.9%) people with a LD, SMI or transgender code were identified. The proportions identified for ethnicity, meanwhile, were consistent with expectations for the UK (e.g. 6.1% Asian, 2.7% black, 1.4% mixed). Conclusions. The curated codelists are more sensitive than those widely used in practice and research. Discrepancies between national estimates and primary care records suggest potential record/retention issues. Resolving these requires further investigation and could lead to improved data quality for research.

Katriina Whitaker, Tetyana Perchyk, Robert S. Kerrison, Agnieszka Lemanska (2024)Challenges in understanding inequities in help-seeking for possible cancer symptoms, In: BMC global and public health249 Springer

Tackling inequities in cancer outcomes is a global health priority. One avenue for improving early diagnosis of cancer is to ensure people know when and how to seek help for cancer symptoms and that this knowledge (and behaviour) is equitably distributed across the population. In this perspective piece we highlight the challenges in understanding sociodemographic differences in help-seeking behaviour (for example, how help-seeking is defined / conceptualised and subsequently assessed), as well as challenges with using existing datasets that are now more readily accessible than ever. Addressing these will strengthen methodological approaches to understand inequities in help-seeking and ways to tackle them.