Machine learning-enabled systematic review on coded healthcare data in heart failure research
European Heart Journal - Digital Health

Abstract
Coded healthcare data are now commonly used in clinical research. This study aimed to assess the transparency of reporting within heart failure studies and employ machine learning to facilitate larger-scale evaluation.
A systematic search of EMBASE and MEDLINE (2015–2020) identified 4279 heart failure studies with accessible Extensible Markup Language published in the top 25 journals by impact factor. Manual extraction in a random sample of 170 studies by independent human reviewers characterized 40 studies (23.5%) that used coded healthcare data, with 34 of these (85%) reporting doing so and only 19 (47.5%) providing clear descriptions of dataset construction and linkage. Another 420 studies underwent manual annotation to further train a Natural Language Processing (NLP) model designed for this study to automate and upscale review. The NLP model processed 3689 studies with a high level of internal accuracy (area under the receiver operating characteristic curve 0.97 and F1 score 0.96). Overall, the NLP approach identified 782 studies (21.2%) that reported coded healthcare data usage (95% CI 19.8–20.9%). No correlation was found between the reporting of coded healthcare data use and the publication year (r = −0.05;
One-fifth of contemporary heart failure research articles are already reporting the use of coded healthcare data, with at-scale evaluation facilitated by a machine-learning model. The limited transparency on how coded healthcare data were used in studies highlights the need for quality standards such as the CODE-EHR framework for the use of healthcare data in research.
Contributors

Simrat Gill
Author
University of Birmingham Birmingham , United Kingdom of Great Britain & Northern Ireland

Tomasz Dyszynski
Author

Megan Schröder
Author

Kiliana Suzart-Woischnik
Author

Benoit Tyl
Author

Guillaume Allée
Author

Asgher Champsi
Author

Karin T Slater
Author

Alfonso Sartorius
Author

R Thomas Lumbers
Author
Barts Heart Centre London , United Kingdom of Great Britain & Northern Ireland

Folkert W Asselbergs
Author
Amsterdam University Medical Centre (AUMC) Amsterdam , Netherlands (The)

Georgios Gkoutos
Author

Dipak Kotecha
Author
University of Birmingham Birmingham , United Kingdom of Great Britain & Northern Ireland
You may be interested in


