Evaluation of an automatic article selection method for timelier updates of the Comet Core Outcome Set database



Norman, Christopher R, Gargon, EA, Leeflang, Mariska MG, Neveol, Aurelie and Williamson, Paula R ORCID: 0000-0001-9802-6636
(2019) Evaluation of an automatic article selection method for timelier updates of the Comet Core Outcome Set database. Database: the journal of biological databases and curation, 2019.

COMET_automation_accepted.pdf - Accepted Version

Abstract

Curated databases of scientific literature play an important role in helping researchers find relevant literature, but populating such databases is a labour-intensive and time-consuming process. One such database is the freely accessible Comet Core Outcome Set database, which was originally populated using manual screening in an annually updated systematic review. To reduce the workload and facilitate more timely updates, we are evaluating machine learning methods to reduce the number of references that need to be screened. In this study we evaluated a machine learning approach based on logistic regression that automatically ranks the candidate articles. Data from the original systematic review and its first four review updates were used to train the model and evaluate its performance. We estimated that automatic screening would reduce the workload by at least 75% while keeping the share of missed references at around 2%. We judged this to be an acceptable trade-off for this systematic review, and the method is now being used for the next round of the Comet database update.
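
The trade-off reported in the abstract, workload reduction against missed references, can be illustrated with a minimal sketch. It assumes each candidate reference already has a relevance score (for example, a probability from a logistic regression model) and a known relevance label from manual screening. The function name `screening_tradeoff`, the `screen_fraction` parameter, and the toy data are hypothetical and are not taken from the study.

```python
import math


def screening_tradeoff(scores, labels, screen_fraction):
    """Estimate workload reduction and missed share for a ranking cutoff.

    scores: relevance scores, higher means more likely relevant (assumed
            to come from a ranking model such as logistic regression).
    labels: 1 if the reference was judged relevant by manual screening, else 0.
    screen_fraction: share of the ranked list that is still screened manually.
    """
    if len(scores) != len(labels) or not scores:
        raise ValueError("scores and labels must be non-empty and equal length")
    if not 0 < screen_fraction <= 1:
        raise ValueError("screen_fraction must be in (0, 1]")

    # Rank references from highest to lowest score.
    ranked = sorted(zip(scores, labels), key=lambda pair: pair[0], reverse=True)

    # Only the top of the ranking is screened by hand.
    n_screened = math.ceil(screen_fraction * len(ranked))
    found = sum(label for _, label in ranked[:n_screened])
    total_relevant = sum(labels)

    workload_reduction = 1 - n_screened / len(ranked)
    missed_share = (total_relevant - found) / total_relevant if total_relevant else 0.0
    return workload_reduction, missed_share


# Toy data (illustrative only, not results from the study).
toy_scores = [0.9, 0.8, 0.7, 0.6, 0.4, 0.3, 0.2, 0.1]
toy_labels = [1, 1, 0, 1, 0, 0, 0, 0]
reduction, missed = screening_tradeoff(toy_scores, toy_labels, screen_fraction=0.25)
print(f"{reduction:.0%} fewer references screened, {missed:.0%} of relevant references missed")
```

On real data the cutoff would be chosen on earlier review rounds and checked on a later one, so that the estimated share of missed references is not measured on the same data used to pick the cutoff.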

Item Type: Article
Depositing User: Symplectic Admin
Date Deposited: 14 Aug 2019 15:21
Last Modified: 19 Nov 2021 08:10
DOI: 10.1093/database/baz109
Open Access URL: https://academic.oup.com/database/article/doi/10.1...
URI: https://livrepository.liverpool.ac.uk/id/eprint/3051743