Norman, Christopher R, Gargon, EA, Leeflang, Mariska MG, Neveol, Aurelie and Williamson, Paula R ORCID: 0000-0001-9802-6636
(2019)
Evaluation of an automatic article selection method for timelier updates of the Comet Core Outcome Set database.
Database: the journal of biological databases and curation, 2019 (1).
baz109-.
Text
COMET_automation_accepted.pdf - Author Accepted Manuscript Download (223kB) | Preview |
Abstract
Curated databases of scientific literature play an important role in helping researchers find relevant literature, but populating such databases is a labour intensive and time-consuming process. One such database is the freely accessible Comet Core Outcome Set database, which was originally populated using manual screening in an annually updated systematic review. In order to reduce the workload and facilitate more timely updates we are evaluating machine learning methods to reduce the number of references needed to screen. In this study we have evaluated a machine learning approach based on logistic regression to automatically rank the candidate articles. Data from the original systematic review and its four first review updates were used to train the model and evaluate performance. We estimated that using automatic screening would yield a workload reduction of at least 75% while keeping the number of missed references around 2%. We judged this to be an acceptable trade-off for this systematic review, and the method is now being used for the next round of the Comet database update.
Item Type: | Article |
---|---|
Uncontrolled Keywords: | Databases, Factual, Data Mining, Data Curation, Machine Learning, Systematic Reviews as Topic |
Depositing User: | Symplectic Admin |
Date Deposited: | 14 Aug 2019 15:21 |
Last Modified: | 19 Jan 2023 00:30 |
DOI: | 10.1093/database/baz109 |
Open Access URL: | https://academic.oup.com/database/article/doi/10.1... |
Related URLs: | |
URI: | https://livrepository.liverpool.ac.uk/id/eprint/3051743 |