Serologic Detection of Hepatocellular Carcinoma: Application of Machine Learning and Implications for Diagnostic Models.

Johnson, Philip J ORCID: 0000-0003-1404-0209, Bhatti, Ehsan, Toyoda, Hidenori ORCID: 0000-0002-1652-6168 and He, Shan (2024) Serologic Detection of Hepatocellular Carcinoma: Application of Machine Learning and Implications for Diagnostic Models. JCO Clinical Cancer Informatics, 8 (8). e2300199-.

PJ Final Clean Serological HCC.docx - Author Accepted Manuscript
Available under License Creative Commons Attribution.

Official URL: http://dx.doi.org/10.1200/cci.23.00199

Abstract

Purpose

The gender, age, Lens culinaris agglutinin-reactive fraction of alpha-fetoprotein, alpha-fetoprotein, des-gamma-carboxyprothrombin (GALAD) score is a biomarker-based statistical model for the serologic diagnosis of hepatocellular carcinoma (HCC). It was developed and validated using the case-control approach with a view to early detection. Performance has, however, been suboptimal in the first prospective studies, which better reflect the real-world situation. In this article, we report the application of machine learning to a large, prospectively accrued HCC surveillance data set.

Patients and methods

Models were built on a cohort of 3,473 patients with chronic liver disease within a rigorous surveillance program between 1998 and 2014, during which 459 patients with HCC were detected. Two random forest (RF) models were trained. The first RF model uses the same variables as the original GALAD model (GALAD-RF); the second is based on routinely available clinical and laboratory features (RF-practical). For comparison, we evaluated a logistic regression GALAD model trained on this longitudinal prospective data set (termed GALAD-Ogaki).

Results

Models were evaluated using a repeated cross-validation approach, with the metrics averaged over 100 independent runs. As judged by area under the receiver operating characteristic curve (AUROC) and F1 score, the GALAD-RF model significantly outperformed the original GALAD model. The RF-practical model also outperformed the original GALAD model in terms of both AUROC and F1 score, and both models outperformed the individual biomarkers. An online web application implementing the GALAD-RF and RF-practical models is presented.

Conclusion

RF-based models improve on the diagnostic performance of the original GALAD model in the setting of a standard HCC surveillance program. Further prospective validation studies using these models are warranted, and the models could be extended to predict the risk of HCC development over defined periods of time.
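The evaluation scheme described in the abstract (repeated cross-validation with AUROC and F1 averaged over 100 runs) can be illustrated with a minimal sketch. This is not the authors' code. The data are synthetic, all function names are hypothetical, and a single-score threshold classifier stands in for the random forest models so that the example needs only the Python standard library. The 26 cases in 200 subjects are chosen only to roughly match the cohort's case fraction (459 of 3,473).

```python
import random
from statistics import mean

def auroc(labels, scores):
    """AUROC as the fraction of case/control pairs ranked correctly (ties count 0.5)."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))

def f1(labels, preds):
    """F1 score computed from true positives, false positives and false negatives."""
    tp = sum(y == 1 and p == 1 for y, p in zip(labels, preds))
    fp = sum(y == 0 and p == 1 for y, p in zip(labels, preds))
    fn = sum(y == 1 and p == 0 for y, p in zip(labels, preds))
    denom = 2 * tp + fp + fn
    return 2 * tp / denom if denom else 0.0

def stratified_folds(labels, k, rng):
    """Shuffle indices within each class and deal them into k folds."""
    folds = [[] for _ in range(k)]
    for cls in (0, 1):
        idx = [i for i, y in enumerate(labels) if y == cls]
        rng.shuffle(idx)
        for j, i in enumerate(idx):
            folds[j % k].append(i)
    return folds

def fit_threshold(labels, scores):
    """Stand-in 'model': the cut-off on the training data that maximizes F1."""
    candidates = sorted({s for y, s in zip(labels, scores) if y == 1})
    return max(candidates, key=lambda t: f1(labels, [int(s >= t) for s in scores]))

def repeated_cv(labels, scores, n_runs=100, k=5, seed=0):
    """Run k-fold cross-validation n_runs times; return mean AUROC and mean F1."""
    rng = random.Random(seed)
    run_auc, run_f1 = [], []
    for _ in range(n_runs):
        fold_auc, fold_f1 = [], []
        for test_idx in stratified_folds(labels, k, rng):
            held_out = set(test_idx)
            train_idx = [i for i in range(len(labels)) if i not in held_out]
            t = fit_threshold([labels[i] for i in train_idx],
                              [scores[i] for i in train_idx])
            y_test = [labels[i] for i in test_idx]
            s_test = [scores[i] for i in test_idx]
            fold_auc.append(auroc(y_test, s_test))
            fold_f1.append(f1(y_test, [int(s >= t) for s in s_test]))
        run_auc.append(mean(fold_auc))
        run_f1.append(mean(fold_f1))
    return mean(run_auc), mean(run_f1)

# Synthetic data only: 26 cases and 174 controls with one made-up score.
data_rng = random.Random(42)
labels = [1] * 26 + [0] * 174
scores = [data_rng.gauss(2.0, 1.0) if y else data_rng.gauss(0.0, 1.0) for y in labels]

mean_auc, mean_f1 = repeated_cv(labels, scores)
print(f"mean AUROC = {mean_auc:.3f}, mean F1 = {mean_f1:.3f}")
```

Averaging over many reshuffled runs reduces the dependence of the reported metrics on any single partition of the data. Reporting F1 alongside AUROC is informative when cases are a minority, as they are in a surveillance cohort.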

Item Type:	Article
Uncontrolled Keywords:	Humans, Carcinoma, Hepatocellular, Liver Neoplasms, Area Under Curve, Prospective Studies, Machine Learning
Divisions:	Faculty of Health and Life Sciences; Faculty of Health and Life Sciences > Institute of Systems, Molecular and Integrative Biology
Depositing User:	Symplectic Admin
Date Deposited:	19 Mar 2024 10:21
Last Modified:	10 Apr 2024 09:46
DOI:	10.1200/cci.23.00199
URI:	https://livrepository.liverpool.ac.uk/id/eprint/3179656