Predicting the Progression of Non-Alcoholic Fatty Liver Disease Using Machine Learning and Clinical Laboratory Parameters: A Systematic Review

Aila Mustofa; Catherine Halim

doi:10.70070/pmray255

Predicting the Progression of Non-Alcoholic Fatty Liver Disease Using Machine Learning and Clinical Laboratory Parameters: A Systematic Review

Authors

Aila Mustofa General Practitioner, Kimia Farma Clinical Laboratory, Surakarta City, Central Java, Indonesia
Catherine Halim Clinical Pathology Consultant, , Kimia Farma Clinical Laboratory, Surakarta City, Central Java, Indonesia

Keywords:

Non-Alcoholic Fatty Liver Disease (NAFLD), Non-Alcoholic Steatohepatitis (NASH), Liver Fibrosis, Machine Learning, Predictive Modeling, Clinical Laboratory Parameters, Systematic Review

Abstract

Introduction: Non-alcoholic fatty liver disease (NAFLD) is a growing global health crisis, with a significant proportion of patients progressing to severe liver pathologies, including non-alcoholic steatohepatitis (NASH), fibrosis, and cirrhosis. The limitations and risks of invasive liver biopsy, the current gold standard for diagnosis, necessitate the development of accurate, non-invasive tools for risk stratification and disease monitoring. Machine learning (ML) models, which utilize routinely collected clinical laboratory data, have emerged as a promising and scalable solution for predicting disease progression. This review synthesizes the current evidence on the efficacy of these models.

Methods: A systematic literature search was conducted across PubMed, Google Scholar, Semanthic Scholar, Springer, Wiley Online Library databases in accordance with PRISMA guidelines. The search included studies that developed or validated ML models to predict NAFLD progression (to NASH, significant fibrosis, or advanced fibrosis using clinical laboratory parameters as primary predictors. Data on study design, population characteristics, ML algorithms, key predictors, and a full spectrum of performance metrics were extracted. The methodological quality of each study was rigorously assessed using the Prediction model Risk Of Bias ASsessment Tool (PROBAST).

Results: Sixteen studies met the inclusion criteria, encompassing a diverse range of populations and model architectures. The primary outcomes predicted were progression to NASH, significant fibrosis, and advanced fibrosis. Ensemble ML models, particularly eXtreme Gradient Boosting (XGBoost) and Random Forest (RF), consistently demonstrated superior predictive performance over traditional statistical models and other ML algorithms. For the critical endpoint of advanced fibrosis, these models frequently achieved Area Under the Receiver Operating Characteristic (AUROC) values exceeding 0.85 and, in some cases, approaching 0.92. A core set of laboratory parameters—including alanine aminotransferase (ALT), aspartate aminotransferase (AST), gamma-glutamyl transpeptidase (GGT), platelet count, triglycerides, and glycated hemoglobin (HbA1c)—were consistently identified as the most important predictors across multiple models, reflecting their central role in the pathophysiology of NAFLD.

Discussion: The evidence strongly indicates that ML models can effectively integrate complex, non-linear patterns from standard laboratory tests to generate a "digital signature" of NAFLD pathophysiology, enabling more accurate and individualized risk stratification than traditional scoring systems. These models hold significant potential for clinical application, from facilitating early identification of high-risk individuals in primary care settings to improving the efficiency of patient enrollment in clinical trials for emerging NASH therapies. However, the predominance of retrospective study designs, a lack of consistent external validation, and issues with model interpretability are key limitations of the current evidence base that must be addressed.

Conclusion: Machine learning models based on clinical laboratory parameters are powerful non-invasive tools for predicting NAFLD progression. Their high accuracy and reliance on readily available data position them as a transformative technology in hepatology. Future research must prioritize prospective validation in diverse, real-world clinical settings and focus on developing interpretable, longitudinally-informed models to facilitate their responsible and effective integration into routine clinical practice.

References

Aggarwal, M. and McCullough, A. (2021). Novel Machine Learning Model Identifies NAFLD Patients with Advanced Liver Fibrosis. Cleveland Clinic Consult QD.

Al-Tawarah, Y., Al-Shdaifat, A., Al-Mistarehi, A.-H., Al-Zoubi, R.M., Al-Gharaibeh, N., Al-Rawashdeh, A., Al-Shara, Z., Obeidat, M. and Al-Delaimy, W. (2024). Application of Machine Learning Models in Predicting Non-Alcoholic Fatty Liver Disease Among Inactive Chronic Hepatitis B Patients: A Cross-Sectional Analysis. Journal of Clinical Medicine, 14(14), p.5042.

Anstee, Q.M., Castera, L., Chatterjee, A., Jiang, H., Romero-Gomez, M. and Sanyal, A.J. (2023). Biomarkers of inflammation in non-alcoholic fatty liver disease. Journal of Hepatology, 78(3), pp.643–656.

Asheghi, Z. and Marandi, S.M. (2024). Deep learning-based prediction of fibrosis progression risk in Non-alcoholic fatty liver disease using multi-omics and lifestyle features.

Canbay, A., Kälsch, J., Sowa, J.-P., Sydor, S., Fingas, C., Best, J., Jäschke, C., Cicek, E., Heider, D., Schemitzek, A., Eslam, M., He-Sheng, Q., Bechmann, L.P., Gerken, G., Syn, W.-K. and Canbay, A. (2019). Non-invasive assessment of NAFLD as systemic disease—A machine learning approach. PLOS ONE, 14(3), p.e0214436.

Chang, D., Truong, E., Mena, E., Pacheco, F., Wong, M., Guindi, M., Todo, T., Noureddin, N., Ayoub, W., Yang, J.D., Kim, I.K., Kohli, A., Alkhouri, N., Harrison, S. and Noureddin, M. (2023). Machine learning models are superior to noninvasive tests in identifying clinically significant stages of NAFLD and NAFLD-related cirrhosis. Hepatology Communications, 7(1), p.e0155.

Charu, V., Liang, J.W., Mannalithara, A., Kwong, A., Tian, L. and Kim, W.R. (2024). Benchmarking clinical risk prediction algorithms with ensemble machine learning for the noninvasive diagnosis of liver fibrosis in NAFLD. Hepatology, 80(5), pp.1293–1302.

Chen, Y.-Y., Lin, C.-Y., Yen, H.-H., Su, P.-Y., Zeng, Y.-H., Huang, S.-P. and Liu, I.-L. (2022). Machine-Learning Algorithm for Predicting Fatty Liver Disease in a Taiwanese Population. Journal of Personalized Medicine, 12(7), p.1026.

Corey, K., Docherty, M., Regnier, S.A., Capkun, G., Balp, M.M., Ye, Q., Janssens, N., Tietz, A., Löffler, J., Cai, J., Pedrosa, M.C., Francque, S. and Tapper, E.B. (2021). Development of a novel machine learning model to predict presence of nonalcoholic steatohepatitis. Journal of the American Medical Informatics Association, 28(6), pp.1235–1241.

Ghandian, S., Thapa, R., Garikipati, A., Barnes, G., Green-Saxena, A., Calvert, J., Mao, Q. and Das, R. (2022). Machine learning to predict progression of non-alcoholic fatty liver to non-alcoholic steatohepatitis or fibrosis. JGH Open, 6(3), pp.196–204.

Higgins, J.P.T., Altman, D.G., Gøtzsche, P.C., Jüni, P., Moher, D., Oxman, A.D., Savović, J., Schulz, K.F., Weeks, L. and Sterne, J.A.C. (2011). The Cochrane Collaboration’s tool for assessing risk of bias in randomised trials. BMJ, 343, p.d5928.

Ji, W., Li, Y., Wang, M., Yang, S., Lin, H., Liu, S., Zhou, J., Li, L. and Liu, S. (2022). Application of machine learning in the screening of non-alcoholic fatty liver disease. Frontiers in Public Health, 10, p.846118.

Lee, J., Westphal, M., Vali, Y., Boursier, J., Ostroff, R., Alexander, L., Chen, Y., Fournier, C., Geier, A., Francque, S., Wonders, K., Tiniakos, D., Bedossa, P., Allison, M., Papatheodoridis, G., Cortez-Pinto, H., Pais, R., Dufour, J.-F., Leeming, D.J., Harrison, S., Cobbold, J., Holleboom, A.G., Yki-Järvinen, H., Crespo, J., Ekstedt, M., Aithal, G.P., Bugianesi, E., Ratziu, V., Sanyal, A.J., Anstee, Q.M. and Brass, C. (2023). Machine learning algorithm improves the detection of NASH (NAS-based) and at-risk NASH: a development and validation study. Hepatology, 78(1), pp.258–271.

Moons, K.G.M., Wolff, R.F., Riley, R.D., Whiting, P.F., Westwood, M., Collins, G.S., Reitsma, J.B., Knottnerus, A. and de Vet, H.C.W. (2019). PROBAST: A Tool to Assess the Risk of Bias and Applicability of Prediction Model Studies. Annals of Internal Medicine, 170(1), p.51.

Paik, J.M., Pan, T., Huang, G., Wu, C.-T., Lin, Y.-J. and Yu, C.-S. (2025). A prediction model for the risk of Non-alcoholic fatty liver disease using the random survival forests. Gene, 928, p.148612.

Pan, T., Huang, G., Wu, C.-T., Lin, Y.-J. and Yu, C.-S. (2025). A prediction model for the risk of Non-alcoholic fatty liver disease using the random survival forests. Gene, 928, p.148612.

Peng, H.-Y., Duan, S.-J., Pan, L., Wang, M.-Y., Chen, J.-L., Wang, Y.-C. and Yao, S.-K. (2023). Development and validation of machine learning models for nonalcoholic fatty liver disease. Hepatobiliary & Pancreatic Diseases International, 22(6), pp.568–576.

Ratziu, V., Noureddin, M., Harrison, S.A., Anstee, Q.M., Sanyal, A.J., Bedossa, P., Loomba, R., Francque, S., Tack, A., Saffo, S., Taylor-Weiner, A., Tuthill, T. and Goodman, Z. (2024). Artificial intelligence-powered digital pathology reproduces pathologist assessments of NASH histological improvement and provides a more sensitive measure of fibrosis change. Hepatology, 79(1), pp.99–110.

Sabet Sarvestany, S., Leng, Q., Fouteh, M., Martel, M., Barkun, A.N., Bhat, M. and Cooper, C. (2022). Development and validation of an ensemble machine learning framework for detection of all-cause advanced hepatic fibrosis: a retrospective cohort study. The Lancet Digital Health, 4(3), pp.e187–e196.

Souza-Guedes, P.F., Barboza, T.L.A., Leite, N.C., de Souza, G.F., de Carvalho, J.R.L. and Telles, R.W. (2022). Application of machine learning methods to predict nonalcoholic fatty liver disease and nonalcoholic steatohepatitis using simple and cost-effective biomarkers. World Journal of Artificial Intelligence, 3(3), pp.80–94.

Su, P.-Y., Chen, Y.-Y., Lin, C.-Y., Yen, H.-H., Liu, I.-L., Huang, S.-P. and Zeng, Y.-H. (2024). Interpretable machine learning models for predicting lean non-alcoholic fatty liver disease risk in patients with type 2 diabetes mellitus. Frontiers in Endocrinology, 16.

Taylor-Weiner, A., Pokkalla, H., Han, L., Jia, C., Huss, R., Chung, C., Elliott, H., Glass, B., Anstee, Q.M., Noureddin, M., Sanyal, A.J., Goodman, Z., Ratziu, V., Harrison, S.A. and Luedde, T. (2021). A Machine Learning Approach Enables Quantitative Measurement of Liver Histology and Disease Monitoring in NASH. Hepatology, 74(1), pp.133–147.

Xiao, G., An, P., Wang, Y., Zhang, F., Xu, Y., Zheng, R., Wu, J., Zhang, J., Chen, S., Wang, T., Zhang, J., You, H., Jia, J. and Zhao, H. (2022). The role of artificial intelligence in non-alcoholic fatty liver disease, non-alcoholic steatohepatitis and liver fibrosis. Annals of Medicine, 54(1), pp.26–36.

Xiao, T., Yip, T.C.-F., Liu, D., Tse, Y.-K., Huang, H., Zhang, J., El-Serag, H.B. and Wong, G.L.-H. (2022). Noninvasive Diagnosis of Nonalcoholic Steatohepatitis and Advanced Fibrosis in Patients With Nonalcoholic Fatty Liver Disease: A Machine Learning Approach. JMIR Medical Informatics, 10(6), p.e36997.

Xiong, F.-X., An, F.-P., Gao, Y.-Y., Zhang, T.-N., Nan, Y.-M., Fan, J.-G., Lu, L.-G., Zheng, M.-H. and Shi, J.-P. (2025). Development of a machine learning-based diagnostic model for advanced liver fibrosis in non-alcoholic steatohepatitis patients. World Journal of Gastroenterology, 31(9), pp.101383–101383.

Yip, T.C.-F., Ma, A.J., Wong, V.W.-S., Chow, A.M., Lee, C., Chan, J.K., Lam, K.L., Tse, Y.-K., Wong, G.L.-H., Harrison, S.A., Noureddin, M., Sanyal, A.J., Alkhouri, N. and Dunn, W. (2023). MACHINE LEARNING ADVANCED FIBROSIS IN NASH (ALADDIN) WITH WEB-BASED CALCULATION FOR PROBABILITY PREDICTION. Hepatology, 78, pp.S829–S835.

Zamanian, R., Ebrahimiyan, H., Asgari, E., Mamani, M., Khodamoradi, Z., Mohebi, Z. and Ebrahimi, A. (2024). Machine learning approaches for early detection of non-alcoholic steatohepatitis based on clinical and blood parameters. Scientific Reports, 14(1), p.2131.

Zheng, J.-R., Wang, Z.-L., Chen, H.-S. and Feng, B. (2024). Machine learning-based mortality prediction models for non-alcoholic fatty liver disease in the general United States population: a study based on NHANES-III.

Zhu, H., Shen, D., Gan, X. and Sun, D. (2024). An Interpretable Machine Learning Model Based on Inflammatory–Nutritional Biomarkers for Predicting Metachronous Liver Metastases After Colorectal Cancer Surgery. Biomolecules, 13(7), p.1706.

Predicting the Progression of Non-Alcoholic Fatty Liver Disease Using Machine Learning and Clinical Laboratory Parameters: A Systematic Review

Predicting the Progression of Non-Alcoholic Fatty Liver Disease Using Machine Learning and Clinical Laboratory Parameters: A Systematic Review

Authors

Keywords:

Abstract

References

Downloads

Published

Issue

Section

License

How to Cite

Additional Information Feature

Indexed By