A machine learning approach to predict creatine kinase test results

Creative Commons License

Canbolat Z. N., SİLAHTAROĞLU G., DOĞUÇ Ö., Yılmaztürk N.

Emerging Science Journal, vol.4, no.4, pp.283-296, 2020 (Scopus) identifier

  • Publication Type: Article / Article
  • Volume: 4 Issue: 4
  • Publication Date: 2020
  • Doi Number: 10.28991/esj-2020-01231
  • Journal Name: Emerging Science Journal
  • Journal Indexes: Scopus, Directory of Open Access Journals
  • Page Numbers: pp.283-296
  • Keywords: Creatine Kinase, Data Mining, Decision Tree, Laboratory Tests, Machine Learning
  • Istanbul Medipol University Affiliated: Yes


Most of the research done in the literature are based on statistical approaches and used for deriving reference limits based on lab results. As more data are available to the researchers, ML methods are more effectively used by the clinicians and practitioners to reduce cost and provide more accurate diagnoses. This study aims to contribute to the medical laboratory processes by providing an automated method in order to predict the lab results accurately by machine learning from the previous test results. All patient data obtained have been anonymized, and a total of 449,471 test results have been used to build an integrated dataset. A total of 107,646 unique patients’ data has been used. This study aims to predict the value range of the Creatine Kinase tests, which are taken in separate tubes and usually needs more processing time than the other tests do. Using the lab results and the Random Forest Algorithm, this study reports that the outcome of the Creatine Kinase test can be determined with 97% accuracy by using the AST and ALT test values. This is an important achievement for the practitioners and the patients, as this study submits significant reduction in Creating Kinase test evaluation time.