Quick jump to page content
  • Main Navigation
  • Main Content
  • Sidebar

  • Home
  • Current
  • Archives
  • Join As Reviewer
  • Info
  • Announcements
  • Statistics
  • About
    • About the Journal
    • Submissions
    • Editorial Team
    • Privacy Statement
    • Contact
  • Register
  • Login
  • Home
  • Current
  • Archives
  • Join As Reviewer
  • Info
  • Announcements
  • Statistics
  • About
    • About the Journal
    • Submissions
    • Editorial Team
    • Privacy Statement
    • Contact
  1. Home
  2. Archives
  3. Vol. 11, No. 2, May 2026
  4. Articles

Issue

Vol. 11, No. 2, May 2026

Issue Published : May 1, 2026
Creative Commons License

This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.

Integrating Tabular Data and Textual Representations for Clinical Risk Prediction Using Machine Learning and Large Language Models

https://doi.org/10.22219/kinetik.v11i2.2570
M.Rafly Rahman
Universitas Muhammadiyah Malang
Setio Basuki
Universitas Muhammadiyah Malang
Muhammad Ilham Perdana
Universitas Muhammadiyah Malang
La Febry Andira Rose Cynthia
Universitas Muhammadiyah Malang

Corresponding Author(s) : La Febry Andira Rose Cynthia

lafebryarc@umm.ac.id

Kinetik: Game Technology, Information System, Computer Network, Computing, Electronics, and Control, Vol. 11, No. 2, May 2026
Article Published : May 1, 2026

Share
WA Share on Facebook Share on Twitter Pinterest Email Telegram
  • Abstract
  • Cite
  • References
  • Authors Details

Abstract

Global health is currently facing serious challenges due to the increasing number of chronic disease patients, such as those with heart failure, diabetes, and cancer. This issue arises from the limitations of electronic health record (EHR) systems, which are not yet fully capable of ensuring accurate clinical diagnoses because of potential data input errors and delays in symptom identification by medical personnel. In response to this issue, this paper focuses on the integration of medical tabular data with a classification approach based on classical machine learning (ML) and large language models (LLM) to improve the accuracy of patient diagnosis predictions. This paper aims to develop and compare the performance of various ML models, such as XGBoost, SVM, and logistic regression, as well as LLM models like Gemini, LLaMA, and Qwen in fine-tuning, few-shot, and zero-shot scenarios. The paper results show that the combination of Gemini and the few-shot approach (250 shots) achieved the highest accuracy of up to 99.8% in predicting heart failure risk. The main finding of this study is that the narrative text representation of tabular data processed with LLM significantly enhances contextual understanding and classification accuracy, making this approach highly potent for application in AI-based clinical decision-making.

Keywords

Medical Tabular Data Large language Models (LLM) Clinical Risk Prediction Data Serialization Few-shot Learning
Rahman, M. ., Basuki, S., Perdana, M. I. ., & Cynthia, L. F. A. R. (2026). Integrating Tabular Data and Textual Representations for Clinical Risk Prediction Using Machine Learning and Large Language Models. Kinetik: Game Technology, Information System, Computer Network, Computing, Electronics, and Control, 11(2), 363-372. https://doi.org/10.22219/kinetik.v11i2.2570
  • ACM
  • ACS
  • APA
  • ABNT
  • Chicago
  • Harvard
  • IEEE
  • MLA
  • Turabian
  • Vancouver
Download Citation
Endnote/Zotero/Mendeley (RIS)
BibTeX
References
  1. N. R. F. Collaboration, “Worldwide trends in diabetes prevalence and treatment from 1990 to 2022: a pooled analysis of 1108 population-representative studies with 141 million participants,” Lancet, vol. 404, no. 10467, pp. 2077–2093, 2024. https://doi.org/10.1016/S0140-6736(24)02317-1
  2. H. Sung et al., “Global Cancer Statistics 2020: GLOBOCAN Estimates of Incidence and Mortality Worldwide for 36 Cancers in 185 Countries.,” CA. Cancer J. Clin., vol. 71, no. 3, pp. 209–249, May 2021. https://doi.org/10.3322/caac.21660
  3. B. Shahim, C. J. Kapelios, G. Savarese, and L. H. Lund, “Global Public Health Burden of Heart Failure: An Updated Review.,” Card. Fail. Rev., vol. 9, p. e11, 2023. https://doi.org/10.15420/cfr.2023.05
  4. T. N. Bogale et al., “Effect of electronic records on mortality among patients in hospital and primary healthcare settings: a systematic review and meta-analyses,” Front. Digit. Heal., vol. 6, no. June, pp. 1–14, 2024. https://doi.org/10.3389/fdgth.2024.1377826
  5. T. S. Hwang, M. Thomas, M. Hribar, A. Chen, and E. White, “The Impact of Documentation Workflow on the Accuracy of the Coded Diagnoses in the Electronic Health Record,” Ophthalmol. Sci., vol. 4, no. 1, p. 100409, 2024. https://doi.org/10.1016/j.xops.2023.100409
  6. R. A. Dixit, C. L. Boxley, S. Samuel, V. Mohan, R. M. Ratwani, and J. A. Gold, “Electronic Health Record Use Issues and Diagnostic Error: A Scoping Review and Framework,” J. Patient Saf., vol. 19, no. 1, pp. E25–E30, 2023. https://doi.org/10.1097/PTS.0000000000001081
  7. M. R. Kale, A. H. Mutlag, S. P, N. H. Al-Muraad, H. S. Mahdi, and S. Muthuperumal, “AI Powered Decision Support Systems for Healthcare Enhancing Diagnosis and Treatment with Deep Learning,” in 2025 International Conference on Intelligent Computing and Knowledge Extraction (ICICKE), 2025, pp. 1–5. https://doi.org/10.1109/ICICKE65317.2025.11136681
  8. E. Hassan and C. E. Omenogor, “AI powered predictive healthcare: Deep learning for early diagnosis, personalized treatment, and disease prevention,” Int. J. Sci. Res. Arch., vol. 14, no. 3, pp. 806–823, 2025. https://doi.org/10.30574/ijsra.2025.14.3.0731
  9. A. Jafar, N. Bibi, and R. A. Naqvi, “Revolutionizing agriculture with arti fi cial intelligence : plant disease detection methods , applications , and their limitations,” no. March, pp. 1–20, 2024. https://doi.org/10.3389/fpls.2024.1356260
  10. R. M. Shohel and S. Jeff, “AI in Healthcare: Transforming Patient Care through Predictive Analytics and Decision Support Systems,” J. Artif. Intell. Gen. Sci. ISSN3006-4023, vol. 1, no. 1, 2024. https://doi.org/10.60087/jaigs.v1i1.30
  11. M. A. Islam et al., “Harnessing Predictive Analytics: The Role of Machine Learning in Early Disease Detection and Healthcare Optimization,” J. Ecohumanism, vol. 4, no. 3, pp. 312–321, 2025. https://doi.org/10.62754/joe.v4i3.6642
  12. S E. Z. Snigdha, M. R. Hossain, and S. Mahabub, “AI-Powered Healthcare Tracker Development: Advancing Real-Time Patient Monitoring and Predictive Analytics Through Data-Driven Intelligence",” J. Comput. Sci. Technol. Stud., vol. 5, no. 4, pp. 229–239, 2023. https://doi.org/10.32996/jcsts.2023.5.4.24
  13. V Q. Niu, K. Chen, M. Li, P. Feng, Z. Bi, L. K. Q. Yan, Y. Zhang, C. H. Yin, C. Fei, J. Liu, T. Wang, Y. Wang, S. Chen, and B. Peng, “From text to multimodality: Exploring the evolution and impact of large language models in medical practice,” arXiv preprint arXiv:2410.01812, 2024. https://doi.org/10.48550/arXiv.2410.01812
  14. A S. Maity and M. J. Saikia, “Large language models in healthcare and medical applications: A review,” Bioengineering, vol. 12, no. 6, p. 631, 2025. https://doi.org/10.3390/bioengineering12060631
  15. R. Yang, T. F. Tan, W. Lu, A. J. Thirunavukarasu, D. S. W. Ting, and N. Liu, “Large language models in health care: Development, applications, and challenges,” Health Care Science, vol. 2, no. 4, pp. 255–263, 2023. https://doi.org/10.1002/hcs2.61
  16. G. Huang, Y. Li, S. Jameel, Y. Long, and G. Papanastasiou, “From explainable to interpretable deep learning for natural language processing in healthcare: How far from reality?,” Comput. Struct. Biotechnol. J., vol. 24, no. November 2023, pp. 362–373, 2024. https://doi.org/10.1016/j.csbj.2024.05.004
  17. F. Markowetz, “All models are wrong and yours are useless: making clinical prediction models impactful for patients,” npj Precis. Oncol., vol. 8, no. 1, pp. 6–8, 2024. https://doi.org/10.1038/s41698-024-00553-6
  18. K. Mavrogiorgos, A. Kiourtis, A. Mavrogiorgou, A. Menychtas, and D. Kyriazis, “Bias in Machine Learning: A Literature Review,” Appl. Sci., vol. 14, no. 19, 2024. https://doi.org/10.3390/ app14198860
  19. K. Ono and S. A. Lee, “Text Serialization and Their Relationship with the Conventional Paradigms of Tabular Machine Learning,” 2024. https://doi.org/10.48550/arXiv.2406.13846
  20. World Health Organization, “The top 10 causes of death,” 8 August 2024. [Online]. Available: https://www.who.int/news-room/fact-sheets/detail/the-top-10-causes-of-death
  21. R. Guido, S. Ferrisi, D. Lofaro, and D. Conforti, “An Overview on the Advancements of Support Vector Machine Models in Healthcare Applications: A Review,” Inf., vol. 15, no. 4, 2024. https://doi.org/10.3390/info15040235
  22. K. Gohari et al., “A Bayesian latent class extension of naive Bayesian classifier and its application to the classification of gastric cancer patients,” BMC Med. Res. Methodol., vol. 23, no. 1, pp. 1–15, 2023. https://doi.org/10.1186/s12874-023-02013-4
  23. X. Tang et al., “A clinical diagnostic model based on an eXtreme Gradient Boosting algorithm to distinguish type 1 diabetes,” Ann. Transl. Med., vol. 9, no. 5, pp. 409–409, 2021. https://doi.org/10.21037/atm-20-7115
  24. S. A. Hicks et al., “On evaluation metrics for medical applications of artificial intelligence,” Sci. Rep., vol. 12, no. 1, pp. 1–9, 2022. https://doi.org/10.1038/s41598-022-09954-8
  25. S. Maity and M. J. Saikia, “Large Language Models in Healthcare and Medical Domain: A Review,” Informatics, vol. 11, no. 3, pp. 1–25, 2024, doi: https://doi.org/10.3390/informatics11030057.
  26. M. J. Schuemie et al., “Standardized patient profile review using large language models for case adjudication in observational research,” npj Digit. Med., vol. 8, no. 1, pp. 1–7, 2025. http://dx.doi.org/10.1038/s41746-025-01433-4
  27. A. Q. Xie, Q. Chen, A. Chen, and C. Peng, “Me-LLaMA : Medical Foundation Large Language Models for Comprehensive Text Analysis and Beyond,” pp. 1–21. https://doi.org/10.21203/rs.3.rs-5456223/v1
  28. K. Saab et al., “Capabilities of Gemini Models in Medicine,” pp. 1–58, 2024. https://doi.org/10.48550/arXiv.2404.18416
  29. S. Zhu, W. Hu, Z. Yang, J. Yan, and F. Zhang, “Qwen-2.5 Outperforms Other Large Language Models in the Chinese National Nursing Licensing Examination: Retrospective Cross-Sectional Comparative Study.,” JMIR Med. informatics, vol. 13, p. e63731, Jan. 2025. https://doi.org/10.2196/63731
  30. D. M. Anisuzzaman, J. G. Malins, P. A. Friedman, and Z. I. Attia, “Fine-Tuning Large Language Models for Specialized Use Cases,” Mayo Clin. Proc. Digit. Heal., vol. 3, no. 1, p. 100184, 2025. https://doi.org/10.1016/j.mcpdig.2024.11.005
  31. Y. Ge, Y. Guo, Y.-C. Yang, M. A. Al-Garadi, and A. Sarker, “Few-shot learning for medical text: A systematic review,” 2022. https://doi.org/10.48550/arXiv.2204.14081
  32. B. Neves et al., “Zero-shot learning for clinical phenotyping: Comparing LLMs and rule-based methods,” Comput. Biol. Med., vol. 192, no. PA, p. 110181, 2025. https://doi.org/10.1016/j.compbiomed.2025.110181
Read More

References


N. R. F. Collaboration, “Worldwide trends in diabetes prevalence and treatment from 1990 to 2022: a pooled analysis of 1108 population-representative studies with 141 million participants,” Lancet, vol. 404, no. 10467, pp. 2077–2093, 2024. https://doi.org/10.1016/S0140-6736(24)02317-1

H. Sung et al., “Global Cancer Statistics 2020: GLOBOCAN Estimates of Incidence and Mortality Worldwide for 36 Cancers in 185 Countries.,” CA. Cancer J. Clin., vol. 71, no. 3, pp. 209–249, May 2021. https://doi.org/10.3322/caac.21660

B. Shahim, C. J. Kapelios, G. Savarese, and L. H. Lund, “Global Public Health Burden of Heart Failure: An Updated Review.,” Card. Fail. Rev., vol. 9, p. e11, 2023. https://doi.org/10.15420/cfr.2023.05

T. N. Bogale et al., “Effect of electronic records on mortality among patients in hospital and primary healthcare settings: a systematic review and meta-analyses,” Front. Digit. Heal., vol. 6, no. June, pp. 1–14, 2024. https://doi.org/10.3389/fdgth.2024.1377826

T. S. Hwang, M. Thomas, M. Hribar, A. Chen, and E. White, “The Impact of Documentation Workflow on the Accuracy of the Coded Diagnoses in the Electronic Health Record,” Ophthalmol. Sci., vol. 4, no. 1, p. 100409, 2024. https://doi.org/10.1016/j.xops.2023.100409

R. A. Dixit, C. L. Boxley, S. Samuel, V. Mohan, R. M. Ratwani, and J. A. Gold, “Electronic Health Record Use Issues and Diagnostic Error: A Scoping Review and Framework,” J. Patient Saf., vol. 19, no. 1, pp. E25–E30, 2023. https://doi.org/10.1097/PTS.0000000000001081

M. R. Kale, A. H. Mutlag, S. P, N. H. Al-Muraad, H. S. Mahdi, and S. Muthuperumal, “AI Powered Decision Support Systems for Healthcare Enhancing Diagnosis and Treatment with Deep Learning,” in 2025 International Conference on Intelligent Computing and Knowledge Extraction (ICICKE), 2025, pp. 1–5. https://doi.org/10.1109/ICICKE65317.2025.11136681

E. Hassan and C. E. Omenogor, “AI powered predictive healthcare: Deep learning for early diagnosis, personalized treatment, and disease prevention,” Int. J. Sci. Res. Arch., vol. 14, no. 3, pp. 806–823, 2025. https://doi.org/10.30574/ijsra.2025.14.3.0731

A. Jafar, N. Bibi, and R. A. Naqvi, “Revolutionizing agriculture with arti fi cial intelligence : plant disease detection methods , applications , and their limitations,” no. March, pp. 1–20, 2024. https://doi.org/10.3389/fpls.2024.1356260

R. M. Shohel and S. Jeff, “AI in Healthcare: Transforming Patient Care through Predictive Analytics and Decision Support Systems,” J. Artif. Intell. Gen. Sci. ISSN3006-4023, vol. 1, no. 1, 2024. https://doi.org/10.60087/jaigs.v1i1.30

M. A. Islam et al., “Harnessing Predictive Analytics: The Role of Machine Learning in Early Disease Detection and Healthcare Optimization,” J. Ecohumanism, vol. 4, no. 3, pp. 312–321, 2025. https://doi.org/10.62754/joe.v4i3.6642

S E. Z. Snigdha, M. R. Hossain, and S. Mahabub, “AI-Powered Healthcare Tracker Development: Advancing Real-Time Patient Monitoring and Predictive Analytics Through Data-Driven Intelligence",” J. Comput. Sci. Technol. Stud., vol. 5, no. 4, pp. 229–239, 2023. https://doi.org/10.32996/jcsts.2023.5.4.24

V Q. Niu, K. Chen, M. Li, P. Feng, Z. Bi, L. K. Q. Yan, Y. Zhang, C. H. Yin, C. Fei, J. Liu, T. Wang, Y. Wang, S. Chen, and B. Peng, “From text to multimodality: Exploring the evolution and impact of large language models in medical practice,” arXiv preprint arXiv:2410.01812, 2024. https://doi.org/10.48550/arXiv.2410.01812

A S. Maity and M. J. Saikia, “Large language models in healthcare and medical applications: A review,” Bioengineering, vol. 12, no. 6, p. 631, 2025. https://doi.org/10.3390/bioengineering12060631

R. Yang, T. F. Tan, W. Lu, A. J. Thirunavukarasu, D. S. W. Ting, and N. Liu, “Large language models in health care: Development, applications, and challenges,” Health Care Science, vol. 2, no. 4, pp. 255–263, 2023. https://doi.org/10.1002/hcs2.61

G. Huang, Y. Li, S. Jameel, Y. Long, and G. Papanastasiou, “From explainable to interpretable deep learning for natural language processing in healthcare: How far from reality?,” Comput. Struct. Biotechnol. J., vol. 24, no. November 2023, pp. 362–373, 2024. https://doi.org/10.1016/j.csbj.2024.05.004

F. Markowetz, “All models are wrong and yours are useless: making clinical prediction models impactful for patients,” npj Precis. Oncol., vol. 8, no. 1, pp. 6–8, 2024. https://doi.org/10.1038/s41698-024-00553-6

K. Mavrogiorgos, A. Kiourtis, A. Mavrogiorgou, A. Menychtas, and D. Kyriazis, “Bias in Machine Learning: A Literature Review,” Appl. Sci., vol. 14, no. 19, 2024. https://doi.org/10.3390/ app14198860

K. Ono and S. A. Lee, “Text Serialization and Their Relationship with the Conventional Paradigms of Tabular Machine Learning,” 2024. https://doi.org/10.48550/arXiv.2406.13846

World Health Organization, “The top 10 causes of death,” 8 August 2024. [Online]. Available: https://www.who.int/news-room/fact-sheets/detail/the-top-10-causes-of-death

R. Guido, S. Ferrisi, D. Lofaro, and D. Conforti, “An Overview on the Advancements of Support Vector Machine Models in Healthcare Applications: A Review,” Inf., vol. 15, no. 4, 2024. https://doi.org/10.3390/info15040235

K. Gohari et al., “A Bayesian latent class extension of naive Bayesian classifier and its application to the classification of gastric cancer patients,” BMC Med. Res. Methodol., vol. 23, no. 1, pp. 1–15, 2023. https://doi.org/10.1186/s12874-023-02013-4

X. Tang et al., “A clinical diagnostic model based on an eXtreme Gradient Boosting algorithm to distinguish type 1 diabetes,” Ann. Transl. Med., vol. 9, no. 5, pp. 409–409, 2021. https://doi.org/10.21037/atm-20-7115

S. A. Hicks et al., “On evaluation metrics for medical applications of artificial intelligence,” Sci. Rep., vol. 12, no. 1, pp. 1–9, 2022. https://doi.org/10.1038/s41598-022-09954-8

S. Maity and M. J. Saikia, “Large Language Models in Healthcare and Medical Domain: A Review,” Informatics, vol. 11, no. 3, pp. 1–25, 2024, doi: https://doi.org/10.3390/informatics11030057.

M. J. Schuemie et al., “Standardized patient profile review using large language models for case adjudication in observational research,” npj Digit. Med., vol. 8, no. 1, pp. 1–7, 2025. http://dx.doi.org/10.1038/s41746-025-01433-4

A. Q. Xie, Q. Chen, A. Chen, and C. Peng, “Me-LLaMA : Medical Foundation Large Language Models for Comprehensive Text Analysis and Beyond,” pp. 1–21. https://doi.org/10.21203/rs.3.rs-5456223/v1

K. Saab et al., “Capabilities of Gemini Models in Medicine,” pp. 1–58, 2024. https://doi.org/10.48550/arXiv.2404.18416

S. Zhu, W. Hu, Z. Yang, J. Yan, and F. Zhang, “Qwen-2.5 Outperforms Other Large Language Models in the Chinese National Nursing Licensing Examination: Retrospective Cross-Sectional Comparative Study.,” JMIR Med. informatics, vol. 13, p. e63731, Jan. 2025. https://doi.org/10.2196/63731

D. M. Anisuzzaman, J. G. Malins, P. A. Friedman, and Z. I. Attia, “Fine-Tuning Large Language Models for Specialized Use Cases,” Mayo Clin. Proc. Digit. Heal., vol. 3, no. 1, p. 100184, 2025. https://doi.org/10.1016/j.mcpdig.2024.11.005

Y. Ge, Y. Guo, Y.-C. Yang, M. A. Al-Garadi, and A. Sarker, “Few-shot learning for medical text: A systematic review,” 2022. https://doi.org/10.48550/arXiv.2204.14081

B. Neves et al., “Zero-shot learning for clinical phenotyping: Comparing LLMs and rule-based methods,” Comput. Biol. Med., vol. 192, no. PA, p. 110181, 2025. https://doi.org/10.1016/j.compbiomed.2025.110181

Author biographies is not available.
Download this PDF file
PDF
Statistic
Read Counter : 135 Download : 6

Downloads

Download data is not yet available.

Quick Link

  • Author Guidelines
  • Download Manuscript Template
  • Peer Review Process
  • Editorial Board
  • Reviewer Acknowledgement
  • Aim and Scope
  • Publication Ethics
  • Licensing Term
  • Copyright Notice
  • Open Access Policy
  • Important Dates
  • Author Fees
  • Indexing and Abstracting
  • Archiving Policy
  • Scopus Citation Analysis
  • Statistic
  • Article Withdrawal

Meet Our Editorial Team

Ir. Amrul Faruq, M.Eng., Ph.D
Editor in Chief
Universitas Muhammadiyah Malang
Google Scholar Scopus
Prof. Robert Lis
Editorial Board
Wrocław University of Science and Technology
Orcid  Scopus
Hanung Adi Nugroho
Editorial Board
Universitas Gadjah Mada
Google Scholar Scopus
Prof. Roman Voliansky
Editorial Board
Dniprovsky State Technical University, Ukraine
Google Scholar Scopus
Read More
 

KINETIK: Game Technology, Information System, Computer Network, Computing, Electronics, and Control
eISSN : 2503-2267
pISSN : 2503-2259


Address

Program Studi Elektro dan Informatika

Fakultas Teknik, Universitas Muhammadiyah Malang

Jl. Raya Tlogomas 246 Malang

Phone 0341-464318 EXT 247

Contact Info

Principal Contact

Amrul Faruq
Phone: +62 812-9398-6539
Email: faruq@umm.ac.id

Support Contact

Fauzi Dwi Setiawan Sumadi
Phone: +62 815-1145-6946
Email: fauzisumadi@umm.ac.id

© 2020 KINETIK, All rights reserved. This is an open-access article distributed under the terms of the Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License