Seven Editora
##common.pageHeaderLogo.altText##
##common.pageHeaderLogo.altText##


Contact

  • Seven Publicações Ltda CNPJ: 43.789.355/0001-14 Rua: Travessa Aristides Moleta, 290- São José dos Pinhais/PR CEP: 83045-090
  • Principal Contact
  • Nathan Albano Valente
  • (41) 9 8836-2677
  • editora@sevenevents.com.br
  • Support Contact
  • contato@sevenevents.com.br

Comparison and selection of machine learning algorithms for diabetes prediction: An exploratory quantitative study based on medical data analysis

Santos VS

Vinicius de Souza Santos


Keywords

Machine Learning
Diabetes
Principal Component Analysis
Random Forest

Abstract

The global prevalence of diabetes is increasing at an alarming rate, making early and accurate detection a critical area of interest. This study employs Machine Learning techniques to predict the incidence of diabetes in a population of women from the Pima heritage, known for their predisposition to the disease. Using a database of diagnostic measures, multiple algorithms were applied, including Support Vector Machines (SVM), Artificial Neural Networks (ANN), K-Nearest Neighbors (KNN), Decision Trees, and Random Forest, to develop predictive models. Principal Component Analysis (PCA) was implemented for dimensionality reduction and highlighting of key diagnostic variables, optimizing algorithm performance. The results highlighted the superior- ity of the Random Forest, which showed higher accuracy and precision, suggesting its viability as a clinical diagnostic tool. This study contributes to the emerging field of artificial intelligence ap- plications in health, providing valuable insights for the prevention and early treatment of diabetes.

 

DOI:https://doi.org/10.56238/sevened2024.007-053


Creative Commons License

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.

Copyright (c) 2024 Vinicius de Souza Santos