A Machine Learning Approach for Breast Cancer Detection using Random Forest Algorithm

Authors

  • K. V. Shiny Assistant Professor, School of Computing, Department of Computer Science and Engineering, Bharath Institute of Higher Education and Research, Chennai, India
  • Aman Kumar Ajnabi Student, School of Computing, Department of Computer Science and Engineering, Bharath Institute of Higher Education and Research, Chennai, India
  • Anand Kumar Student, School of Computing, Department of Computer Science and Engineering, Bharath Institute of Higher Education and Research, Chennai, India
  • Bhagwan Kumar Singh Student, School of Computing, Department of Computer Science and Engineering, Bharath Institute of Higher Education and Research, Chennai, India
  • Ankit Gupta Student, School of Computing, Department of Computer Science and Engineering, Bharath Institute of Higher Education and Research, Chennai, India

DOI:

https://doi.org/10.5281/zenodo.10944608

Keywords:

Medical Diagnosis, Random Forest Algorithm, Eigenvalues, Decision Tree, OpenCV, Logistic Regression, Mammography, Feature Extraction, Segmentation, Malignant, Benign

Abstract

Breast cancer cells continues to be a considerable worldwide health and wellness worry needing precise along with prompt discovery techniques for boosted individual results. This research study recommends an integrative artificial intelligence structure for bust cancer cells discovery leveraging the Random Forest formula, Decision Tree, Logistic Regression version, together with OpenCV for photo handling. The technique entails several phases. To start with electronic mammography pictures are preprocessed making use of OpenCV to improve function removal and also alleviate sound artifacts. Following a function choice procedure is used to recognize pertinent photo includes vital for category. Ultimately 3 distinct artificial intelligence formulas are used: Random Forest, Decision Tree as well as Logistic Regression. These formulas are educated together with confirmed making use of a detailed dataset consisting of mammography photos with connected ground fact tags showing malignant or non-cancerous areas. The Random Forest formula harnesses the power of set finding out, accumulating the outcomes of numerous choice trees to boost category precision plus strength. Decision Tree designs are utilized for their interpretability along with simpleness supplying understandings right into the underlying decision-making procedure. Logistic Regression an extensively made use of straight classifier, supplies a probabilistic analysis of the chance of boob cancer cells event based upon input functions. The efficiency of each formula is carefully examined utilizing metrics such as precision, level of sensitivity, specificity as well as location under the receiver operating attribute contour (AUC-ROC) to analyze category efficiency and also generalization capability. These algorithms are trained on the extracted feature set to learn the patterns indicative of malignant and benign of breast tissues. Random Forest Algorithm provides high accuracy due to its ensemble nature. The performance is evaluated using standard metrices such as accuracy, sensitivity, specificity, and area under the receiver operating characteristics curve (AUC-ROC).

Downloads

Download data is not yet available.

Downloads

Published

08-04-2024

Issue

Section

Articles

How to Cite

[1]
K. V. Shiny, A. K. Ajnabi, A. Kumar, B. K. Singh, and A. Gupta, “A Machine Learning Approach for Breast Cancer Detection using Random Forest Algorithm”, IJRESM, vol. 7, no. 4, pp. 14–18, Apr. 2024, doi: 10.5281/zenodo.10944608.