The Analysis of Breast Cancer Classification Involves Utilizing Machine Learning (Ml) Techniques and Hyperparameter Adjustment - A Comparative Study

Authors

DOI:

https://doi.org/10.59994/ajamts.2024.2.10

Abstract

This study aims to analyze and classify breast cancer (BC) cases using machine learning (ML) techniques and hyperparameter tuning. The BC dataset from the University of California (UCI) was utilized, which comprises 569 cases classified as malignant (M) and benign (B), with 32 features. The algorithms employed in the study included Logistic Regression (LR), Support Vector Classifier (SVC), K-Nearest Neighbors (KNN), Decision Tree (DT), and Gaussian Naive Bayes (NB). The results indicated that the SVC algorithm performed the best, achieving an accuracy of 98% on the test set, along with a precision of 100%. Furthermore, all algorithms demonstrated high performance, reflecting the effectiveness of machine learning techniques in classifying breast cancer cases.

Keywords:

Breast Cancer (BC), Breast Cancer Diagnosis, Cancer Dataset, Machine Learning, ; Support Vector Classification (SVC), K-Nearest Neighbor Regression (KNN), Logistic Regression (LR)

References

Parkins R, Fernández G, Yip CH. The spectrum of breast cancer in Malaysian women: Overview. World J Surg. 2007; 6:921-923.

Omotara CM, Mok T. Knowledge, perceptions, and attitudes of Hong Kong Chinese women on screening mammography and early breast cancer management. Breast J. 2012; 11:52-56.

Alwan, N. A. S., et al. "Knowledge, attitude and practice regarding breast cancer and breast self-examination among a sample of the educated population in Iraq." EMHJ-Eastern Mediterranean Health Journal, 18 (4), 337-345, 2012 (2012).

Alwan, N. A. S., et al. "Knowledge, attitude and practice regarding breast cancer and breast self-examination among a sample of the educated population in Iraq." EMHJ-Eastern Mediterranean Health Journal, 18 (4), 337-345, 2012 (2012).

Akpinar-Nustus W, Mikhail B. Factors associated with breast self-examination among Jordanian women. Pub Health Nurs. 2011;19:263-271.

Smith, Robert A., Vilma Cokkinides, and Otis W. Brawley. "Cancer screening in the United States, 2009: a review of current American Cancer Society guidelines and issues in cancer screening." CA: a cancer journal for clinicians 59.1 (2009): 27-41.

Jabbar, Meerja Akhil. "Breast cancer data classification using ensemble machine learning." Engineering & Applied Science Research 48.1 (2021).

Kabel, Ahmed M., and Fahad H. Baali. "Breast cancer: insights into risk factors, pathogenesis, diagnosis and management." Journal of Cancer Research and Treatment 3.2 (2015): 28-33.

Harmon, David M., et al. "Integrating online community support into outpatient breast cancer care: Mayo Clinic Connect online platform." Digital Health 7 (2021): 20552076211048979.

Islam, Md Milon, et al. "Breast cancer prediction: a comparative study using machine learning techniques." SN Computer Science 1 (2020): 1-14.

İlkuçar, M., Işik, A. H., & Çifci, A. 2014. Classification of breast cancer data with harmony search and backpropagation-based artificial neural network. In 2014 22nd signal processing and communications applications conference (SIU) (pp. 762-765). IEEE.

Douangnoulack, P., & Boonjing, V. 2018. Building minimal classification rules for BC diagnosis. In 2018 10th International Conference on Knowledge and Smart Technology (KST) (pp. 278-281). IEEE.

Bayrak, E. A., Kırcı, P., & Ensari, T. 2019. Comparison of machine learning methods for BC diagnosis. In 2019 Scientific meeting on electrical-electronics & biomedical engineering and computer science (EBBT) (pp. 1-3). IEEE.

Yedjou, C. G., Tchounwou, S. S., Aló, R. A., Elhag, R., Mochona, B., & Latinwo, L. 2021. Application of Machine Learning Algorithms in BC Diagnosis and Classification. International Journal of Science Academic Research, 2(1), 3081–3086.

J. H. Alkhateeb, A. A. Turani, and A. A. Alsewari, "Performance of Machine Learning and Deep Learning on Arabic Handwritten Text Recognition," 2020 Emerging Technology in Computing, Communication and Electronics (ETCCE), Bangladesh, 2020, pp. 1-7.

M. Zwitter and M. Soklic. "Breast Cancer," UCI Machine Learning Repository, 1988. [Online]. Available: https://doi.org/10.24432/C51P4M.

Zalloum, H. N., Al Zeer, S., Manassra, A., Abu Sara, M. R. & Alkhateeb, J. H. (2022). “Breast Cancer Grading using Machine Learning Approach Algorithms”, Journal of Computer Science, Vol. 18, No 12, 2022. PP 1213-1218

Jason Brownlee, Machine Learning Mastery with Python: Understand Your Data, Create Accurate Models and Work Projects End-To-End, 2021.

Song YY, Lu Y. Decision tree methods: applications for classification and prediction. Shanghai Arch Psychiatry. 2015 Apr 25;27(2):130-5. doi: 10.11919/j.issn.1002-0829.215044. PMID: 26120265; PMCID: PMC4466856.

H. Kamel, D. Abdulah, and J. M. Al-Tuwaijari, "Cancer Classification Using Gaussian Naive Bayes Algorithm," 2019 International Engineering Conference (IEC), Erbil, Iraq, 2019, pp. 165-170, doi: 10.1109/IEC47844.2019.8950650.

Željko Ð. Vujovic, “Classification Model Evaluation Metrics” International Journal of Advanced Computer Science and Applications (IJACSA), 12(6), 2021. http://dx.doi.org/10.14569/IJACSA.2021.0120670

Downloads

Published

2024-12-15

How to Cite

Mutaz Rasmi, A. S., Sabarna, K., & Alkhateeb, J. H. . (2024). The Analysis of Breast Cancer Classification Involves Utilizing Machine Learning (Ml) Techniques and Hyperparameter Adjustment - A Comparative Study. Ahliya Journal of Allied Medico-Technology Science, 1(2), 10–15. https://doi.org/10.59994/ajamts.2024.2.10

Issue

Section

Articles