Hybrid Machine Learning Framework for Clinical Diagnosis of Polycystic Ovary Syndrome
Abstract
Polycystic Ovary Syndrome (PCOS) is a common hormonal disorder in women that is difficult to diagnose at an early stage because its symptoms vary and traditional diagnostic methods have limitations. Early detection is important for preventing long-term health complications. This study proposes a hybrid machine learning model for PCOS detection using clinical and hormonal data. A dataset of 541 patient records was analysed. Missing values were imputed using K-Nearest Neighbour (KNN) imputation, and the most relevant features were selected using Mutual Information. To address class imbalance, the Synthetic Minority Over-sampling Technique (SMOTE) was applied to the training data. Individual machine learning models were evaluated first, and based on their performance a hybrid model was built using weighted soft voting over Gaussian Naïve Bayes, Logistic Regression, and Random Forest. The experimental results suggest that the hybrid model achieves a better balance of accuracy, precision, and recall than any of the individual models, which makes it a more reliable approach for predicting PCOS. This work is the first phase of a larger research effort and lays the groundwork for future studies that will combine these findings with ultrasound image analysis.
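The pipeline described in the abstract can be sketched roughly as follows. This is a minimal illustration using scikit-learn, not the authors' implementation. The synthetic data, the class ratio, the number of selected features (`k=10`), the voting weights (`[1, 2, 2]`), and the helper `smote_like_oversample` are all assumptions introduced for the example. The helper is a simplified stand-in for SMOTE (in practice a library implementation such as the one in imbalanced-learn would typically be used).

```python
from functools import partial

import numpy as np
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier, VotingClassifier
from sklearn.feature_selection import SelectKBest, mutual_info_classif
from sklearn.impute import KNNImputer
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split
from sklearn.naive_bayes import GaussianNB
from sklearn.neighbors import NearestNeighbors
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler


def smote_like_oversample(X, y, k=5, seed=0):
    """Hypothetical helper: simplified SMOTE-style oversampling (binary case).

    New minority rows are interpolated between a minority sample and one of
    its k nearest minority neighbours until both classes have equal size.
    """
    rng = np.random.default_rng(seed)
    classes, counts = np.unique(y, return_counts=True)
    minority = classes[np.argmin(counts)]
    n_new = counts.max() - counts.min()
    if n_new == 0:
        return X, y
    X_min = X[y == minority]
    # Ask for k + 1 neighbours because each point is its own nearest neighbour.
    nn = NearestNeighbors(n_neighbors=k + 1).fit(X_min)
    neigh = nn.kneighbors(X_min, return_distance=False)[:, 1:]
    base = rng.integers(0, len(X_min), size=n_new)
    pick = neigh[base, rng.integers(0, k, size=n_new)]
    gap = rng.random((n_new, 1))
    X_new = X_min[base] + gap * (X_min[pick] - X_min[base])
    return np.vstack([X, X_new]), np.concatenate([y, np.full(n_new, minority)])


# Synthetic stand-in for the 541-record dataset (assumed class ratio).
X, y = make_classification(n_samples=541, n_features=20, n_informative=8,
                           weights=[0.67, 0.33], random_state=0)
X[np.random.default_rng(0).random(X.shape) < 0.05] = np.nan  # inject gaps

X_tr, X_te, y_tr, y_te = train_test_split(
    X, y, test_size=0.2, stratify=y, random_state=0)

# 1) KNN imputation, fitted on the training split only.
imputer = KNNImputer(n_neighbors=5).fit(X_tr)
X_tr_imp = imputer.transform(X_tr)

# 2) Mutual Information feature selection (k=10 is an assumed value).
selector = SelectKBest(partial(mutual_info_classif, random_state=0), k=10)
X_tr_sel = selector.fit_transform(X_tr_imp, y_tr)
X_te_sel = selector.transform(imputer.transform(X_te))

# 3) Oversample the minority class in the training data only.
X_bal, y_bal = smote_like_oversample(X_tr_sel, y_tr)

# 4) Weighted soft voting over the three base models (weights are assumed).
weights = [1, 2, 2]
hybrid = VotingClassifier(
    estimators=[
        ("gnb", GaussianNB()),
        ("lr", make_pipeline(StandardScaler(),
                             LogisticRegression(max_iter=1000))),
        ("rf", RandomForestClassifier(n_estimators=200, random_state=0)),
    ],
    voting="soft",
    weights=weights,
)
hybrid.fit(X_bal, y_bal)
proba = hybrid.predict_proba(X_te_sel)  # weighted mean of class probabilities
print("test accuracy:", hybrid.score(X_te_sel, y_te))
```

With soft voting, the predicted class is the one with the highest weighted average probability across the three models, so a better-performing model can be given more influence through its weight. Fitting the imputer and feature selector on the training split and oversampling only after the split keeps synthetic or test-derived information out of the evaluation.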
This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.