Information technology for gender recognition by voice

Diana Koshtura

Gender recognition from voice is a challenging problem in speech processing. This task involves extracting meaningful features from speech signals and classifying them into male or female categories. In this article, was implemented a gender recognition system using Python programming. I first recorded voice samples from both male and female speakers and extracted Mel-frequency cepstral coefficients (MFCC) as features. Then trained, a Support Vector Machine (SVM) classifier was on these features and evaluated its performance using accuracy, precision, recall, and F1-score metrics. These experiments demonstrated that proposed system should achieve high accuracy on the test set and will accurately predict the gender of a speaker based on their voice. I also explored using pre-trained models to reduce the need for large amounts of training data and found that they can provide good performance while requiring less computation. This study highlights the potential of using machine learning techniques for gender recognition from voice and can be extended to other speech processing applications.

gender recognition

python

Mel-frequency cepstral coefficients

support vector machine

machine learning

Balasubramanian, V., & Manikandan, M. S. (2018). Automatic Gender Recognition from Speech Using Machine Learning Techniques. International Journal of Engineering & Technology, 7(4.35), 116–119. https://doi.org/10.14419/ijet.v7i4.35.22005
Sethi, P., & Chandra, M. (2018). Gender Classification of Speakers using Mel Frequency Cepstral Coefficients and Support Vector Machine. International Journal of Advanced Research in Computer Science, 9(3), 129–133. https://doi.org/10.26483/ijarcs.v9i3.5507
Koshtura D. and Kunanets N. (2022). Information Sysem Project for Communication of Hearing Impaired Users, 2022 IEEE 17th International Conference on Computer Sciences and Information Technologies (CSIT), Lviv, 247–251. DOI: 10.1109/CSIT56902.2022.10000866.
Andrunyk V., Shestakevych T. and Koshtura D. (2021). The text analysis software for hearing-impaired persons, 2021 IEEE 16th International Conference on Computer Sciences and Information Technologies (CSIT), Lviv, Ukraine, 119–123. DOI: 10.1109/CSIT52700.2021.9648605.
Chen G., Li J., Li Y., and Li J. (2020). Gender classification using a fusion of MFCC and deep residual network features. IEEE Transactions on Affective Computing, Vol. 11, No. 4, 656–665, Oct.–Dec. 2020.
Huang X., Cai M., and Zhang Q. (2021). Gender recognition in noisy environments using convolutional neural networks. Journal of Ambient Intelligence and Humanized Computing, Vol. 12, No. 10, 10425–10438, Oct. 2021.
Srivastava, R., & Singh, N. (2016). A Study of Feature Extraction Techniques for Gender Recognition System. International Journal of Computer Science and Mobile Computing, 5(11), 15–21. http://www.ijcsmc.com/docs/papers/November2016/V5I11201602.pdf.
Librosa documentation: https://librosa.org/doc/latest/index.html.
Scikit-learn documentation: https://scikit-learn.org/stable/documentation.html.