{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,3,11]],"date-time":"2025-03-11T04:30:26Z","timestamp":1741667426987,"version":"3.38.0"},"reference-count":34,"publisher":"SAGE Publications","issue":"3","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["KES"],"published-print":{"date-parts":[[2021,11,10]]},"abstract":"The speaker identification in Teleconferencing scenario, it is important to address whether a particular speaker is a part of a conference or not and to note that whether a particular speaker is spoken at the meeting or not. The feature vectors are extracted using MFCC-SDC-LPC. The Generalized Gamma Distribution is used to model the feature vectors. K-means algorithm is utilized to cluster the speech data. The test speaker is to be verified that he\/she is a participant in the conference. A conference database is generated with 50 speakers. In order to test the model, 20 different speakers not belonging to the conference are also considered. The efficiency of the model developed is compared using various measures such as AR, FAR and MDR. And the system is tested by varying number of speakers in the conference. The results show that the model performs more robustly.<\/jats:p>","DOI":"10.3233\/kes-210079","type":"journal-article","created":{"date-parts":[[2021,11,16]],"date-time":"2021-11-16T17:27:41Z","timestamp":1637083661000},"page":"357-365","source":"Crossref","is-referenced-by-count":0,"title":["Machine hearing system for teleconference authentication with effective speech analysis"],"prefix":"10.1177","volume":"25","author":[{"given":"T.V.","family":"Madhusudhana Rao","sequence":"first","affiliation":[{"name":"Department of CSE, Vignan\u2019s Institute of Information Technology, Visakhapatnam, India"}]},{"given":"Suribabu","family":"Korada","sequence":"additional","affiliation":[{"name":"Scientist, NSTL, Visakhapatnam, India"}]},{"given":"Y.","family":"Srinivas","sequence":"additional","affiliation":[{"name":"Department of IT, GITAM University, Visakhapatnam, India"}]}],"member":"179","reference":[{"key":"10.3233\/KES-210079_ref1","first-page":"1","article-title":"A comparative study of MFCC-KNN and LPC-KNN for hijaiyyah letters pronounciation classification system","author":"Adiwijaya","year":"2017","journal-title":"2017 5th International Conference on Information and Communication Technology (ICoIC7)"},{"key":"10.3233\/KES-210079_ref2","doi-asserted-by":"crossref","first-page":"81","DOI":"10.1109\/ICSTM.2015.7225394","article-title":"Effective preprocessing of speech and acoustic features extraction for spoken language identification","author":"Kumar","year":"2015","journal-title":"Smart Technologies and Management for Computing, Communication, Controls, Energy and Materials (ICSTM)"},{"key":"10.3233\/KES-210079_ref3","doi-asserted-by":"crossref","first-page":"3755","DOI":"10.1109\/ICEEOT.2016.7755413","article-title":"Improved MFCC and LPC algorithm for bundelkhandi isolated digit speech recognition","author":"Dixit","year":"2016","journal-title":"2016 International Conference on Electrical, Electronics, and Optimization Techniques (ICEEOT)"},{"issue":"1","key":"10.3233\/KES-210079_ref4","first-page":"33","article-title":"Speaker identification and verification using vector quantization and mel frequency cepstral coefficients","volume":"4","author":"Srinivasan","year":"2012","journal-title":"Journal of Applied Sciences, Engineering, and Technology"},{"key":"10.3233\/KES-210079_ref5","first-page":"1424","article-title":"Feature extraction modeling & training strategies in continuous speech recognition for roman language","author":"Corneliu\u00a0Octavian","year":"2005","journal-title":"EU Proceedings of IEEE Xplore, EUROCN"},{"key":"10.3233\/KES-210079_ref6","first-page":"96","article-title":"Channel\/handset mismatch evaluation in a biometric speaker verification using shifted delta cepstral features","volume":"4756","author":"Calvo","year":"2007","journal-title":"CIARP 2007, LNCS"},{"key":"10.3233\/KES-210079_ref7","first-page":"1","article-title":"Automatic language identification for seven indian languages using higher level features","author":"Madhu","year":"2017","journal-title":"Signal Processing, Informatics, Communication, and Energy Systems (SPICES)"},{"key":"10.3233\/KES-210079_ref8","unstructured":"D.R. Gonzalez and J.R.C. de\u00a0Lara, Speaker verification with shifted delta cepstral features: Its pseudo-prosodic behaviour, proc I Iberian SLTech, 2009."},{"key":"10.3233\/KES-210079_ref9","doi-asserted-by":"crossref","first-page":"4416","DOI":"10.1109\/ICASSP.2011.5947333","article-title":"Speaker diarization of meetings based on speaker role n-gram models","author":"Valente","year":"2011","journal-title":"2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)"},{"issue":"1","key":"10.3233\/KES-210079_ref10","doi-asserted-by":"crossref","first-page":"6","DOI":"10.1521\/pedi.2009.23.1.6","article-title":"Emotion recognition in borderline personality disorder\u00a0\u2013 a review of the literature","volume":"23","author":"Domes","year":"2009","journal-title":"Journal of Personality Disorders"},{"key":"10.3233\/KES-210079_ref11","unstructured":"D.R. Gonz\u00e1lez, S.C. Arias and J.R.C. de\u00a0Lara, Single channel speech enhancement based on zero phase transformation in reverberated environments, The Proceedings of REVERB Challenge, 2014."},{"issue":"1","key":"10.3233\/KES-210079_ref12","doi-asserted-by":"crossref","first-page":"15","DOI":"10.1109\/LSP.2012.2227312","article-title":"Shifted-delta MLP features for spoken language recognition","volume":"20","author":"Wang","year":"2013","journal-title":"IEEE Signal Processing Letters"},{"issue":"1","key":"10.3233\/KES-210079_ref13","doi-asserted-by":"crossref","first-page":"177","DOI":"10.1109\/MSP.2011.943131","article-title":"Trends in speech and language processing","volume":"29","author":"Freng","year":"2011","journal-title":"IEEE Signal Processing Magazine"},{"key":"10.3233\/KES-210079_ref14","unstructured":"J. Razik, C.S. Enac, D. Fohr, O. Mella and N. Parlangeau-Valles, Comparision of two speech\/Music segmentation systems for audio indexing on Web, in: Proc WMSCI\u201903, Florida, USA, 2003."},{"key":"10.3233\/KES-210079_ref15","first-page":"96","article-title":"Channel\/handset mismatch evaluation in biometric speaker verification using shifted delta cepstral features","author":"Calvo","journal-title":"Proc of CIARP 2007, LNCS 4756"},{"key":"10.3233\/KES-210079_ref16","first-page":"1","article-title":"Speaker identification and verification of noisy speech using multitaper MFCC and Gaussian Mixture models","author":"Veena","year":"2015","journal-title":"2015 International Conference on Power, Instrumentation, Control and Computing (PICC)"},{"issue":"7","key":"10.3233\/KES-210079_ref17","first-page":"137","article-title":"Speech recognition and verification using MFCC & VQ","volume":"1","author":"Patel","year":"2013","journal-title":"Int J Emerg Sci Eng (IJESE)"},{"key":"10.3233\/KES-210079_ref18","doi-asserted-by":"crossref","first-page":"4898","DOI":"10.1109\/ICMLC.2005.1527805","article-title":"Speech emotion recognition based on HMM and SVM","volume":"8","author":"Lin","year":"2005","journal-title":"2005 International Conference on Machine Learning and Cybernetics"},{"issue":"9\u201310","key":"10.3233\/KES-210079_ref19","doi-asserted-by":"crossref","first-page":"1172","DOI":"10.1016\/j.specom.2011.01.007","article-title":"Application of speaker- and language identification state-of-the-art techniques for emotion recognition","volume":"53","author":"Kockmann","year":"2011","journal-title":"Speech Communication"},{"key":"10.3233\/KES-210079_ref20","first-page":"1","article-title":"Speaker\u2019s gender classification and segmentation using spectral and cepstral feature averaging","author":"Kos","year":"2011","journal-title":"2011 18th International Conference on Systems, Signals and Image Processing"},{"key":"10.3233\/KES-210079_ref21","first-page":"437","article-title":"Robust features for emotion recognition from speech by using Gaussian mixture model classification","author":"Navyasri","year":"2017","journal-title":"International Conference on Information and Communication Technology for Intelligent Systems"},{"key":"10.3233\/KES-210079_ref22","doi-asserted-by":"crossref","unstructured":"N. Singh, R.A. Khan and R. Shree, MFCC and prosodic feature extraction techniques: A comparative study, International Journal of Computer Applications 54(1) (2012).","DOI":"10.5120\/8529-2061"},{"issue":"8","key":"10.3233\/KES-210079_ref23","doi-asserted-by":"crossref","first-page":"21","DOI":"10.9790\/3021-04812125","article-title":"An approach to extract feature using MFCC","volume":"4","author":"Singh","year":"2014","journal-title":"IOSR Journal of Engineering"},{"key":"10.3233\/KES-210079_ref24","first-page":"89","article-title":"Approches to language identification using gausian mixture models and shifted delta cepstral features","author":"Torres-Carrasquillo","year":"2002","journal-title":"Proc of ICSLP"},{"key":"10.3233\/KES-210079_ref25","doi-asserted-by":"crossref","first-page":"3391","DOI":"10.1016\/j.proeng.2012.06.392","article-title":"Identification of language using mel-frequency cepstral coefficients (MFCC)","volume":"38","author":"Koolagudi","year":"2012","journal-title":"Procedia Engineering"},{"issue":"3","key":"10.3233\/KES-210079_ref26","doi-asserted-by":"crossref","first-page":"544","DOI":"10.1016\/j.dsp.2011.11.008","article-title":"A hierarchical language identification system for Indian languages","volume":"22","author":"Jothilakshmi","year":"2012","journal-title":"Digital Signal Processing"},{"key":"10.3233\/KES-210079_ref27","doi-asserted-by":"crossref","first-page":"138","DOI":"10.1016\/j.specom.2015.04.005","article-title":"Mean Hilbert envelope coefficients (MHEC) for robust speaker and language identification","volume":"72","author":"Sadjadi","year":"2015","journal-title":"Speech Communication"},{"issue":"6","key":"10.3233\/KES-210079_ref28","first-page":"479","article-title":"Review of feature extraction techniques in automatic speech recognition","volume":"2","author":"Shanthi","year":"2013","journal-title":"International Journal of Scientific Engineering and Technology"},{"issue":"10","key":"10.3233\/KES-210079_ref29","first-page":"5150","article-title":"Prosodic feature based text dependent speaker recognition using machine learning algorithms","volume":"2","author":"Agrawal","year":"2010","journal-title":"International Journal of Engineering Science and Technology"},{"issue":"4","key":"10.3233\/KES-210079_ref30","first-page":"21","article-title":"An efficient speech recognition system","volume":"3","author":"Swamy","year":"2013","journal-title":"Computer Science & Engineering: An International Journal (CSEIJ)"},{"key":"10.3233\/KES-210079_ref31","first-page":"547","article-title":"Temporal discrete cosine transform: Towards longer term temporal features for speaker verification","author":"Kinnunen","year":"2006","journal-title":"Proc Fifth Internat Symposium on Chinese Spoken Language Processing (ISCSLP 2006)"},{"key":"10.3233\/KES-210079_ref32","first-page":"89","article-title":"Approches to language identification using gausian mixture models and shifted delta cepstral features","author":"Torres-Carrasquillo","year":"2002","journal-title":"Proc of ICSLP"},{"key":"10.3233\/KES-210079_ref33","doi-asserted-by":"crossref","first-page":"231","DOI":"10.1016\/j.procs.2015.06.027","article-title":"Significance of GMMUBM based modelling for indian language identification","volume":"54","author":"Kumar","year":"2015","journal-title":"Procedia Computer Science"},{"key":"10.3233\/KES-210079_ref34","doi-asserted-by":"crossref","unstructured":"X. Anguera, S. Bozonnet, N.W.D. Evans, C. Fredouille, G. Friedland and O. Vinyals, Speaker diarization: A review of recent research, IEEE Transactions on Audio, Speech, and Language Processing 20(2) (2012).","DOI":"10.1109\/TASL.2011.2125954"}],"container-title":["International Journal of Knowledge-based and Intelligent Engineering Systems"],"original-title":[],"link":[{"URL":"https:\/\/content.iospress.com\/download?id=10.3233\/KES-210079","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,3,10]],"date-time":"2025-03-10T19:11:03Z","timestamp":1741633863000},"score":1,"resource":{"primary":{"URL":"https:\/\/journals.sagepub.com\/doi\/full\/10.3233\/KES-210079"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,11,10]]},"references-count":34,"journal-issue":{"issue":"3"},"URL":"https:\/\/doi.org\/10.3233\/kes-210079","relation":{},"ISSN":["1327-2314","1875-8827"],"issn-type":[{"type":"print","value":"1327-2314"},{"type":"electronic","value":"1875-8827"}],"subject":[],"published":{"date-parts":[[2021,11,10]]}}}