Abstract
Speech synthesizer is an artificial system to produce speech. But the generation of emotional speech is a difficult task. Though many researchers have been working on this area since a long period, still it is a challenging problem in terms of accuracy. The objective of our work is to design an intelligent model for emotional speech synthesis. An attempt is taken to compute such system using rule based fuzzy model. Initially the required parameters have been considered for the model and are extracted as features. The features are analyzed for each speech segment. At the synthesis level the model has been trained with these parameters properly. Next to it, it has been tested. The tested results show its performance.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Turk, O., Schröder, M., Bozkurt, B., Arslan, L.M.: Voice Quality Interpolation for Emotional Text-To-Speech Synthesis. In: Proc. Interspeech, Lisbon, Portugal, pp. 797–800 (2005)
Atal, B.S., Hanauer, S.L.: Speech Analysis and Synthesis by Linear Prediction of the Speech Wave. Bell Telephone Laboratories, Incorporaiai, Murray Hill (1971)
Raitio, T., Suni, A., Yamagishi, J., Pulakka, H., Nurminen, J., Vainio, M., Alku, P.: HMM-Based Speech Synthesis Utilizing Glottal Inverse Filtering. IEEE Trans. on Audio, Speech, and Language Processing 19(1) (January 2011)
Shannon, M.: Autoregressive Models for Statistical Parametric Speech Synthesis. IEEE Trans. on Audio, Speech, and language Processing 21(3) (March 2013)
Jiang, D.-N., Zhang, W., Shen, L.-Q., Cai, L.-H.: Prosody analysis and modeling for emotional speech synthesis. In: Proceeding of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2005), vol. 1 (2005)
Newton, P.S.R.: Review of methods of Speech Synthesis. EE Dept., IIT Bombay (November 2011)
Stylianou, Y.: Applying the Harmonic Plus Noise Model in Concatenative Speech Synthpesis. IEEE Trans. on Speech and Audio Processing 9(1) (January 2001)
Black, A.W.: Unit Selection and Emotional Speech”. In: Proceeding of the Eurospeech, Geneve (2003)
Tisljár-Szabó, E., Pléh, C.: Ascribing emotions depending on pause length in native and foreign language speech. Speech Communication 56, 35–48 (2014)
Chabchoub, A., Cherif, A.: High Quality Arabic Concatenative Speech Synthesis. Signal & Image Processing: An International Journal (SIPIJ) 2(4) (December 2011)
Bhatlawande, S.N., Apte, S.D.: Emotion Generation using LPC Synthesis. International Journal on Recent and Innovation Trends in Computing and Communication 2(1), 128–134 (2014)
Takagi, T., Sugeno, M.: Fuzzy Identification of Systems and Its Applications to Modeling and Control. IEEE Trans. on Systems, Man, and Cybernetics smc-15(1) (January/February 1985)
Shrawankar, U., Thakare, V.: Parameters Optimization for Improving ASR Performance in Adverse Real World Noisy Environmental Conditions. International Journal of Human Computer Interaction (IJHCI) 3(3) (2012)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2015 Springer International Publishing Switzerland
About this paper
Cite this paper
Smruti, S., Sahoo, J., Dash, M., Mohanty, M.N. (2015). An Approach to Design an Intelligent Parametric Synthesizer for Emotional Speech. In: Satapathy, S., Biswal, B., Udgata, S., Mandal, J. (eds) Proceedings of the 3rd International Conference on Frontiers of Intelligent Computing: Theory and Applications (FICTA) 2014. Advances in Intelligent Systems and Computing, vol 328. Springer, Cham. https://doi.org/10.1007/978-3-319-12012-6_40
Download citation
DOI: https://doi.org/10.1007/978-3-319-12012-6_40
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-12011-9
Online ISBN: 978-3-319-12012-6
eBook Packages: EngineeringEngineering (R0)