An Approach to Design an Intelligent Parametric Synthesizer for Emotional Speech | SpringerLink
Skip to main content

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 328))

Abstract

Speech synthesizer is an artificial system to produce speech. But the generation of emotional speech is a difficult task. Though many researchers have been working on this area since a long period, still it is a challenging problem in terms of accuracy. The objective of our work is to design an intelligent model for emotional speech synthesis. An attempt is taken to compute such system using rule based fuzzy model. Initially the required parameters have been considered for the model and are extracted as features. The features are analyzed for each speech segment. At the synthesis level the model has been trained with these parameters properly. Next to it, it has been tested. The tested results show its performance.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
¥17,985 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
JPY 3498
Price includes VAT (Japan)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
JPY 22879
Price includes VAT (Japan)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
JPY 28599
Price includes VAT (Japan)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

Similar content being viewed by others

References

  1. Turk, O., Schröder, M., Bozkurt, B., Arslan, L.M.: Voice Quality Interpolation for Emotional Text-To-Speech Synthesis. In: Proc. Interspeech, Lisbon, Portugal, pp. 797–800 (2005)

    Google Scholar 

  2. Atal, B.S., Hanauer, S.L.: Speech Analysis and Synthesis by Linear Prediction of the Speech Wave. Bell Telephone Laboratories, Incorporaiai, Murray Hill (1971)

    Google Scholar 

  3. Raitio, T., Suni, A., Yamagishi, J., Pulakka, H., Nurminen, J., Vainio, M., Alku, P.: HMM-Based Speech Synthesis Utilizing Glottal Inverse Filtering. IEEE Trans. on Audio, Speech, and Language Processing 19(1) (January 2011)

    Google Scholar 

  4. Shannon, M.: Autoregressive Models for Statistical Parametric Speech Synthesis. IEEE Trans. on Audio, Speech, and language Processing 21(3) (March 2013)

    Google Scholar 

  5. Jiang, D.-N., Zhang, W., Shen, L.-Q., Cai, L.-H.: Prosody analysis and modeling for emotional speech synthesis. In: Proceeding of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2005), vol. 1 (2005)

    Google Scholar 

  6. Newton, P.S.R.: Review of methods of Speech Synthesis. EE Dept., IIT Bombay (November 2011)

    Google Scholar 

  7. Stylianou, Y.: Applying the Harmonic Plus Noise Model in Concatenative Speech Synthpesis. IEEE Trans. on Speech and Audio Processing 9(1) (January 2001)

    Google Scholar 

  8. Black, A.W.: Unit Selection and Emotional Speech”. In: Proceeding of the Eurospeech, Geneve (2003)

    Google Scholar 

  9. Tisljár-Szabó, E., Pléh, C.: Ascribing emotions depending on pause length in native and foreign language speech. Speech Communication 56, 35–48 (2014)

    Article  Google Scholar 

  10. Chabchoub, A., Cherif, A.: High Quality Arabic Concatenative Speech Synthesis. Signal & Image Processing: An International Journal (SIPIJ) 2(4) (December 2011)

    Google Scholar 

  11. Bhatlawande, S.N., Apte, S.D.: Emotion Generation using LPC Synthesis. International Journal on Recent and Innovation Trends in Computing and Communication 2(1), 128–134 (2014)

    Google Scholar 

  12. Takagi, T., Sugeno, M.: Fuzzy Identification of Systems and Its Applications to Modeling and Control. IEEE Trans. on Systems, Man, and Cybernetics smc-15(1) (January/February 1985)

    Google Scholar 

  13. Shrawankar, U., Thakare, V.: Parameters Optimization for Improving ASR Performance in Adverse Real World Noisy Environmental Conditions. International Journal of Human Computer Interaction (IJHCI) 3(3) (2012)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Soumya Smruti .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2015 Springer International Publishing Switzerland

About this paper

Cite this paper

Smruti, S., Sahoo, J., Dash, M., Mohanty, M.N. (2015). An Approach to Design an Intelligent Parametric Synthesizer for Emotional Speech. In: Satapathy, S., Biswal, B., Udgata, S., Mandal, J. (eds) Proceedings of the 3rd International Conference on Frontiers of Intelligent Computing: Theory and Applications (FICTA) 2014. Advances in Intelligent Systems and Computing, vol 328. Springer, Cham. https://doi.org/10.1007/978-3-319-12012-6_40

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-12012-6_40

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-12011-9

  • Online ISBN: 978-3-319-12012-6

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics