Reconstruction of Speech Signals from Their Unpredictable Points Manifold

Khanagha, Vahid; Yahia, Hussein; Daoudi, Khalid; Pont, Oriol; Turiel, Antonio

doi:10.1007/978-3-642-25020-0_5

Vahid Khanagha²⁰,
Hussein Yahia²⁰,
Khalid Daoudi²⁰,
Oriol Pont²⁰ &
…
Antonio Turiel²¹

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 7015))

Included in the following conference series:

International Conference on Nonlinear Speech Processing

967 Accesses
1 Citations

Abstract

This paper shows that a microcanonical approach to complexity, such as the Microcanonical Multiscale Formalism, provides new insights to analyze non-linear dynamics of speech, specifically in relation to the problem of speech samples classification according to their information content. Central to the approach is the precise computation of Local Predictability Exponents (LPEs) according to a procedure based on the evaluation of the degree of reconstructibility around a given point. We show that LPEs are key quantities related to predictability in the framework of reconstructible systems: it is possible to reconstruct the whole speech signal by applying a reconstruction kernel to a small subset of points selected according to their LPE value. This provides a strong indication of the importance of the Unpredictable Points Manifold (UPM), already demonstrated for other types of complex signals. Experiments show that a UPM containing around 12% of the points provides very good perceptual reconstruction quality.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

¥17,985 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: JPY 3498; Price includes VAT (Japan)

eBook: JPY 5719; Price includes VAT (Japan)

Softcover Book: JPY 7149; Price includes VAT (Japan)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Arneodo, A., Argoul, F., Bacry, E., Elezgaray, J., Muzy, J.F.: Ondelettes, multifractales et turbulence. Diderot Editeur, Paris (1995)
Google Scholar
Boffetta, G., Cencini, M., Falcioni, M., Vulpiani, A.: Predictability: a way to characterize complexity. Physics Reports 356(6), 367–474 (2002), doi:10.1016/S0370-1573(01)00025-4
Article MathSciNet MATH Google Scholar
Frisch, U.: Turbulence: The legacy of A.N. Kolmogorov. Cambridge Univ. Press (1995)
Google Scholar
Hu, Y., Loizou, P.C.: Evaluation of objective quality measures for speech enhancement. IEEE Trans. Audio Speech Language Processing 16, 229–238 (2008)
Article Google Scholar
Kaiser, J.F.: Some observations on vocal tract operation from a fluid flow point of view. In: Titze, I.R., Scherer, R.C. (eds.) Vocal Fold Physiology: Biomechanics, Acoustics, and Phonatory Control, pp. 358–386. The Denver Center for the Performing Arts (1983)
Google Scholar
Khanagha, V., Daoudi, K., Pont, O., Yahia, H.: Improving text-independent phonetic segmentation based on the microcanonical multiscale formalism. In: Proceedings of the International Conference on Acoustics, Speech, and Signal Processing, ICASSP (2010)
Google Scholar
Kokkinos, I., Maragos, P.: Nonlinear speech analysis using models for chaotic systems. IEEE Transactions on Speech and Audio Processing 13(6), 1098–1109 (2005)
Article Google Scholar
Kubin, G.: Nonlinear processing of speech. Speech Coding and Synthesis, ch. 16. Elsevier (1995)
Google Scholar
Little, M., McSharry, P.E., Moroz, I., Roberts, S.: Testing the assumptions of linear prediction analysis in normal vowels. Journal of the Acoustical Society of America 119, 549–558 (2006)
Article Google Scholar
Mallat, S.: A Wavelet Tour of Signal Processing. Academic Press (1999)
Google Scholar
Maragos, P., Potamianos, A.: Fractal dimensions of speech sounds: Computation and application to automatic speech recognition. Journal of Acoustic Society of America 105, 1925–1932 (1999)
Article Google Scholar
Pont, O., Turiel, A., Pérez-Vicente, C.J.: Description, modeling and forecasting of data with optimal wavelets. Journal of Economic Interaction and Coordination 4(1), 39–54 (2009)
Article Google Scholar
Pont, O., Turiel, A., Perez-Vicente, C.: Empirical evidences of a common multifractal signature in economic, biological and physical systems. Physica A 388(10), 2025–2035 (2009)
Article Google Scholar
Pont, O., Turiel, A., Yahia, H.: An Optimized Algorithm for the Evaluation of Local Singularity Exponents in Digital Signals. In: Aggarwal, J.K., et al. (eds.) IWCIA 2011. LNCS, vol. 6636, pp. 346–357. Springer, Heidelberg (2011)
Chapter Google Scholar
Teager, H.M., Teager, S.M.: Evidence for nonlinear sound production mechanisms in the vocal tract. In: Hardcastle, W., Marchal, A. (eds.) Speech Production and Speech Modelling. NATO Advanced Study Institute Series D (1989)
Google Scholar
Turiel, A., del Pozo, A.: Reconstructing images from their most singular fractal manifold. IEEE Trans. on Im. Proc. 11, 345–350 (2002)
Article MathSciNet Google Scholar
Turiel, A., Pérez-Vicente, C., Grazzini, J.: Numerical methods for the estimation of multifractal singularity spectra on sampled data: A comparative study. Journal of Computational Physics 216(1), 362–390 (2006)
Article MathSciNet MATH Google Scholar
Turiel, A., Yahia, H., Vicente, C.P.: Microcanonical multifractal formalism: a geometrical approach to multifractal systems. part 1: singularity analysis. J. Phys. A, Math. Theor. 41, 015501 (2008)
Article MATH Google Scholar

Download references

Author information

Authors and Affiliations

INRIA Bordeaux Sud-Ouest (GEOSTAT team), 351 Cours de la Liberation, 33405, Talence cedex, France
Vahid Khanagha, Hussein Yahia, Khalid Daoudi & Oriol Pont
Institut de Ciències del MAR, ICM - CSIC, Passeig de la barceloneta 37-49, 08003, Barcelona, Spain
Antonio Turiel

Authors

Vahid Khanagha
View author publications
You can also search for this author in PubMed Google Scholar
Hussein Yahia
View author publications
You can also search for this author in PubMed Google Scholar
Khalid Daoudi
View author publications
You can also search for this author in PubMed Google Scholar
Oriol Pont
View author publications
You can also search for this author in PubMed Google Scholar
Antonio Turiel
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Institute for Technological Development and Innovation in Communications (IDETIC), Signals and Communications Department, University of Las Palmas de Gran Canaria, Campus de Tafira, s/n, 35017, Las Palmas de Gran Canaria, Spain
Carlos M. Travieso-González & Jesús B. Alonso-Hernández &

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Khanagha, V., Yahia, H., Daoudi, K., Pont, O., Turiel, A. (2011). Reconstruction of Speech Signals from Their Unpredictable Points Manifold. In: Travieso-González, C.M., Alonso-Hernández, J.B. (eds) Advances in Nonlinear Speech Processing. NOLISP 2011. Lecture Notes in Computer Science(), vol 7015. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-25020-0_5

Download citation

DOI: https://doi.org/10.1007/978-3-642-25020-0_5
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-25019-4
Online ISBN: 978-3-642-25020-0
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics