Automatic Glottal Inverse Filtering with Non-Negative Matrix Factorization

Airaksinen, Manu; Juvela, Lauri; Bäckström, Tom; Alku, Paavo

doi:10.21437/Interspeech.2016-338

This study presents an automatic glottal inverse filtering (GIF) technique based on separating the effect of the glottal main excitation from the impulse response of the vocal tract. The proposed method is based on a non-negative matrix factorization (NMF) based decomposition of an ultra short-term spectrogram of the analyzed signal. Unlike other state-of-the-art GIF techniques, the proposed method does not require estimation of glottal closure instants.

The proposed method was objectively evaluated with two test sets of continuous synthetic speech created with a glottal vocoding analysis/synthesis procedure. When compared to a set of reference GIF methods, the proposed NMF technique shows improved estimation accuracy especially for male voices.

Automatic Glottal Inverse Filtering with Non-Negative Matrix Factorization

Manu Airaksinen, Lauri Juvela, Tom Bäckström, Paavo Alku