This paper presents a particle filter approach to spectral amplitude speech enhancement. Spectral amplitudes are known to exhibit inter-frame dependencies and non-Gaussian statistics; however, incorporating these properties makes closed-form solutions intractable. Using the particle filter framework allows the presented algorithm to model the speech spectral amplitudes as an autoregressive process with Laplace-distributed excitation. Two variants of the standard algorithm are also presented: one that uses an interacting multiple model approach to account for transitions between active speech and silence intervals, and one that allows for phase differences between the clean speech and noise complex Fourier transform coefficients. All of the particle sampling distributions are constrained to take the measurement into account, improving sampling efficiency. In experiments using wideband speech and real recorded noise, the proposed algorithm variants are shown to offer natural-sounding output speech, with objective evaluation results that compare favorably to existing particle filter speech enhancement algorithms. The multiple model variant is found to improve inter-speech noise reduction, while the phase variant improves performance when the signal-to-noise ratio is low.
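As a rough illustration of the modeling idea described above, the following minimal sketch tracks a single spectral amplitude with a generic bootstrap particle filter. It is not the paper's algorithm. Everything in it is an assumption made for illustration: the first-order autoregressive model, the coefficient `a`, the Laplace scale, the additive Gaussian measurement model, and the reflection used to keep amplitudes non-negative. In particular, it samples from the transition prior rather than from the measurement-constrained sampling distributions the abstract mentions, and it includes neither the multiple model variant nor the phase variant.

```python
import math
import random


def sample_laplace(rng, scale):
    """Draw a zero-mean Laplace sample as the difference of two exponentials."""
    return rng.expovariate(1.0 / scale) - rng.expovariate(1.0 / scale)


def particle_filter(observations, a=0.9, excitation_scale=0.1,
                    noise_std=0.3, num_particles=500, seed=0):
    """Bootstrap particle filter for one amplitude track (illustrative only).

    Assumed state model:       x[t] = |a * x[t-1] + e[t]|,  e[t] ~ Laplace
    Assumed measurement model: y[t] = x[t] + v[t],          v[t] ~ Gaussian
    All parameter values are hypothetical, not taken from the paper.
    """
    rng = random.Random(seed)
    # Initialise particles around the first observed amplitude.
    particles = [abs(observations[0] + rng.gauss(0.0, noise_std))
                 for _ in range(num_particles)]
    estimates = []
    for y in observations:
        # Propagate each particle through the AR(1) model with Laplace
        # excitation; abs() reflects negatives so amplitudes stay >= 0.
        particles = [abs(a * x + sample_laplace(rng, excitation_scale))
                     for x in particles]
        # Weight by the Gaussian likelihood, computed in the log domain
        # and shifted by the maximum for numerical stability.
        log_w = [-0.5 * ((y - x) / noise_std) ** 2 for x in particles]
        max_log_w = max(log_w)
        weights = [math.exp(lw - max_log_w) for lw in log_w]
        total = sum(weights)
        weights = [w / total for w in weights]
        # The posterior-mean estimate is the weighted particle average.
        estimates.append(sum(w * x for w, x in zip(weights, particles)))
        # Multinomial resampling to counter weight degeneracy.
        particles = rng.choices(particles, weights=weights, k=num_particles)
    return estimates


if __name__ == "__main__":
    noisy_amplitudes = [0.5, 0.4, 0.6, 0.3, 0.2]  # made-up example values
    print(particle_filter(noisy_amplitudes, num_particles=200, seed=1))
```

Because the sketch draws particles from the transition prior, many particles can land far from the measurement when the noise level is low. Constraining the sampling distribution with the measurement, as the abstract describes, is one way to reduce that waste.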

Additional Metadata
Keywords: Noise reduction, Particle filters, Speech enhancement
Persistent URL: dx.doi.org/10.1109/TASL.2010.2042127
Journal: IEEE Transactions on Audio, Speech and Language Processing
Citation
Laska, B.N.M. (Brady N. M.), Bolić, M. (Miodrag), & Goubran, R. (2010). Particle filter enhancement of speech spectral amplitudes. IEEE Transactions on Audio, Speech and Language Processing, 18(8), 2155–2167. doi:10.1109/TASL.2010.2042127