This paper considers the problem of co-channel speech separation for convolutive mixtures in which visual cues from one of the speakers are available as side information. In the two-speaker case, the visual cues from that one speaker are used to estimate the spectral content of that speaker's speech, and this spectral estimate is in turn used to constrain the solution for the coupling reconstruction filters of the convolutive mixture. Preliminary experimental results show good separation performance in a limited case study in which the visual cues are obtained from the spoken numbers "one" through "four".

Additional Metadata
Conference Proceedings - IEEE International Conference on Acoustics, Speech, and Signal Processing
Citation
Dansereau, R. (2004). Co-channel audiovisual speech separation using spectral matching constraints. In Proceedings - IEEE International Conference on Acoustics, Speech, and Signal Processing.