Discriminative Non-negative Matrix Factorization For Single-channel Speech Separation

Zi Wang and Fei Sha

Department of Computer Science and Technology, Tsinghua University, Beijing 100084, China; Department of Computer Science, University of Southern California, Los Angeles, CA 90089.

wangzi10@mails.tsinghua.edu.cn; feisha@usc.edu.


Non-negative matrix factorization (NMF) has emerged as a promising approach for single-channel speech separation. In this paper, we propose a new method of discriminative learning of NMF. In contrast to conventional approaches where the basis vectors are learned independently on clean signals from each speaker, our approach optimizes all basis vectors jointly to reconstruct both clean signals and mixed signals well. Our empirical studies validated our approach. Specifically, discriminative NMF outperforms standard methods by a large margin in improving signal-to-noise ratio for reconstructing signals.

Full text