Speaker Localization and Speech Separation Using Phase Difference Versus Frequency Distribution
This paper proposes a novel sparse source separation method using a pair of microphones. The method is based on Time Frequency (T-F) decomposition, applies the weighted Hough transform to the Phase Difference (PD) versus Frequency (PD-F) distribution of received mixture signals, and estimates source directions. Then, the estimated source directions and harmonic structure are used to separate the mixture signals. The effectiveness of the proposed method is shown through experiments in real acoustic circumstances. Blind Source Separation (BSS) aims to estimate source signals by using only mixed signals without any priori information about the source position, mixing process, or circumstances.