Download Now Free registration required
In this paper, the authors propose an unsupervised approach to speaker segmentation using Auto Associative Neural Network (AANN). Speaker segmentation aims at finding speaker change points in a speech signal which is an important preprocessing step to audio indexing, spoken document retrieval and multi speaker diarization. The method extracts the speaker specific information from the Mel Frequency Cepstral Coefficients (MFCC). The speaker change points are detected using the distribution capturing ability of the AANN model. Experiments are carried out on different audio databases, and the method is capable of detecting speaker changes with short duration of speech in an unsupervised manner.
- Format: PDF
- Size: 225.46 KB