Temporal Envelope Subtraction for Robust Speech Recognition Using Modulation Spectrum
In this paper, the authors present a new noise compensation technique for modulation frequency features derived from syllable length segments of sub-band temporal envelopes. The sub-band temporal envelopes are estimated using Frequency Domain Linear Prediction (FDLP). The authors propose a technique for noise compensation in FDLP where an estimate of the noise envelope is subtracted from the noisy speech envelope. The noise compensated FDLP envelopes are compressed with static (logarithmic) and dynamic (adaptive loops) compression and are transformed into modulation spectral features.