The robustness of speech representations obtained from simulated auditory nerve fibers under different noise conditions.


Publication date: 2013-09-01
Journal: J Acoust Soc Am

DOI: 10.1121/1.4817912
Article type: Journal article


Different methods of extracting speech features from an auditory model were systematically investigated in terms of their robustness to different noises. The methods either computed the average firing rate within frequency channels (spectral features) or inter-spike-intervals (timing features) from the simulated auditory nerve response. When used as the front-end for an automatic speech recognizer, timing features outperformed spectral features in Gaussian noise. However, this advantage was lost in babble, because timing features extracted the spectro-temporal structure of babble noise, which is similar to the target speaker. This suggests that different feature extraction methods are optimal depending on the background noise.
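The two feature families named in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the millisecond units, and the histogram bin width are assumptions chosen for illustration, and the spike times are toy values rather than output from an auditory model. A rate ("spectral") feature counts spikes per channel over a window, while a timing feature summarizes the intervals between successive spikes.

```python
import math
from typing import List, Sequence


def mean_firing_rate(spike_times_ms: Sequence[float], duration_ms: float) -> float:
    """Average firing rate in spikes per second over a window of duration_ms."""
    if duration_ms <= 0:
        raise ValueError("duration_ms must be positive")
    return 1000.0 * len(spike_times_ms) / duration_ms


def inter_spike_intervals(spike_times_ms: Sequence[float]) -> List[float]:
    """Intervals (ms) between successive spikes, after sorting the spike times."""
    ordered = sorted(spike_times_ms)
    return [later - earlier for earlier, later in zip(ordered, ordered[1:])]


def isi_histogram(intervals_ms: Sequence[float],
                  bin_width_ms: float,
                  max_interval_ms: float) -> List[int]:
    """Count intervals per bin; intervals at or beyond max_interval_ms are dropped."""
    if bin_width_ms <= 0:
        raise ValueError("bin_width_ms must be positive")
    n_bins = math.ceil(max_interval_ms / bin_width_ms)
    counts = [0] * n_bins
    for isi in intervals_ms:
        index = int(isi // bin_width_ms)
        if 0 <= index < n_bins:
            counts[index] += 1
    return counts


# Toy spike train for one frequency channel (assumed values, in ms).
channel_spikes_ms = [0.0, 10.0, 20.0, 30.0]

rate_feature = mean_firing_rate(channel_spikes_ms, duration_ms=100.0)
timing_feature = isi_histogram(
    inter_spike_intervals(channel_spikes_ms),
    bin_width_ms=5.0,
    max_interval_ms=20.0,
)
```

In this sketch the rate feature discards spike timing within the window, whereas the interval histogram keeps the periodicity of the spike train. That periodicity is the kind of temporal structure the abstract says timing features also pick up from babble noise.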

