Text normalization with varied data sources for conversational speech language modelingAcoustics, Speech, and Signal Processing, 2002. Proceedings. (ICASSP '02). IEEE International Conference on, Vol. 1 (2002), pp. I-789-I-792 vol.1.
|
Reviews
[Write a review of this article]
The text from related topic but in different style was used in LM training for an ASR task. Text normalization was performed. Therefore, more training data could be used in LM training, and the OOV rate, as well as the recognition performance, was improved.
- 2008-06-25 01:41:47
Find related articles from these CiteULike users
Find related articles with these CiteULike tags
AbstractCollecting sufficient language model training data for good speech recognition performance in a new domain is often difficult. However, there may be other sources of data that are matched in terms of topic or style, if not both. This paper looks at the use of text normalization tools to make these data more suitable for language model training, in conjunction with mixture models to combine data from different sources. We specifically address the task of recognizing meeting speech, showing a small reduction in word error rate over a baseline language model trained from conversational speech data
BibTeX record
RIS record