Speech recognition dictionary compilation assisting system, speech recognition dictionary compilation assisting method and speech recognition dictionary compilation assisting program
First Claim
1. A speech recognition dictionary compilation assisting system, comprising:
- a computer processing apparatus; and
a computer-readable storage medium having data stored thereon that includes a dictionary, a language model, an acoustic model, and a speech recognition dictionary compilation assisting program that is executable by the computer processing apparatus to cause the computer processing apparatus to operate as;
a text analysis section that applies morphological analysis to input text data to produce analyzed text data comprising words of the input text data and pronunciation information for each word;
a virtual speech recognition processing section that performs a speech recognition process on said analyzed text data received from the text analysis section by applying the dictionary and the language model to said analyzed text data thereby to generate virtual text data, and that compares a pronunciation information of the virtual text data with the pronunciation information of the analyzed text data to extract and output different points of the analyzed text data and the virtual text data, each different point comprising an element of the analyzed text data and a corresponding element of the virtual text data; and
an update processing section that corrects at least one of the dictionary and the language model in accordance with the different points identified by the virtual speech recognition processing section,wherein for each different point, the pronunciation information corresponding to a word of the analyzed text data differs from a corresponding pronunciation information of the virtual text data.
1 Assignment
0 Petitions
Accused Products
Abstract
A speech recognition dictionary compilation assisting system can create and update speech recognition dictionary and language models efficiently so as to reduce speech recognition errors by utilizing text data available at a low cost. The system includes speech recognition dictionary storage section 105, language model storage section 106 and acoustic model storage section 107. A virtual speech recognition processing section 102 processes analyzed text data generated by the text analyzing section 101 by making reference to the recognition dictionary, language models and acoustic models so as to generate virtual text data resulted from speech recognition, and compares the virtual text data resulted from speech recognition with the analyzed text data. The update processing section 103 updates the recognition dictionary and language models so as to reduce different point(s) between both sets of text data.
26 Citations
24 Claims
-
1. A speech recognition dictionary compilation assisting system, comprising:
-
a computer processing apparatus; and a computer-readable storage medium having data stored thereon that includes a dictionary, a language model, an acoustic model, and a speech recognition dictionary compilation assisting program that is executable by the computer processing apparatus to cause the computer processing apparatus to operate as; a text analysis section that applies morphological analysis to input text data to produce analyzed text data comprising words of the input text data and pronunciation information for each word; a virtual speech recognition processing section that performs a speech recognition process on said analyzed text data received from the text analysis section by applying the dictionary and the language model to said analyzed text data thereby to generate virtual text data, and that compares a pronunciation information of the virtual text data with the pronunciation information of the analyzed text data to extract and output different points of the analyzed text data and the virtual text data, each different point comprising an element of the analyzed text data and a corresponding element of the virtual text data; and an update processing section that corrects at least one of the dictionary and the language model in accordance with the different points identified by the virtual speech recognition processing section, wherein for each different point, the pronunciation information corresponding to a word of the analyzed text data differs from a corresponding pronunciation information of the virtual text data. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
-
-
9. A speech recognition dictionary compilation assisting method that uses a computer, comprising:
-
a text analysis step of, by the computer, applying morphological analysis to input text data to produce analyzed text data comprising words of the input text data and pronunciation information for each word; a step of, by the computer, generating virtual text data from speech recognition from the analyzed text data output from the text analysis step by using a dictionary, a language model, and acoustic models stored in storage devices connected to the computer; a step of, by the computer, comparing the pronunciation information of the analyzed text data with a pronunciation information of the virtual text data so as to extract and output different points therebetween, each different point comprising an element of the analyzed text data and a corresponding element of the virtual text data; and an updating process of, by the computer, correcting at least one of the dictionary and the language model in accordance with the different points, wherein for each different point, the pronunciation information corresponding to a word of the analyzed text data differs from a corresponding pronunciation information of the virtual text data. - View Dependent Claims (10, 11, 12, 13, 14, 15, 16)
-
-
17. A program stored on a non-transitory computer-readable storage medium and executable on a computer to cause the computer to operate as a speech recognition dictionary compilation assisting system that performs the following:
-
a text analysis process that applies morphological analysis to input text data to produce analyzed text data comprising words of the input text data and pronunciation information for each word; a process that generates virtual text data from speech recognition from the analyzed text data output from the text analysis process by using a dictionary, a language model and acoustic models stored in non-transitory computer-readable storage devices; a virtual speech recognition process that compares the pronunciation information of the analyzed text data with a pronunciation information of the virtual text data so as to extract and output different points therebetween, each different point comprising an element of the analyzed text data and a corresponding element of the virtual text data; and an updating process that corrects at least one of the dictionary and the language model in accordance with the different points, wherein for each different point, the pronunciation information corresponding to a word of the analyzed text data differs from a corresponding pronunciation information of the virtual text data. - View Dependent Claims (18, 19, 20, 21, 22, 23, 24)
-
Specification