VOICE RECOGNITION APPARATUS AND RECORDING MEDIUM HAVING VOICE RECOGNITION PROGRAM RECORDED THEREIN
First Claim
1. A voice recognition apparatus for recognizing voice within a programmed computer, comprising:
- a voice data reading means for reading voice data from a voice data recording medium in which the voice data is recorded;
a voice recognition means for recognizing voice represented by the voice data and converting it into text data; and
a display means for displaying the text data.
4 Assignments
0 Petitions
Accused Products
Abstract
The present invention relates to what causes a computer to read a voice recognition program from a first recording medium, and read voice data from a second recording medium, and causes a CPU in the computer to recognize voice represented by the read voice data according to the voice recognition program, convert the result of voice recognition into text data, and display the converted text data on a display unit.
Also included is a check mark button used by a speaker to designate a portion of voice data, which is input through a microphone, corresponding to an unnecessary word or the like. The portion of the voice data in which a check mark is inscribed is not regarded as an object of voice recognition. Only the other portion of the voice data in which the check mark is not inscribed is regarded as an object of voice recognition, and voice recognition is thus carried out.
Furthermore, the sound level of a voiceful portion of voice data is rated. The gain of the voice data is adjusted according to the rated level. On the basis of the voice data whose sound level has been adjusted, voice recognition is carried out.
183 Citations
23 Claims
-
1. A voice recognition apparatus for recognizing voice within a programmed computer, comprising:
-
a voice data reading means for reading voice data from a voice data recording medium in which the voice data is recorded;
a voice recognition means for recognizing voice represented by the voice data and converting it into text data; and
a display means for displaying the text data. - View Dependent Claims (2, 22, 23)
-
-
3. A recording medium having a voice recognition program recorded therein, wherein said voice recognition program causes a computer to:
-
read voice data from a voice data recording medium in which the voice data is recorded;
recognize voice represented by the voice data so as to convert it into text data; and
display the text data. - View Dependent Claims (4, 5)
-
-
6. A recording medium having a voice recognition program recorded therein, wherein said voice recognition program causes a computer to:
-
read voice data from a voice data recording medium in which the voice data is recorded;
recognize voice represented by the voice data so as to detect a given word; and
indicate the positions of the given word. - View Dependent Claims (7, 8)
-
-
9. A recording medium having a voice recognition program recorded therein, wherein said voice recognition program causes a computer to:
-
read voice data from a voice data recording medium in which the voice data is recorded;
recognize voice represented by the voice data so as to convert it into text data;
display the text data;
enable designation of at least part of the text data using a designation input means; and
delete a portion of the voice data corresponding to a portion of the text data designated using said designation input means from said voice data recording medium, and cancel display of the designated portion of the text data.
-
-
10. A recording medium having a voice recognition program recorded therein, wherein said voice recognition program causes a computer to:
-
read voice data from a voice data recording medium in which the voice data is recorded;
recognize voice represented by the voice data so as to convert it into text data;
acquire position information of positions in said voice data recording medium, at which portions of the voice data corresponding to words of the text data are recorded, in one-to-one correspondence with the words;
display the text data;
enable designation of at least part of the text data using a designation input means;
acquire position information of positions in said voice data recording medium, at which a corresponding portion of the voice data is recorded, according to a word contained in a portion of the text data designated using said designation input means; and
delete the corresponding portion of the voice data from said voice data recording medium having the voice data recorded therein on the basis of the position information, and cancel display of the designated portion of the text data.
-
-
11. A voice recognition apparatus, comprising:
-
a voice data reading means for reading voice data from a voice data recording medium in which the voice data is recorded;
a detecting means for detecting a check mark that is appended to the voice data and distinguishes an interval within the voice data;
a voice recognition means for not recognizing voice represented by a portion of the voice data associated with the given check mark but recognizing voice represented by the other portion of the voice data; and
a display means for displaying the result of recognition performed by said voice recognition means. - View Dependent Claims (12)
-
-
13. A recording medium having a voice recognition program recorded therein, wherein said voice recognition program causes a computer to:
-
read voice data from a voice data recording medium in which the voice data is recorded;
detect a check mark that is appended to the voice data and distinguishes an interval within the voice data;
not recognize voice represented by a portion of the voice data associated with the given check mark but recognize voice represented by the other portion of the voice data; and
display the result of voice recognition.
-
-
14. A voice recognition apparatus, comprising:
-
a voice data reading means for reading voice data from a voice data recording medium in which the voice data is recorded;
a level adjusting means for adjusting the sound level of the voice data read by said voice data reading means according to a given procedure;
a voice recognizing means for recognizing voice represented by the voice data whose sound level has been adjusted by said level adjusting means; and
a display means for displaying the result of recognition performed by said voice recognizing means.
-
-
15. A voice recognition apparatus, comprising:
-
a voice data reading means for reading voice data from a voice data recording medium in which the voice data is recorded;
a voice rating means for rating the voice data read by said voice data reading means as voiceful portions and voiceless portions;
a level adjusting means for adjusting the sound level of the voice data read by said voice data reading means on the basis of absolute values of amplitudes of voice signals of voice data items rated as the voiceful portions by said voice rating means;
a voice recognizing means for inputting the voice data whose sound level has been adjusted by said level adjusting means, and recognizing voice; and
a display means for displaying the result of recognition performed by said voice recognizing means. - View Dependent Claims (16)
-
-
17. A voice recognition apparatus, comprising:
-
a voice data reading means for reading voice data from a voice data recording medium in which the voice data is recorded;
a voice rating means for rating the voice data read by said voice data reading means as voiceful portions and voiceless portions;
an averaging means for averaging absolute values of voice data items rated as the voiceful portions by said voice rating means;
a gain calculating means for calculating a gain on the basis of the average value;
a multiplying means for multiplying the voice data by the gain;
a voice recognizing means for recognizing voice represented by the voice data multiplied by the gain; and
a display means for displaying the result of recognition performed by said voice recognizing means.
-
-
18. A voice recognition apparatus, comprising:
-
a voice data reading means for reading voice data of a desired file from a voice data recording medium in which voice data digitized and divided into frames is recorded in units of a file;
a voice rating means for rating the voice data read by said voice data reading means as voiceful frames and voiceless frames;
an averaging means for averaging absolute values of voice data items in frames rated as the voiceful frames by said voice rating means;
a gain calculating means for calculating a gain on the basis of the average value;
a multiplying means for multiplying the voice data by the gain;
a voice recognizing means for recognizing voice represented by the voice data multiplied by the gain; and
a display means for displaying the result of recognition performed by said voice recognizing means.
-
-
19. A recording medium having a voice recognition program recorded therein, wherein said voice recognition program causes a computer to:
-
read voice data from a voice data recording medium in which the voice data is recorded;
adjust the sound level of the read voice data;
recognize voice represented by the voice data whose sound level has been adjusted; and
display the result of voice recognition.
-
-
20. A recording medium having a voice recognition program recorded therein, wherein said voice recognition program causes a computer to:
-
read voice data from a voice data recording medium in which the voice data is recorded;
rate the read voice data as voiceful portions and voiceless portions;
adjust the sound level of the read voice data on the basis of the absolute values of voice data items rated as the voiceful portions according to a given procedure;
recognize voice represented by the voice data whose sound level has been adjusted; and
display the result of voice recognition.
-
-
21. A recording medium having a voice recognition program recorded therein, wherein said voice recognition program causes a computer to:
-
read voice data from a voice data recording medium in which the voice data is recorded;
rate the read voice data as voiceful portions and voiceless portions;
average absolute values of voice data items rated as the voiceful portions;
calculate a gain on the basis of the average value;
multiply the voice data by the gain;
input the voice data multiplied by the gain so as to recognize voice; and
display the result of voice recognition.
-
Specification