Speech recognition with text generation from portions of voice data preselected by manual-input commands
Abstract
A computer reads a voice speech recognition program from a first recording medium, and reads voice data from a second recording medium, and causes a CPU in the computer to recognize speech represented by the read voice data according to the speech recognition program, convert the result of speech recognition into text data, and display the converted text data on a display unit. A check mark button used by a speaker designates a portion of voice data, which is input through a microphone, corresponding to an unnecessary word or the like. The portion of the voice data in which a check mark is inscribed is not regarded as an object of speech recognition. Only the other portion of voice data in which the check mark is not inscribed is regarded as an object of speech recognition, and speech recognition is thus carried out. Furthermore, the sound level of a voice portion of voice data is rated. The gain of the voice data is adjusted according to the rated level. On the basis of the voice data whose sound level has been adjusted, speech recognition is carried out.
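The check-mark exclusion and gain adjustment described in the abstract can be sketched as follows. This is a minimal illustration, not the patented implementation: the `Frame` type, the function names, the use of peak amplitude as the "rated" sound level, and the `target_peak` value are all assumptions made for the example.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Frame:
    """One frame of voice data (hypothetical layout)."""
    samples: List[float]      # PCM samples, assumed to lie in [-1.0, 1.0]
    check_mark: bool = False  # True if the speaker flagged this frame


def frames_to_recognize(frames: List[Frame]) -> List[Frame]:
    """Keep only frames without a check mark; flagged frames are skipped."""
    return [f for f in frames if not f.check_mark]


def rate_level(frames: List[Frame]) -> float:
    """Rate the sound level as the peak absolute amplitude (an assumption)."""
    return max((abs(s) for f in frames for s in f.samples), default=0.0)


def adjust_gain(frames: List[Frame], target_peak: float = 0.9) -> List[Frame]:
    """Scale all samples so the rated level matches target_peak."""
    level = rate_level(frames)
    if level == 0.0:
        return frames  # silent input: nothing to scale
    gain = target_peak / level
    return [Frame([s * gain for s in f.samples], f.check_mark) for f in frames]
```

In this sketch the recognizer would receive `adjust_gain(frames_to_recognize(frames))`, so flagged frames never reach it and quiet recordings are scaled up first.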
16 Claims
1. A recording medium for use with a computer and having a speech recognition program recorded therein, wherein said speech recognition program causes a computer to:
read voice data from a voice data recording medium in which the voice data is recorded;
read a control condition manually inputted into said computer;
recognize speech represented by the voice data so as to convert it into text data;
limit the conversion of text data according to the control condition; and
display the text data;
said speech recognition program, responsive to said control condition, which includes a manually inputted time interval and a manually inputted number of words, causes the computer to recognize only a given number of words of voice data according to the manually inputted number of words, at intervals determined by the inputted time interval number, and to convert said given number of words of voice data into text data.
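The control condition in claim 1 (a word count applied at a fixed time interval) can be illustrated with a short sketch. It assumes the recognizer already yields words paired with their start times in seconds, and it filters that list after the fact rather than steering the recognizer itself; `sample_words` and its parameters are hypothetical names.

```python
from typing import Dict, List, Tuple


def sample_words(
    timed_words: List[Tuple[float, str]],
    interval_s: float,
    words_per_interval: int,
) -> List[str]:
    """Keep at most words_per_interval words from each interval_s window.

    timed_words holds (start_time_in_seconds, word) pairs sorted by time.
    """
    if interval_s <= 0 or words_per_interval < 0:
        raise ValueError("interval must be positive and word count non-negative")
    taken: Dict[int, int] = {}  # window index -> words already kept
    kept: List[str] = []
    for start, word in timed_words:
        window = int(start // interval_s)
        if taken.get(window, 0) < words_per_interval:
            taken[window] = taken.get(window, 0) + 1
            kept.append(word)
    return kept
```

With a 10-second interval and a count of 2, only the first two words that start in each 10-second window are kept.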
3. A recording medium for use with a computer and having a speech recognition program recorded therein, wherein said speech recognition program causes a computer to:
read voice data from a voice data recording medium in which the voice data is recorded;
read a control condition manually inputted into said computer;
recognize speech represented by the voice data so as to convert it into text data;
limit the conversion into text data according to the control condition; and
display the text data;
wherein said speech recognition program, responsive to said control condition, which includes a manually inputted original position in said voice data recording medium and a number of words starting at said original position, further causes the computer to recognize only the manually inputted number of words starting at the manually inputted original position in said voice data recording medium having voice data recorded therein and to convert them into text when causing the computer to recognize speech represented by the voice data and convert it to text data.
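Claim 3 limits recognition to a number of words counted from a chosen starting position. A minimal sketch, again assuming the words come with start times in seconds and that the "original position" is expressed as a time offset (both assumptions; `words_from_position` is a hypothetical name):

```python
from itertools import islice
from typing import List, Tuple


def words_from_position(
    timed_words: List[Tuple[float, str]],
    start_s: float,
    n_words: int,
) -> List[str]:
    """Return the first n_words words that begin at or after start_s."""
    later = (word for start, word in timed_words if start >= start_s)
    return list(islice(later, n_words))
```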
4. A recording medium having a speech recognition program recorded therein, wherein said voice recognition program causes a computer to:
read voice data from a voice recording medium in which the voice data is recorded;
recognize a control condition manually inputted into the computer, which condition is a given word; and
recognize speech represented by the voice data so as to detect the given word; and
indicate original positions of the given word in the voice recording medium responsive to the control condition, process the voice data so as to recognize occurrences of the given word therein, and indicate the original positions of the given word in said voice data.
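The word search in claim 4 amounts to reporting where a given word occurs in the recording. The sketch below assumes recognition has already produced (position, word) pairs and that case-insensitive matching is acceptable; neither detail comes from the claim, and `find_word_positions` is a hypothetical name.

```python
from typing import List, Tuple


def find_word_positions(
    timed_words: List[Tuple[float, str]],
    given_word: str,
) -> List[float]:
    """Return the recorded position of every occurrence of given_word."""
    target = given_word.casefold()
    return [pos for pos, word in timed_words if word.casefold() == target]
```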
8. A recording medium having a speech recognition program recorded therein, wherein said speech recognition program causes a computer to:
read voice data from a voice data recording medium in which the voice data is recorded;
recognize speech represented by the voice data so as to convert it into text data;
display the text data;
recognize a control designation manually inputted into said computer;
designate at least part of the text data responsive to said control designation; and
delete a portion of the voice data corresponding to a portion of the text data designated responsive to said control designation from said voice data recording medium, and cancel display of the designated portion of the text data.
9. A recording medium having a speech recognition program recorded therein, wherein said speech recognition program causes a computer to:
read voice data from the voice data recording medium in which the voice data is recorded;
recognize speech represented by the voice data so as to convert it into text data;
acquire position information of positions in said voice data recording medium, at which portions of the voice data corresponding to words of the text data are recorded, in one-to-one correspondence with the words;
display the text data;
recognize a control condition manually inputted into the computer designating at least a portion of the text data;
acquire position information of original positions in said voice data recording medium, at which a corresponding portion of the voice data is recorded, according to a word contained in a portion of the text data designated; and
delete a corresponding portion of the voice data from said voice data recording medium having the voice data recorded therein based on the position information, and cancel display of the designated portion of the text data.
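The word-to-position mapping in claim 9 is what makes text-driven deletion of audio possible. The following sketch assumes each recognized word carries the sample range it occupies and that words are sorted and do not overlap; the `Word` type and `delete_designated` function are hypothetical, and the voice data is modeled as an in-memory list rather than a recording medium.

```python
from dataclasses import dataclass, replace
from typing import List, Set, Tuple


@dataclass(frozen=True)
class Word:
    text: str
    start: int  # index of the word's first sample
    end: int    # index one past the word's last sample


def delete_designated(
    samples: List[int],
    words: List[Word],
    designated: Set[int],
) -> Tuple[List[int], List[Word]]:
    """Remove the audio and text of the words whose indices are designated.

    Returns the remaining samples and the remaining words, with word
    positions shifted to account for the deleted audio.
    """
    kept_samples: List[int] = []
    kept_words: List[Word] = []
    removed = 0  # number of samples deleted so far
    cursor = 0   # next sample not yet copied or skipped
    for i, w in enumerate(words):
        kept_samples.extend(samples[cursor:w.start])  # audio between words
        if i in designated:
            removed += w.end - w.start  # skip this word's audio
        else:
            kept_samples.extend(samples[w.start:w.end])
            kept_words.append(replace(w, start=w.start - removed, end=w.end - removed))
        cursor = w.end
    kept_samples.extend(samples[cursor:])  # trailing audio after the last word
    return kept_samples, kept_words
```

Joining the `text` fields of the returned words gives the display with the designated portion cancelled.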
10. A speech recognition apparatus, comprising:
a voice data reading means for reading voice data from a voice data recording medium in which the voice data is recorded;
a detecting means for detecting a check mark that is appended to the voice data and distinguishes an interval within the voice data;
a speech recognition means for ignoring a portion of the data associated with the given check mark and recognizing speech represented by another portion of the voice data; and
a display means for displaying the result of recognition performed by said speech recognition means.
11. Speech recognition apparatus according to claim 10, further comprising:
a voice data input means for inputting voice data;
an interval designating means enabling designation of a desired interval within the voice data input by said voice data input means;
a recording means for appending a check mark, which distinguishes the interval designated using said interval designating means, to the voice data and recording the voice data in a voice data recording medium; and
a recording medium attaching means for use in freely detachably attaching said voice data recording medium.
12. Speech recognition apparatus according to claim 10, wherein said speech recognition means ignores one of the words just before the frame having the given check mark recorded therein and/or the word which includes the frame having the check mark recorded therein.
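The rule in claim 12 can be sketched as a lookup from check-marked frames to the words that should be ignored. The sketch assumes each word is known by the inclusive frame range it spans and that spans are sorted and non-overlapping; the two boolean flags model the "and/or" in the claim, and all names are hypothetical.

```python
from typing import List, Set, Tuple


def words_to_ignore(
    word_spans: List[Tuple[int, int]],
    marked_frames: List[int],
    drop_previous: bool = True,
    drop_containing: bool = True,
) -> Set[int]:
    """Return indices of words to skip for the given check-marked frames.

    word_spans holds (first_frame, last_frame) per word, inclusive.
    """
    ignored: Set[int] = set()
    for frame in marked_frames:
        # Word whose span includes the marked frame, if any.
        containing = next(
            (i for i, (first, last) in enumerate(word_spans) if first <= frame <= last),
            None,
        )
        # Words that end before the marked frame; the last one is "just before".
        before = [i for i, (_, last) in enumerate(word_spans) if last < frame]
        if drop_containing and containing is not None:
            ignored.add(containing)
        if drop_previous and before:
            ignored.add(before[-1])
    return ignored
```

A check mark that falls in a pause between words has no containing word, so in this sketch only the preceding word is dropped.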
13. A recording medium having a speech recognition program recorded therein, wherein said speech recognition program causes a computer to:
read voice data from a voice data recording medium in which the voice data is recorded;
detect a check mark that is appended to the voice data and distinguishes an interval within the voice data;
ignore a portion of the voice data associated with the given check mark and recognize speech represented by another portion of the voice data; and
display the result of speech recognition.
15. A speech recognition apparatus for recognizing speech within a programmed computer, comprising:
a voice data reading means for reading voice data from a voice data recording medium in which the voice data is recorded;
means for manually inputting a control condition;
a speech recognition means for recognizing speech represented by the voice data and converting it into text data; and
said speech recognition means including means responsive to said control condition to perform speech recognition according to the control condition;
a display means for displaying the text data, said speech recognition program, responsive to said control condition, further causing the computer to recognize only a given number of words and convert them into text data at intervals of a given time when causing the computer to recognize speech represented by the voice data and convert it into text data; and
further including an attachment for receiving said voice data recording medium.
Specification