Speech recognition with text generation from portions of voice data preselected by manual-input commands

US 6,353,809 B2
Filed: 06/02/1998
Issued: 03/05/2002
Est. Priority Date: 06/06/1997
Status: Expired due to Term



Abstract

A computer reads a voice speech recognition program from a first recording medium, and reads voice data from a second recording medium, and causes a CPU in the computer to recognize speech represented by the read voice data according to the speech recognition program, convert the result of speech recognition into text data, and display the converted text data on a display unit. A check mark button used by a speaker designates a portion of voice data, which is input through a microphone, corresponding to an unnecessary word or the like. The portion of the voice data in which a check mark is inscribed is not regarded as an object of speech recognition. Only the other portion of voice data in which the check mark is not inscribed is regarded as an object of speech recognition, and speech recognition is thus carried out. Furthermore, the sound level of a voice portion of voice data is rated. The gain of the voice data is adjusted according to the rated level. On the basis of the voice data whose sound level has been adjusted, speech recognition is carried out.
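The level-rating and gain-adjustment step mentioned at the end of the abstract can be illustrated with a minimal sketch. The abstract does not say how the sound level is rated, so this example assumes an RMS measure over samples above a silence threshold; the function name `normalize_voice_level`, the threshold, and the target level are all hypothetical choices made for illustration.

```python
import math
from typing import List


def normalize_voice_level(
    samples: List[float],
    target_rms: float = 0.2,          # assumed target level, not from the patent
    silence_threshold: float = 0.02,  # assumed cutoff separating voice from silence
) -> List[float]:
    """Rate the level of the voiced samples and scale all samples to a target level."""
    # Treat only samples louder than the threshold as the "voice portion".
    voiced = [s for s in samples if abs(s) > silence_threshold]
    if not voiced:
        return list(samples)  # nothing to rate, so leave the data unchanged

    # Rate the sound level as the RMS of the voiced samples (an assumption).
    rated_level = math.sqrt(sum(s * s for s in voiced) / len(voiced))

    # Adjust the gain so the rated level matches the target, clipping to [-1, 1].
    gain = target_rms / rated_level
    return [max(-1.0, min(1.0, s * gain)) for s in samples]
```

Recognition would then run on the returned samples rather than on the raw recording.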

16 Claims

1. A recording medium for use with a computer and having a speech recognition program recorded therein, wherein said speech recognition program causes a computer to:
- read voice data from a voice data recording medium in which the voice data is recorded;
  
  read a control condition manually inputted into said computer;
  
  recognize speech represented by the voice data so as to convert it into text data;
  
  limit the conversion of text data according to the control condition; and
  
  display the text data;
  
  said speech recognition program, responsive to said control condition, which includes a manually inputted time interval and a manually inputted number of words, causes the computer to recognize only a given number of words of voice data according to the manually inputted number of words, at intervals determined by the inputted time interval number, and to convert said given number of words of voice data into text data.

2. A speech recognition apparatus according to claim 1, wherein voice data recorded in said voice data recording medium is compressed digital voice data.
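The selection rule in claim 1 (a manually entered number of words, recognized at intervals set by a manually entered time interval) can be illustrated with a minimal sketch. It assumes a hypothetical recognizer has already produced words with start times, sorted by time; `TimedWord` and `sample_words` are illustrative names, not anything defined in the patent, and an actual implementation would presumably limit recognition itself rather than filter its output afterwards.

```python
from typing import Dict, List, NamedTuple


class TimedWord(NamedTuple):
    text: str
    start: float  # seconds from the beginning of the recording


def sample_words(
    words: List[TimedWord], interval_sec: float, num_words: int
) -> List[TimedWord]:
    """Keep at most num_words words from each interval_sec-long window."""
    if interval_sec <= 0 or num_words < 0:
        raise ValueError("interval_sec must be positive and num_words non-negative")

    kept: List[TimedWord] = []
    taken_per_window: Dict[int, int] = {}
    for word in words:  # assumed to be sorted by start time
        window = int(word.start // interval_sec)
        if taken_per_window.get(window, 0) < num_words:
            kept.append(word)
            taken_per_window[window] = taken_per_window.get(window, 0) + 1
    return kept
```

With a 10-second interval and a word count of 2, the sketch keeps the first two words that start in each 10-second window and drops the rest.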

3. A recording medium for use with a computer and having a speech recognition program recorded therein, wherein said speech recognition program causes a computer to:
- read voice data from a voice data recording medium in which the voice data is recorded;
  
  read a control condition manually inputted into said computer;
  
  recognize speech represented by the voice data so as to convert it into text data;
  
  limit the conversion into text data according to the control condition; and
  
  display the text data;
  
  wherein said speech recognition program, responsive to said control condition, which includes a manually inputted original position in said voice data recording medium and a number of words starting at said original position, further causes the computer to recognize only the manually inputted number of words starting at the manually inputted original position in said voice data recording medium having voice data recorded therein and to convert them into text when causing the computer to recognize speech represented by the voice data and convert it to text data.
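Claim 3 limits recognition to a given number of words starting at a manually entered position. A minimal sketch of that rule follows, assuming the position is expressed as a time offset in seconds and that word start times are already available from a hypothetical recognizer; `words_from_position` is an illustrative name.

```python
from typing import List, Tuple

Word = Tuple[str, float]  # (text, start time in seconds); an assumed representation


def words_from_position(
    words: List[Word], start_sec: float, num_words: int
) -> List[str]:
    """Return the first num_words words that begin at or after start_sec."""
    if num_words < 0:
        raise ValueError("num_words must be non-negative")
    # Words are assumed to be sorted by start time.
    selected = [text for text, start in words if start >= start_sec]
    return selected[:num_words]
```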

4. A recording medium having a speech recognition program recorded therein, wherein said voice recognition program causes a computer to:
- read voice data from a voice recording medium in which the voice data is recorded;
  
  recognize a control condition manually inputted into the computer, which condition is a given word; and
  
  recognize speech represented by the voice data so as to detect the given word; and
  
  indicate original positions of the given word in the voice recording medium responsive to the control condition, process the voice data so as to recognize occurrences of the given word therein, and indicate the original positions of the given word in said voice data.

5. A recording medium having a speech recognition program recorded therein according to claim 4, wherein said speech recognition program further causes the computer to create an index mark at the original positions of the given word in said voice data recording medium having the voice data recorded therein after causing the computer to recognize speech represented by the voice data and detect the given word.

6. A recording medium having a speech recognition program recorded therein according to claim 5, wherein said speech recognition program responsive to said control condition, further causes the computer to reproduce voice data starting at a given original position in said voice data recording medium having the voice data recorded therein after causing the computer to indicate the original positions of the given word.

7. A recording medium having a speech recognition program recorded therein according to claim 4, wherein said speech recognition program further causes the computer to indicate the position of the given word in a position of reproduction indicative of the voice data.
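The word search described in claims 4 to 7 can be sketched as a lookup over recognized words. This is only an illustration under stated assumptions: a hypothetical recognizer supplies each word with its start time, matching is case-insensitive, and `find_word_positions` is an invented name.

```python
from typing import List, Tuple

Word = Tuple[str, float]  # (text, start time in seconds); an assumed representation


def find_word_positions(words: List[Word], given_word: str) -> List[float]:
    """Return the start time of every occurrence of given_word."""
    target = given_word.casefold()
    return [start for text, start in words if text.casefold() == target]
```

The returned positions could then be stored as index marks or used as playback start points, in the spirit of claims 5 and 6.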

8. A recording medium having a speech recognition program recorded therein, wherein said speech recognition program causes a computer to:
- read voice data from a voice data recording medium in which the voice data is recorded;
  
  recognize speech represented by the voice data so as to convert it into text data;
  
  display the text data;
  
  recognizing a control designation manually inputted into said computer;
  
  designating at least part of the text data responsive to said control designation; and
  
  delete a portion of the voice data corresponding to a portion of the text data designated responsive to said control designation from said voice data recording medium, and cancel display of the designated portion of the text data.

9. A recording medium having a speech recognition program recorded therein, wherein said speech recognition program causes a computer to:
- read voice data from the voice data recording medium in which the voice data is recorded;
  
  recognize speech represented by the voice data so as to convert it into text data;
  
  acquire position information of positions in said voice data recording medium, at which portions of the voice data corresponding to words of the text data are recorded, in one-to-one correspondence with the words;
  
  display the text data;
  
  recognize a control condition manually inputted into the computer designating at least a portion of the text data;
  
  acquire position information of original positions in said voice data recording medium, at which a corresponding portion of the voice data is recorded, according to a word contained in a portion of the text data designated; and
  
  delete a corresponding portion of the voice data from said voice data recording medium having the voice data recorded therein based on the position information, and cancel display of the designated portion of the text data.
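Claims 8 and 9 tie each displayed word to the position of its voice data so that deleting text also deletes the matching audio. A minimal sketch of that bookkeeping follows, assuming audio is held as a list of samples and each word records a half-open sample range; the `Word` class and `delete_words` function are hypothetical.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass
class Word:
    text: str
    start: int  # index of the word's first sample
    end: int    # index one past the word's last sample


def delete_words(
    samples: List[int], words: List[Word], first: int, last: int
) -> Tuple[List[int], List[Word]]:
    """Delete words[first..last] (inclusive) from both the text and the audio."""
    if not 0 <= first <= last < len(words):
        raise IndexError("designated word range is out of bounds")

    # Look up the audio span from the position information of the designated words.
    cut_start, cut_end = words[first].start, words[last].end
    removed = cut_end - cut_start

    # Remove the audio span, then shift the positions of the words that follow.
    new_samples = samples[:cut_start] + samples[cut_end:]
    new_words = list(words[:first])
    for word in words[last + 1:]:
        new_words.append(Word(word.text, word.start - removed, word.end - removed))
    return new_samples, new_words
```

Shifting the later positions keeps the one-to-one correspondence between words and audio valid after the deletion.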

10. A speech recognition apparatus, comprising:
- a voice data reading means for reading voice data from a voice data recording medium in which the voice data is recorded;
  
  a detecting means for detecting a check mark that is appended to the voice data and distinguishes an interval within the voice data;
  
  a speech recognition means for ignoring a portion of the data associated with the given check mark and recognizing speech represented by another portion of the voice data; and
  
  a display means for displaying the result of recognition performed by said speech recognition means.

11. A speech recognition apparatus according to claim 10, wherein, the check mark is recorded by a voice recording apparatus including:

12. Speech recognition apparatus according to claim 10, wherein said speech recognition means ignores one of the words just before the frame having the given check mark recorded therein and/or the word which includes the frame having the check mark recorded therein.

13. A recording medium having a speech recognition program recorded therein, wherein said speech recognition program causes a computer to:
- read voice data from a voice data recording medium in which the voice data is recorded;
  
  detect a check mark that is appended to the voice data and distinguishes an interval within the voice data;
  
  ignore a portion of the voice data associated with the given check mark and recognize speech represented by another portion of the voice data; and
  
  display the result of speech recognition.

14. Speech recognition apparatus according to claim 13, wherein said speech recognition means ignores one of the words just before the frame having the given check mark recorded therein and/or the word which includes the frame having the check mark recorded therein.
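The check-mark handling in claims 10 to 14 can be illustrated with a minimal sketch. It assumes each word is known by its first and last frame number and that check marks are stored as frame numbers; the claims do not specify how word boundaries are obtained, so that part, along with the name `drop_checked_words`, is an assumption.

```python
from typing import Iterable, List, Tuple

FramedWord = Tuple[str, int, int]  # (text, first frame, last frame), inclusive


def drop_checked_words(
    words: List[FramedWord],
    mark_frames: Iterable[int],
    drop_previous: bool = False,
) -> List[str]:
    """Return the words that remain after ignoring check-marked ones."""
    marks = list(mark_frames)
    skipped = set()
    for i, (_, first, last) in enumerate(words):
        # A word is ignored when a check mark falls inside its frame range.
        if any(first <= frame <= last for frame in marks):
            skipped.add(i)
            # Optionally also ignore the word just before the marked one.
            if drop_previous and i > 0:
                skipped.add(i - 1)
    return [text for i, (text, _, _) in enumerate(words) if i not in skipped]
```

The `drop_previous` flag corresponds loosely to the "and/or" wording in claims 12 and 14, where the word before the marked frame may be ignored as well.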

15. A speech recognition apparatus for recognizing speech within a programmed computer, comprising:
- a voice data reading means for reading voice data from a voice data recording medium in which the voice data is recorded;
  
  means for manually inputting a control condition;
  
  a speech recognition means for recognizing speech represented by the voice data and converting it into text data; and
  
  said speech recognition means including means responsive to said control condition to perform speech recognition according to the control condition;
  
  a display means for displaying the text data, said speech recognition program, responsive to said control condition, further causing the computer to recognize only a given number of words and convert them into text data at intervals of a given time when causing the computer to recognize speech represented by the voice data and convert it into text data; and
  
  further including an attachment for receiving said voice data recording medium.

16. A voice recognition apparatus according to claim 15, wherein said voice data recording medium is attached to said attachment via an adaptor.


Current Assignee: Olympus Corporation
Original Assignee: Olympus Optical Corporation Limited (Olympus Corporation)
Inventors: Takahashi, Hidetaka; Onishi, Takafumi
Primary Examiner(s): Smits, Talivaldis Ivars
Application Number: US09/088,996
Publication Number: US 20010016815A1
Time in Patent Office: 1,372 Days
Field of Search: 704/235, 704/233, 704/225, 704/234, 704/251, 704/278
US Class Current: 704/235
CPC Class Codes: G10L 15/26 Speech to text systems G10L...
