VOICE RECOGNITION APPARATUS AND RECORDING MEDIUM HAVING VOICE RECOGNITION PROGRAM RECORDED THEREIN

US 20010016815A1
Filed: 06/02/1998
Published: 08/23/2001
Est. Priority Date: 06/06/1997
Status: Active Grant

First Claim

Patent Images

1. A voice recognition apparatus for recognizing voice within a programmed computer, comprising:

a voice data reading means for reading voice data from a voice data recording medium in which the voice data is recorded;

a voice recognition means for recognizing voice represented by the voice data and converting it into text data; and

a display means for displaying the text data.

View all claims

4 Assignments

Timeline View

Assignment View

0 Petitions

Accused Products

Abstract

The present invention relates to what causes a computer to read a voice recognition program from a first recording medium, and read voice data from a second recording medium, and causes a CPU in the computer to recognize voice represented by the read voice data according to the voice recognition program, convert the result of voice recognition into text data, and display the converted text data on a display unit.

Also included is a check mark button used by a speaker to designate a portion of voice data, which is input through a microphone, corresponding to an unnecessary word or the like. The portion of the voice data in which a check mark is inscribed is not regarded as an object of voice recognition. Only the other portion of the voice data in which the check mark is not inscribed is regarded as an object of voice recognition, and voice recognition is thus carried out.

Furthermore, the sound level of a voiceful portion of voice data is rated. The gain of the voice data is adjusted according to the rated level. On the basis of the voice data whose sound level has been adjusted, voice recognition is carried out.

183 Citations

23 Claims

1. A voice recognition apparatus for recognizing voice within a programmed computer, comprising:
- a voice data reading means for reading voice data from a voice data recording medium in which the voice data is recorded;
  
  a voice recognition means for recognizing voice represented by the voice data and converting it into text data; and
  
  a display means for displaying the text data.
- View Dependent Claims (2, 22, 23)
- - 2. A voice recognition apparatus according to claim 1, wherein voice data recorded in said voice data recording medium is compressed digital voice data.
  - 22. A voice recognition apparatus according to claim 1, wherein said voice recognition apparatus includes an attachment permitting attachment of said voice data recording medium.
  - 23. A voice recognition apparatus according to claim 22, wherein said voice data recording medium is attached to said attachment via an adaptor.

3. A recording medium having a voice recognition program recorded therein, wherein said voice recognition program causes a computer to:
- read voice data from a voice data recording medium in which the voice data is recorded;
  
  recognize voice represented by the voice data so as to convert it into text data; and
  
  display the text data.
- View Dependent Claims (4, 5)
- - 4. A recording medium having a voice recognition program recorded therein according to claim 3, wherein said voice recognition program further causes the computer to recognize in voice or voice-recognize only a given number of words and convert them into text data at intervals of a given time when causing the computer to recognize voice represented by the voice data and convert it into text data.
  - 5. A recording medium having a voice recognition program recorded therein according to claim 3 or 4, wherein said voice recognition program further causes the computer to voice-recognize only a given number of words starting at a given position in said voice data recording medium having voice data recorded therein and to convert them into text data when causing the computer to recognize voice represented by the voice data and convert it to text data.

6. A recording medium having a voice recognition program recorded therein, wherein said voice recognition program causes a computer to:
- read voice data from a voice data recording medium in which the voice data is recorded;
  
  recognize voice represented by the voice data so as to detect a given word; and
  
  indicate the positions of the given word.
- View Dependent Claims (7, 8)
- - 7. A recording medium having a voice recognition program recorded therein according to claim 6, wherein said voice recognition program further causes the computer to create an index mark at the positions of the given word in said voice data recording medium having the voice data recorded therein after causing the computer to recognize voice represented by the voice data and detect the given word.
  - 8. A recording medium having a voice recognition program recorded therein according to claim 7, wherein said voice recognition program further causes the computer to reproduce voice data starting at a given position in said voice data recording medium having the voice data recorded therein after causing the computer to indicate the positions of the given word.

9. A recording medium having a voice recognition program recorded therein, wherein said voice recognition program causes a computer to:
- read voice data from a voice data recording medium in which the voice data is recorded;
  
  recognize voice represented by the voice data so as to convert it into text data;
  
  display the text data;
  
  enable designation of at least part of the text data using a designation input means; and
  
  delete a portion of the voice data corresponding to a portion of the text data designated using said designation input means from said voice data recording medium, and cancel display of the designated portion of the text data.

10. A recording medium having a voice recognition program recorded therein, wherein said voice recognition program causes a computer to:
- read voice data from a voice data recording medium in which the voice data is recorded;
  
  recognize voice represented by the voice data so as to convert it into text data;
  
  acquire position information of positions in said voice data recording medium, at which portions of the voice data corresponding to words of the text data are recorded, in one-to-one correspondence with the words;
  
  display the text data;
  
  enable designation of at least part of the text data using a designation input means;
  
  acquire position information of positions in said voice data recording medium, at which a corresponding portion of the voice data is recorded, according to a word contained in a portion of the text data designated using said designation input means; and
  
  delete the corresponding portion of the voice data from said voice data recording medium having the voice data recorded therein on the basis of the position information, and cancel display of the designated portion of the text data.

11. A voice recognition apparatus, comprising:
- a voice data reading means for reading voice data from a voice data recording medium in which the voice data is recorded;
  
  a detecting means for detecting a check mark that is appended to the voice data and distinguishes an interval within the voice data;
  
  a voice recognition means for not recognizing voice represented by a portion of the voice data associated with the given check mark but recognizing voice represented by the other portion of the voice data; and
  
  a display means for displaying the result of recognition performed by said voice recognition means.
- View Dependent Claims (12)
- - 12. A voice recognition apparatus according to claim 11, wherein the check mark is recorded by a voice recording apparatus including:
    - a voice data input means for inputting voice data;
      
      an interval designating means enabling designation of a desired interval within the voice data input by said voice data input means;
      
      a recording means for appending a check mark, which distinguishes the interval designated using said interval designating means, to the voice data and recording the voice data in a voice data recording medium; and
      
      a recording medium attaching means for use in freely detachably attaching said voice data recording medium.

13. A recording medium having a voice recognition program recorded therein, wherein said voice recognition program causes a computer to:
- read voice data from a voice data recording medium in which the voice data is recorded;
  
  detect a check mark that is appended to the voice data and distinguishes an interval within the voice data;
  
  not recognize voice represented by a portion of the voice data associated with the given check mark but recognize voice represented by the other portion of the voice data; and
  
  display the result of voice recognition.

14. A voice recognition apparatus, comprising:
- a voice data reading means for reading voice data from a voice data recording medium in which the voice data is recorded;
  
  a level adjusting means for adjusting the sound level of the voice data read by said voice data reading means according to a given procedure;
  
  a voice recognizing means for recognizing voice represented by the voice data whose sound level has been adjusted by said level adjusting means; and
  
  a display means for displaying the result of recognition performed by said voice recognizing means.

15. A voice recognition apparatus, comprising:
- a voice data reading means for reading voice data from a voice data recording medium in which the voice data is recorded;
  
  a voice rating means for rating the voice data read by said voice data reading means as voiceful portions and voiceless portions;
  
  a level adjusting means for adjusting the sound level of the voice data read by said voice data reading means on the basis of absolute values of amplitudes of voice signals of voice data items rated as the voiceful portions by said voice rating means;
  
  a voice recognizing means for inputting the voice data whose sound level has been adjusted by said level adjusting means, and recognizing voice; and
  
  a display means for displaying the result of recognition performed by said voice recognizing means.
- View Dependent Claims (16)
- - 16. A voice recognition apparatus according to claim 15, further comprising a minimum value calculating means for calculating a minimum value of an energy level of voice data of a given interval, wherein a criterion of said voice rating means is set on the basis of the minimum value calculated by said minimum value calculating means.

17. A voice recognition apparatus, comprising:
- a voice data reading means for reading voice data from a voice data recording medium in which the voice data is recorded;
  
  a voice rating means for rating the voice data read by said voice data reading means as voiceful portions and voiceless portions;
  
  an averaging means for averaging absolute values of voice data items rated as the voiceful portions by said voice rating means;
  
  a gain calculating means for calculating a gain on the basis of the average value;
  
  a multiplying means for multiplying the voice data by the gain;
  
  a voice recognizing means for recognizing voice represented by the voice data multiplied by the gain; and
  
  a display means for displaying the result of recognition performed by said voice recognizing means.

18. A voice recognition apparatus, comprising:
- a voice data reading means for reading voice data of a desired file from a voice data recording medium in which voice data digitized and divided into frames is recorded in units of a file;
  
  a voice rating means for rating the voice data read by said voice data reading means as voiceful frames and voiceless frames;
  
  an averaging means for averaging absolute values of voice data items in frames rated as the voiceful frames by said voice rating means;
  
  a gain calculating means for calculating a gain on the basis of the average value;
  
  a multiplying means for multiplying the voice data by the gain;
  
  a voice recognizing means for recognizing voice represented by the voice data multiplied by the gain; and
  
  a display means for displaying the result of recognition performed by said voice recognizing means.

19. A recording medium having a voice recognition program recorded therein, wherein said voice recognition program causes a computer to:
- read voice data from a voice data recording medium in which the voice data is recorded;
  
  adjust the sound level of the read voice data;
  
  recognize voice represented by the voice data whose sound level has been adjusted; and
  
  display the result of voice recognition.

20. A recording medium having a voice recognition program recorded therein, wherein said voice recognition program causes a computer to:
- read voice data from a voice data recording medium in which the voice data is recorded;
  
  rate the read voice data as voiceful portions and voiceless portions;
  
  adjust the sound level of the read voice data on the basis of the absolute values of voice data items rated as the voiceful portions according to a given procedure;
  
  recognize voice represented by the voice data whose sound level has been adjusted; and
  
  display the result of voice recognition.

21. A recording medium having a voice recognition program recorded therein, wherein said voice recognition program causes a computer to:
- read voice data from a voice data recording medium in which the voice data is recorded;
  
  rate the read voice data as voiceful portions and voiceless portions;
  
  average absolute values of voice data items rated as the voiceful portions;
  
  calculate a gain on the basis of the average value;
  
  multiply the voice data by the gain;
  
  input the voice data multiplied by the gain so as to recognize voice; and
  
  display the result of voice recognition.

Specification

Resources

Litigation Campaign Assessment

Current Assignee
Olympus Corporation
Original Assignee
Olympus Optical Corporation Limited (Olympus Corporation)
Inventors
ONISHI, TAKAFUMI, TAKAHASHI, HIDETAKA

Granted Patent

US 6,353,809 B2
Time in Patent Office

Days
Field of Search
US Class Current

704/235
CPC Class Codes

G10L 15/26 Speech to text systems G10L...

VOICE RECOGNITION APPARATUS AND RECORDING MEDIUM HAVING VOICE RECOGNITION PROGRAM RECORDED THEREIN

First Claim

4 Assignments

0 Petitions

Accused Products

Abstract

183 Citations

23 Claims

Specification

Solutions

Use Cases

Quick Links

VOICE RECOGNITION APPARATUS AND RECORDING MEDIUM HAVING VOICE RECOGNITION PROGRAM RECORDED THEREIN

First Claim

4 Assignments

Subscription Required

Subscription Required

0 Petitions

Subscription Required

Accused Products

Subscription Required

Abstract

183 Citations

23 Claims

Specification

Subscription Required

Solutions

Use Cases

Quick Links