Apparatus, method and computer program product for recognizing speech

US 20070225980A1
Filed: 03/01/2007
Published: 09/27/2007
Est. Priority Date: 03/24/2006
Status: Active Grant


Abstract

A speech recognition apparatus includes a first-candidate selecting unit that selects a recognition result of a first speech from first recognition candidates based on the likelihood of the first recognition candidates; a second-candidate selecting unit that extracts recognition candidates of an object word contained in the first speech and recognition candidates of a clue word from second recognition candidates, acquires the relevance ratio associated with the semantic relation between the extracted recognition candidates of the object word and the extracted recognition candidates of the clue word, and selects a recognition result of the second speech based on the acquired relevance ratio; a correction-portion identifying unit that identifies a portion corresponding to the object word in the first speech; and a correcting unit that corrects the word in the identified portion.

15 Claims

1. A speech recognition apparatus comprising:
- a semantic-relation storage unit that stores semantic relation among words and relevance ratio indicating degree of the semantic relation in association with each other;
  
  a first input accepting unit that accepts an input of a first speech;
  
  a first candidate producing unit that recognizes the first speech and produces first recognition candidates and first likelihood of the first recognition candidates;
  
  a first-candidate selecting unit that selects one of the first recognition candidates as a recognition result of the first speech based on the first likelihood of the first recognition candidates;
  
  a second input accepting unit that accepts an input of a second speech including an object word and a clue word, the object word being contained in the first recognition candidates and the clue word providing a clue for correcting the object word;
  
  a second candidate producing unit that recognizes the second speech and produces second recognition candidates and second likelihood of the second recognition candidates;
  
  a word extracting unit that extracts recognition candidates of the object word and recognition candidates of the clue word from the second recognition candidates;
  
  a second-candidate selecting unit that acquires the relevance ratio associated with the semantic relation between the extracted recognition candidates of the object word and the extracted recognition candidates of the clue word, from the semantic-relation storage unit, and selects one of the second recognition candidates as a recognition result of the second speech based on the acquired relevance ratio;
  
  a correction-portion identifying unit that compares the recognition result of the first speech with the recognition result of the second speech, and identifies a portion corresponding to the object word; and
  
  a correcting unit that corrects the identified portion corresponding to the object word.
  - 2. The speech recognition apparatus according to claim 1, wherein the recognition candidates of the object word include first words, the recognition candidates of the clue word include second words, and the second-candidate selecting unit selects a first word and a second word from the first words and the second words, respectively having the relevance ratio associated with the semantic relation between the first word and the second word being maximum, and selects the recognition result of the second speech that includes the selected first word and the selected second word.
  - 3. The speech recognition apparatus according to claim 1, further comprising:
    - a language model storage unit that stores therein language models that associate a connection relation among words with degree of the connection relation, wherein the second-candidate selecting unit further acquires the degree of the connection relation associated with the connection relation between the extracted recognition candidates of the object word and the extracted recognition candidates of the clue word, and selects the recognition result of the second speech based on the acquired degree of the connection relation and the relevance ratio.
  - 4. The speech recognition apparatus according to claim 1, wherein the second-candidate selecting unit selects the recognition result of the second speech based on the second likelihood of the second recognition candidates and the relevance ratio.
  - 5. The speech recognition apparatus according to claim 1, further comprising:
    - a word-dictionary storage unit that stores words and an appearance probability of the words associated with each other, wherein the second-candidate selecting unit further acquires the appearance probability associated with the recognition candidates of the object word, and selects the recognition result of the second speech based on the acquired appearance probability and the relevance ratio.
  - 6. The speech recognition apparatus according to claim 1, wherein the semantic-relation storage unit stores a hierarchical relation of semantic contents among the words and the relevance ratio associated with each other, and the second-candidate selecting unit acquires from the semantic-relation storage unit the relevance ratio associated with the hierarchical relation of semantic contents between the extracted recognition candidates of the object word and the extracted recognition candidates of the clue word, and selects the recognition result of the second speech based on the acquired relevance ratio.
  - 7. The speech recognition apparatus according to claim 1, wherein the semantic-relation storage unit stores at least one of synonym relation and quasi-synonym relation among words as the semantic relation associated with the relevance ratio.
  - 8. The speech recognition apparatus according to claim 1, wherein the semantic-relation storage unit stores a co-occurrence relation indicating that a plurality of words appear together and a co-occurrence probability indicating a probability that the co-occurrence relation appears, associated with each other, and the second-candidate selecting unit acquires from the semantic-relation storage unit the co-occurrence probability associated with the co-occurrence relation between the extracted recognition candidates of the object word and the extracted recognition candidates of the clue word, and selects the recognition result of the second speech based on the acquired co-occurrence probability.
  - 9. The speech recognition apparatus according to claim 1, wherein the correcting unit corrects the identified portion corresponding to the object word with the word selected by the second-candidate selecting unit to the recognition candidates of the object word.
  - 10. The speech recognition apparatus according to claim 1, wherein the correcting unit corrects the identified portion corresponding to the object word with the recognition result of the second speech selected by the second-candidate selecting unit.
  - 11. The speech recognition apparatus according to claim 1, further comprising:
    - a display unit that displays the recognition result of the first speech; and
      
      a correction-portion specifying unit that specifies a correction portion in the recognition result of the first speech displayed on the display unit, wherein the correction-portion identifying unit identifies a portion corresponding to the object word in the first speech from a predetermined range at least one of before and after the specified correction portion.
  - 12. The speech recognition apparatus according to claim 11, wherein the second input accepting unit accepts a speech input after the correction portion is specified as an input of the second speech.
  - 13. The speech recognition apparatus according to claim 1, wherein the first input accepting unit accepts a speech input when a first button is pressed as the first speech, and the second input accepting unit accepts a speech input when a second button is pressed as the second speech.
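
The second-candidate selection described in claims 1, 2 and 4 can be illustrated with a minimal sketch. Everything concrete here is an assumption made for illustration: the dictionary `SEMANTIC_RELATIONS`, the example words and numbers, the function names, and the weighted-sum scoring are hypothetical and are not taken from the patent, which does not prescribe a particular data structure or formula.

```python
from itertools import product

# Hypothetical semantic-relation store: (object word, clue word) -> relevance ratio.
# The words and values are invented for illustration only.
SEMANTIC_RELATIONS = {
    ("sun", "star"): 0.9,
    ("son", "family"): 0.8,
    ("sun", "family"): 0.1,
}


def relevance(object_word, clue_word):
    """Look up the relevance ratio; pairs not in the store score 0.0."""
    return SEMANTIC_RELATIONS.get((object_word, clue_word), 0.0)


def select_pair(object_candidates, clue_candidates, weight=0.5):
    """Pick the (object word, clue word) pair with the best combined score.

    Each candidate is a (word, likelihood) tuple. The score is an assumed
    weighted sum of the relevance ratio and the mean recognition likelihood.
    With weight=1.0 only the relevance ratio counts (the claim 2 case);
    lower weights also take the likelihood into account (the claim 4 case).
    """
    def score(pair):
        (obj, obj_likelihood), (clue, clue_likelihood) = pair
        mean_likelihood = (obj_likelihood + clue_likelihood) / 2
        return weight * relevance(obj, clue) + (1 - weight) * mean_likelihood

    (obj, _), (clue, _) = max(product(object_candidates, clue_candidates), key=score)
    return obj, clue


# Candidates extracted from a second speech such as "sun, as in star".
object_candidates = [("son", 0.6), ("sun", 0.4)]
clue_candidates = [("star", 0.7), ("stir", 0.3)]
print(select_pair(object_candidates, clue_candidates, weight=1.0))
```

With `weight=0.0` the sketch falls back to likelihood alone and would pick "son", which is the kind of misrecognition the clue word is meant to resolve.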

14. A speech recognition method comprising:
- accepting a first speech;
  
  recognizing the accepted first speech to produce first recognition candidates and first likelihood of the first recognition candidates;
  
  selecting one of the first recognition candidates produced for the first speech as the recognition result of the first speech based on the first likelihood of the first recognition candidates;
  
  accepting a second speech that includes an object word and a clue word, the object word being contained in the first recognition candidates and the clue word providing a clue for correcting the object word;
  
  recognizing the accepted second speech to produce second recognition candidates and second likelihood of the second recognition candidates;
  
  extracting recognition candidates of the object word and recognition candidates of the clue word from the produced second recognition candidates;
  
  acquiring a relevance ratio associated with the semantic relation between the extracted recognition candidates of the object word and the extracted recognition candidates of the clue word from a semantic-relation storage unit that stores therein semantic relation among words and relevance ratio indicating degree of the semantic relation in association with each other;
  
  selecting one of the second recognition candidates as the recognition result of the second speech based on the acquired relevance ratio;
  
  comparing the recognition result of the first speech with the recognition result of the second speech;
  
  identifying a portion corresponding to the object word in the first speech; and
  
  correcting the identified portion corresponding to the object word.
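
The last three steps of the method (comparing, identifying, correcting) can be sketched as follows. This is a simplified illustration under stated assumptions: the claim does not say how the two recognition results are compared, so spelling similarity from Python's `difflib.SequenceMatcher` is used here as a stand-in, and the function names and example sentence are hypothetical.

```python
from difflib import SequenceMatcher


def identify_portion(first_result, object_word):
    """Return the index of the word in first_result most similar to object_word.

    Assumption: character-level similarity stands in for whatever comparison
    an actual recognizer would use (for example, a phonetic comparison).
    """
    def similarity(word):
        return SequenceMatcher(None, word, object_word).ratio()

    return max(range(len(first_result)), key=lambda i: similarity(first_result[i]))


def correct(first_result, object_word):
    """Return a copy of first_result with the identified portion replaced."""
    index = identify_portion(first_result, object_word)
    corrected = list(first_result)
    corrected[index] = object_word
    return corrected


# First speech was recognized as "the son is bright"; the second speech
# yielded "sun" as the object word.
first_result = ["the", "son", "is", "bright"]
print(correct(first_result, "sun"))
```

The sketch replaces a single word; a portion spanning several words would need an alignment over word sequences instead of a per-word comparison.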

15. A computer program product having a computer readable medium including programmed instructions for recognizing speech, wherein the instructions, when executed by a computer, cause the computer to perform:
- accepting a first speech;
  
  recognizing the accepted first speech to produce first recognition candidates and first likelihood of the first recognition candidates;
  
  selecting one of the first recognition candidates produced for the first speech as the recognition result of the first speech based on the first likelihood of the first recognition candidates;
  
  accepting a second speech that includes an object word and a clue word, the object word being contained in the first recognition candidates and the clue word providing a clue for correcting the object word;
  
  recognizing the accepted second speech to produce second recognition candidates and second likelihood of the second recognition candidates;
  
  extracting recognition candidates of the object word and recognition candidates of the clue word from the produced second recognition candidates;
  
  acquiring a relevance ratio associated with the semantic relation between the extracted recognition candidates of the object word and the extracted recognition candidates of the clue word from a semantic-relation storage unit that stores therein semantic relation among words and relevance ratio indicating degree of the semantic relation in association with each other;
  
  selecting one of the second recognition candidates as the recognition result of the second speech based on the acquired relevance ratio;
  
  comparing the recognition result of the first speech with the recognition result of the second speech;
  
  identifying a portion corresponding to the object word in the first speech; and
  
  correcting the identified portion corresponding to the object word.

Current Assignee
Kabushiki Kaisha Toshiba (Toshiba Corporation), Toshiba Digital Solutions Corporation (Toshiba Corporation)
Original Assignee
Kabushiki Kaisha Toshiba (Toshiba Corporation)
Inventors
Sumita, Kazuo

Granted Patent

US 7,974,844 B2
US Class Current

704/240
CPC Class Codes

G10L 15/1815 Semantic context, e.g. disa...

G10L 15/22 Procedures used during a sp...
