PRONUNCIATION VARIATION RULE EXTRACTION APPARATUS, PRONUNCIATION VARIATION RULE EXTRACTION METHOD, AND PRONUNCIATION VARIATION RULE EXTRACTION PROGRAM

US 20100268535A1
Filed: 11/27/2008
Published: 10/21/2010
Est. Priority Date: 12/18/2007
Status: Active Grant

Abstract

A problem to be solved is to robustly detect a pronunciation variation example and acquire a pronunciation variation rule having a high generalization property, with less effort. The problem can be solved by a pronunciation variation rule extraction apparatus including a speech data storage unit, a base form pronunciation storage unit, a sub word language model generation unit, a speech recognition unit, and a difference extraction unit. The speech data storage unit stores speech data. The base form pronunciation storage unit stores base form pronunciation data representing base form pronunciation of the speech data. The sub word language model generation unit generates a sub word language model from the base form pronunciation data. The speech recognition unit recognizes the speech data by using the sub word language model. The difference extraction unit extracts a difference between a recognition result outputted from the speech recognition unit and the base form pronunciation data by comparing the recognition result and the base form pronunciation data.
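
To make the sub word language model generation step concrete, here is a minimal sketch that builds a bigram model over phoneme symbols from base form pronunciation strings. The bigram order, the add-one smoothing, the space-separated symbol format, and the names `train_bigram_lm` and `log_prob` are assumptions made for illustration; the abstract does not specify any of them.

```python
import math
from collections import Counter

BOS, EOS = "<s>", "</s>"  # hypothetical sentence boundary symbols


def train_bigram_lm(pronunciations):
    """Count bigrams over space-separated sub word symbols (e.g. phonemes)."""
    context_counts, bigram_counts = Counter(), Counter()
    vocab = {EOS}
    for line in pronunciations:
        symbols = [BOS] + line.split() + [EOS]
        vocab.update(symbols[1:])
        for prev, cur in zip(symbols, symbols[1:]):
            context_counts[prev] += 1
            bigram_counts[(prev, cur)] += 1
    return context_counts, bigram_counts, vocab


def log_prob(model, symbols):
    """Log probability of a symbol sequence under add-one smoothed bigrams."""
    context_counts, bigram_counts, vocab = model
    seq = [BOS] + list(symbols) + [EOS]
    total = 0.0
    for prev, cur in zip(seq, seq[1:]):
        numerator = bigram_counts[(prev, cur)] + 1
        denominator = context_counts[prev] + len(vocab)
        total += math.log(numerator / denominator)
    return total


# Toy base form pronunciations (invented for this example).
model = train_bigram_lm(["k o N n i ch i w a", "k o N b a N w a"])
```

A sequence that follows the base form data scores higher than the same symbols in an unseen order, which is the constraint the speech recognition unit applies when it uses this model.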

103 Citations

36 Claims

1-20. (canceled)

21. A pronunciation variation rule extraction apparatus comprising:
- a speech data storage unit for storing speech data;
  
  a surface form pronunciation storage unit for storing surface form pronunciation data representing surface form pronunciation of said speech data;
  
  a sub word language model generation unit for generating a sub word language model from said surface form pronunciation data;
  
  a speech recognition unit for recognizing said speech data by using said sub word language model;
  
  a difference extraction unit for extracting a difference between a recognition result outputted from said speech recognition unit and said surface form pronunciation data by comparing said recognition result and said surface form pronunciation data; and
  
  a language model weight control unit for controlling a weight value of said sub word language model,
  
  wherein said language model weight control unit outputs a plurality of weight values,
  
  said speech recognition unit recognizes said speech data for each of said plurality of weight values, and
  
  said language model weight control unit determines based on said difference at a time when said difference is extracted whether said weight value should be updated or not.
- Dependent claims (22, 23, 24, 25, 26, 27):
  - 22. The pronunciation variation rule extraction apparatus according to claim 21, wherein when said difference is smaller than a predetermined threshold, said language model weight control unit updates said weight value such that said weight value is decreased.
  - 23. The pronunciation variation rule extraction apparatus according to claim 21, wherein when said difference is larger than a predetermined threshold, said language model weight control unit updates said weight value such that said weight value is increased.
  - 24. The pronunciation variation rule extraction apparatus according to claim 21, wherein said difference extraction unit calculates said difference as an editing distance between said recognition result and said surface form pronunciation data.
  - 25. The pronunciation variation rule extraction apparatus according to claim 21, wherein said difference extraction unit extracts as said difference, a pronunciation variation example including a letter string pair of different portions between said recognition result and said surface form pronunciation data and a weight value of said sub word language model received by said speech recognition unit from said language model weight control unit at a time of obtaining said recognition result.
  - 26. The pronunciation variation rule extraction apparatus according to claim 25, further comprising a pronunciation variation probability estimation unit for generating a probability rule of pronunciation variation from said pronunciation variation example.
  - 27. The pronunciation variation rule extraction apparatus according to claim 26, wherein said pronunciation variation probability estimation unit generates, based on a magnitude of a weight value of said sub word language model at a time of observation of a pronunciation variation example, said probability rule of said pronunciation variation such that said pronunciation variation example has a high appearance probability.
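
The weight control behaviour recited in claims 21 to 23 can be sketched as a loop: recognize with the current weight, measure the difference against the stored pronunciation data, then lower the weight when the difference is small and raise it when the difference is large. Everything concrete below is an assumption for illustration only: the function name `control_weight`, the two separate thresholds `lower` and `upper`, the fixed `step`, the stopping rule, and the toy recognizer and distance that stand in for a real speech recognizer.

```python
def control_weight(recognize, distance, reference, weight=1.0, step=0.25,
                   lower=1, upper=3, max_rounds=10):
    """Recognize repeatedly, updating the language model weight from the difference.

    recognize: hypothetical callable mapping a weight to a recognized symbol list.
    distance:  hypothetical callable measuring the difference between two lists.
    Returns the list of (weight, hypothesis, difference) tuples that were tried.
    """
    history = []
    for _ in range(max_rounds):
        hypothesis = recognize(weight)
        diff = distance(hypothesis, reference)
        history.append((weight, hypothesis, diff))
        if diff < lower:
            # Difference too small: relax the language model constraint.
            weight = max(0.0, weight - step)
        elif diff > upper:
            # Difference too large: tighten the language model constraint.
            weight = weight + step
        else:
            break  # Difference is in the accepted band: no update needed.
    return history


# Toy stand-ins (invented): a strong weight reproduces the stored
# pronunciation, a weaker weight lets variation through.
REFERENCE = "a r i g a t o o".split()


def toy_recognize(weight):
    if weight >= 1.0:
        return list(REFERENCE)
    if weight >= 0.5:
        return "a r i g a t o".split()
    return "a i a t o".split()


def toy_distance(a, b):
    # Crude positional mismatch count, not a true edit distance.
    return sum(x != y for x, y in zip(a, b)) + abs(len(a) - len(b))
```

Starting from a weight of 1.0, the toy recognizer returns the stored pronunciation unchanged, so the loop lowers the weight once and stops at 0.75, where a single difference appears.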

28. A pronunciation variation rule extraction method comprising:
- storing surface form pronunciation data representing surface form pronunciation of speech data;
  
  generating a sub word language model from said surface form pronunciation data;
  
  recognizing said speech data by using said sub word language model;
  
  extracting a difference between a recognition result of said recognizing and said surface form pronunciation data by comparing said recognition result and said surface form pronunciation data; and
  
  controlling a weight value of said sub word language model,
  
  wherein said controlling includes outputting a plurality of weight values,
  
  said recognizing includes recognizing said speech data for each of said plurality of weight values, and
  
  said controlling further includes determining based on said difference at a time when said difference is extracted whether said weight value should be updated or not.
- Dependent claims (29, 30, 31, 32):
  - 29. The pronunciation variation rule extraction method according to claim 28, wherein said controlling further includes updating said weight value, when said difference is smaller than a predetermined threshold, such that said weight value is decreased.
  - 30. The pronunciation variation rule extraction method according to claim 28, wherein said controlling further includes updating said weight value, when said difference is larger than a predetermined threshold, such that said weight value is increased.
  - 31. The pronunciation variation rule extraction method according to claim 28, wherein said extracting includes:
    - calculating said difference as an editing distance between said recognition result and said surface form pronunciation data; and
      
      extracting as said difference, a pronunciation variation example including a letter string pair of different portions between said recognition result and said surface form pronunciation data and a weight value of said sub word language model received at a time of obtaining said recognition result.
  - 32. The pronunciation variation rule extraction method according to claim 31, further comprising generating a probability rule of pronunciation variation from said pronunciation variation example, wherein said generating said probability rule includes generating, based on a magnitude of a weight value of said sub word language model at a time of observation of a pronunciation variation example, said probability rule of said pronunciation variation such that said pronunciation variation example has a high appearance probability.
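
The extracting step of claim 31 can be illustrated with a standard Levenshtein alignment: compute the editing distance between the stored pronunciation and the recognition result, then collect each run of differing symbols as a string pair tagged with the weight in use. This is a sketch under assumptions: unit edit costs, space-separated symbols, and the names `align` and `variation_examples` are not taken from the claims.

```python
def align(reference, hypothesis):
    """Return (edit distance, aligned symbol pairs); None marks a gap."""
    n, m = len(reference), len(hypothesis)
    dist = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        dist[i][0] = i
    for j in range(1, m + 1):
        dist[0][j] = j
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            sub = dist[i - 1][j - 1] + (reference[i - 1] != hypothesis[j - 1])
            dist[i][j] = min(sub, dist[i - 1][j] + 1, dist[i][j - 1] + 1)
    # Trace back from the bottom-right corner to recover one optimal alignment.
    pairs, i, j = [], n, m
    while i > 0 or j > 0:
        if (i > 0 and j > 0 and dist[i][j] ==
                dist[i - 1][j - 1] + (reference[i - 1] != hypothesis[j - 1])):
            pairs.append((reference[i - 1], hypothesis[j - 1]))
            i, j = i - 1, j - 1
        elif i > 0 and dist[i][j] == dist[i - 1][j] + 1:
            pairs.append((reference[i - 1], None))   # symbol missing in result
            i -= 1
        else:
            pairs.append((None, hypothesis[j - 1]))  # extra symbol in result
            j -= 1
    pairs.reverse()
    return dist[n][m], pairs


def variation_examples(reference, hypothesis, weight):
    """Group consecutive mismatches into (stored, recognized, weight) examples."""
    distance, pairs = align(reference, hypothesis)
    examples, ref_seg, hyp_seg = [], [], []
    for ref_sym, hyp_sym in pairs + [("", "")]:  # sentinel flushes the last run
        if ref_sym == hyp_sym:
            if ref_seg or hyp_seg:
                examples.append((" ".join(ref_seg), " ".join(hyp_seg), weight))
                ref_seg, hyp_seg = [], []
        else:
            if ref_sym is not None:
                ref_seg.append(ref_sym)
            if hyp_sym is not None:
                hyp_seg.append(hyp_sym)
    return distance, examples
```

For the invented pair `a r i g a t o o` (stored) and `a r i a t o` (recognized), this yields a distance of 2 and two examples, one for the dropped `g` and one for the dropped `o`, each carrying the weight that was active during recognition.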

33. A computer-readable recording medium which records a pronunciation variation rule extraction program which causes a computer to function as:
- a speech data storage unit for storing speech data;
  
  a surface form pronunciation storage unit for storing surface form pronunciation data representing surface form pronunciation of said speech data;
  
  a sub word language model generation unit for generating a sub word language model from said surface form pronunciation data;
  
  a speech recognition unit for recognizing said speech data by using said sub word language model;
  
  a difference extraction unit for extracting a difference between a recognition result outputted from said speech recognition unit and said surface form pronunciation data by comparing said recognition result and said surface form pronunciation data; and
  
  a language model weight control unit for controlling a weight value of said sub word language model,
  
  wherein said language model weight control unit outputs a plurality of weight values,
  
  said speech recognition unit recognizes said speech data for each of said plurality of weight values, and
  
  said language model weight control unit determines based on said difference at a time when said difference is extracted whether said weight value should be updated or not.
- Dependent claims (34, 35, 36):
  - 34. The computer-readable recording medium according to claim 33, wherein said language model weight control unit updates said weight value such that said weight value is decreased when said difference is smaller than a predetermined threshold.
  - 35. The computer-readable recording medium according to claim 33, wherein said language model weight control unit updates said weight value such that said weight value is increased when said difference is larger than a predetermined threshold.
  - 36. The pronunciation variation rule extraction program according to claim 33, wherein said difference extraction unit calculates said difference as an editing distance between said recognition result and said surface form pronunciation data, and extracts as said difference, a pronunciation variation example including a letter string pair of different portions between said recognition result and said surface form pronunciation data and a weight value of said sub word language model received by said speech recognition unit from said language model weight control unit at a time of obtaining said recognition result, further comprising a pronunciation variation probability estimation unit for generating a probability rule of pronunciation variation from said pronunciation variation example, wherein said pronunciation variation probability estimation unit generates, based on a magnitude of a weight value of said sub word language model at a time of observation of a pronunciation variation example, said probability rule of said pronunciation variation such that said pronunciation variation example has a high appearance probability.
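
One possible reading of the probability estimation in claims 27, 32, and 36 is a weighted relative frequency: each pronunciation variation example contributes its language model weight as a count, so an example observed under a larger weight ends up with a higher probability. The claims state only that the rule is generated based on the magnitude of the weight; the direction of the weighting, the normalization over variants of the same stored string, and the name `estimate_rules` are assumptions of this sketch.

```python
from collections import defaultdict


def estimate_rules(examples):
    """Turn (stored, recognized, weight) examples into {stored: {recognized: prob}}.

    Each example adds its weight to the mass of its (stored, recognized) pair,
    and the mass is normalized over all variants seen for that stored string.
    """
    mass = defaultdict(lambda: defaultdict(float))
    for stored, recognized, weight in examples:
        mass[stored][recognized] += weight
    rules = {}
    for stored, variants in mass.items():
        total = sum(variants.values())
        if total > 0:  # skip strings whose examples all carried zero weight
            rules[stored] = {v: w / total for v, w in variants.items()}
    return rules


# Invented examples: "g" was dropped twice and replaced by "k" once.
rules = estimate_rules([
    ("g", "", 1.0),
    ("g", "k", 0.5),
    ("g", "", 0.5),
    ("o", "", 0.25),
])
```

Here the deletion of `g` receives probability 0.75 and the substitution by `k` receives 0.25, because the deletion was observed more often and under larger weights.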

Current Assignee: NEC Corporation
Original Assignee: NEC Corporation
Inventors: Koshinaka, Takafumi
Granted Patent: US 8,595,004 B2
Field of Search
US Class Current

704/236
CPC Class Codes

G10L 15/06 Creation of reference templ...

G10L 15/187 Phonemic context, e.g. pron...
