Similar word discrimination method and its apparatus

US 6,038,531 A
Filed: 01/22/1998
Issued: 03/14/2000
Est. Priority Date: 01/30/1997
Status: Expired due to Term

Abstract

A method is provided that performs word recognition using the dynamic recurrent neural networks (DRNN) model and that discriminates, with high precision, between similar words that are often misrecognized. When words are spoken, the word detection signal output component uses the DRNN word model to generate the DRNN output corresponding to the input word vocal data, and the vocal data is encoded into code data using a code book. When the DRNN output from the word detection signal output component shows a level of correctness at or above a predetermined level, a processor establishes a fixed period that includes the characteristic components of the input words in the DRNN output. The processor then examines the code data in the established fixed period. The input words are discriminated from words that are similar to them on the basis of the examination results.
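
The period-establishing step in the abstract can be illustrated with a minimal sketch. The abstract does not say how the fixed period is positioned, so this sketch assumes a window of `lookback` frames ending at the first frame whose DRNN output reaches the threshold. The function name `establish_period` and its parameters are hypothetical, not taken from the patent.

```python
from typing import Optional, Sequence, Tuple


def establish_period(
    drnn_output: Sequence[float], threshold: float, lookback: int
) -> Optional[Tuple[int, int]]:
    """Return a half-open (start, end) frame range, or None if never triggered.

    Assumption: the period is a fixed-length window that ends at the first
    frame where the output is at or above the threshold.
    """
    for t, score in enumerate(drnn_output):
        if score >= threshold:
            start = max(0, t - lookback)  # clamp at the start of the utterance
            return (start, t + 1)
    return None
```

Returning `None` when the threshold is never reached mirrors the condition in the abstract that the period is established only when the output shows a sufficient level of correctness.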

33 Claims

1. A similar word discrimination method for discriminating words that may be misrecognized because of their similarity, comprising the steps of:
- receiving voice data of input words;
  
  using a learning voice model to obtain a specified output that shows a level of correctness in response to the voice data of the input words;
  
  processing the output to establish a specified period in which the characteristic components of the input words are included in the output, when the output shows a level of correctness of a predetermined amount or greater;
  
  examining the characteristics of the voice data of said input words during the specified period; and
  
  discriminating between the input words and words that are similar to the input words on the basis of the examination.

2. A similar word discrimination method for discriminating words that may be misrecognized because of their similarity, comprising the steps of:
- receiving voice data of input words;
  
  using a learning dynamic recurrent neural networks (DRNN) voice model to obtain a specified DRNN output showing a level of correctness in response to the voice data of input words;
  
  processing the DRNN output to establish a specified period in which the characteristic components of the input words are included in the DRNN output, when the DRNN output shows a level of correctness of a predetermined amount or greater;
  
  encoding the input word voice data into code data by using a code book;
  
  examining the characteristics of the code data of said input words during the specified period; and
  
  discriminating between the input words and words that are similar to the input words on the basis of the examination.
- - 3. The similar word discrimination method of claim 2, wherein the steps of examining the characteristics of code data and discriminating between the input words and words that are similar to the input words further comprise:
    - examining the code data corresponding to vowel sounds from among the code data in the established specified period; and
      
      discriminating between the input words and words that are similar to the input words depending upon the vowel sounds from among the code data.
  - 4. The similar word discrimination method of claim 2, further comprising the step of:
    - creating the code book from five vowel sounds.
  - 5. The similar word discrimination method of claim 2, wherein:
    - the DRNN voice model creates correspondence between groups of words having similar word groups and generates a DRNN output showing the level of correctness at a predetermined level or greater for each of the similar word groups of the group of words.
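
Claims 2 to 4 can be illustrated with a minimal sketch of code-book encoding followed by a vowel check inside the specified period. Everything concrete here is an assumption: the two-dimensional feature vectors, the centroid values, the vowel labels `a, i, u, e, o`, and the rule of picking the candidate whose distinguishing vowel is most frequent. The claims state only that the code book is created from five vowel sounds and that discrimination depends on the vowel codes.

```python
import math
from collections import Counter
from typing import Dict, List, Sequence, Tuple

# Hypothetical code book: one invented 2-D centroid per vowel.
CODEBOOK: Dict[str, Tuple[float, float]] = {
    "a": (0.9, 0.1),
    "i": (0.1, 0.9),
    "u": (0.2, 0.3),
    "e": (0.5, 0.8),
    "o": (0.6, 0.2),
}


def encode(frames: Sequence[Tuple[float, float]]) -> List[str]:
    """Vector-quantize each frame to the label of its nearest centroid."""
    return [
        min(CODEBOOK, key=lambda v: math.dist(CODEBOOK[v], frame))
        for frame in frames
    ]


def discriminate_by_vowel(
    codes: Sequence[str], period: Tuple[int, int], candidates: Dict[str, str]
) -> str:
    """Pick the candidate word whose distinguishing vowel dominates the period.

    `candidates` maps each similar word to the vowel assumed to tell it apart.
    """
    start, end = period
    counts = Counter(codes[start:end])  # missing vowels count as zero
    return max(candidates, key=lambda word: counts[candidates[word]])
```

A caller would encode the utterance once, then pass the period produced by the threshold step together with the group of similar words to be told apart.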

6. A similar word discrimination method for discriminating words that may be misrecognized because of their similarity, comprising the steps of:
- successively receiving voice data of input words from the speech of multiple speakers;
  
  using a learning dynamic recurrent neural networks (DRNN) voice model to obtain a specified DRNN output showing a level of correctness in response to the voice data of each input word;
  
  processing the DRNN output to establish a specified period in which the characteristic components of each input words are included in the DRNN output, when the DRNN output shows a level of correctness of a predetermined amount or greater;
  
  encoding each input word voice data into code data by using a code book;
  
  creating histogram data, including a code histogram, from the coded data that includes characteristics for the specified period of each input word;
  
  accumulating standard histogram data by storing histogram data for each input word;
  
  comparing the histogram data of each input word with the standard histogram data; and
  
  discriminating between the input words and words that are similar to the input words on the basis of the comparison.
- - 7. The similar word discrimination method of claim 6, wherein the steps of comparing the histogram data of each input words with the standard histogram data and discriminating between the input words and words that are similar to the input words further comprise:
    - standardizing the respective histograms and calculating the differential between the respective histograms, and
      
      discriminating the input words and the words that are similar to the input words based on the size of the differential.
  - 8. The similar word discrimination method of claim 6, wherein:
    - the DRNN voice model creates correspondence between groups of words having similar word groups and generates a DRNN output showing the level of correctness at a predetermined level or greater for each of the similar word groups of the group of words.
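
The histogram comparison of claims 6 and 7 can be sketched as follows. The claims do not define "standardizing" or "differential", so this sketch assumes normalization to unit sum and a sum of absolute bin differences, and it assumes that the word with the smallest differential is selected. Codes are taken to be integer indices, and all function names are hypothetical.

```python
from collections import Counter
from typing import Dict, List, Sequence


def code_histogram(codes: Sequence[int], n_codes: int) -> List[int]:
    """Count how often each code index occurs within the specified period."""
    counts = Counter(codes)
    return [counts[c] for c in range(n_codes)]


def accumulate(histograms: Sequence[Sequence[int]]) -> List[int]:
    """Sum per-speaker histograms bin by bin into one standard histogram."""
    return [sum(column) for column in zip(*histograms)]


def normalize(hist: Sequence[float]) -> List[float]:
    """Scale a histogram to unit sum (assumed meaning of 'standardizing')."""
    total = sum(hist)
    return [h / total for h in hist] if total else [0.0] * len(hist)


def differential(h1: Sequence[float], h2: Sequence[float]) -> float:
    """Sum of absolute differences between the normalized histograms."""
    return sum(abs(a - b) for a, b in zip(normalize(h1), normalize(h2)))


def discriminate(input_hist: Sequence[int], standards: Dict[str, Sequence[int]]) -> str:
    """Return the word whose standard histogram is closest to the input."""
    return min(standards, key=lambda w: differential(input_hist, standards[w]))
```

Normalizing before the comparison keeps the accumulated multi-speaker histogram and a single-utterance histogram on the same scale, which is one plausible reason for the standardizing step.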

9. A similar word discrimination method for discriminating words that may be misrecognized because of their similarity, comprising the steps of:
- receiving voice data of input words;
  
  creating a learning dynamic recurrent neural networks (DRNN) sub-voice model, that uses a DRNN voice model, to obtain a specified DRNN output for the characteristic components of respective similar words showing a level of correctness in response to the voice data of input words;
  
  processing the DRNN output to establish a specified period in which the characteristic components of the input words are included in the DRNN output, when the DRNN output shows a level of correctness of a predetermined amount or greater;
  
  examining the characteristics of the voice data of said input words during the specified period; and
  
  discriminating between the input words and words that are similar to the input words on the basis of the examination.
- - 10. The similar word discrimination method of claim 9, wherein:
    - the discrimination of the input words and the words that are similar to the input words is accomplished based on the value of the DRNN output which shows the level of correctness above a specified level in accordance with the DRNN sub-voice model.
  - 11. The similar word discrimination method of claim 9 wherein:
    - the DRNN voice model creates correspondence between groups of words having similar word groups and generates a DRNN output showing the level of correctness at a predetermined level or greater for each of the similar word groups of the group of words.
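
The sub-voice model comparison of claims 9 and 10 can be sketched in a few lines. The claims say only that discrimination is based on the value of the DRNN output from the sub-voice model. This sketch assumes one per-frame output sequence per similar word, takes the peak value inside the specified period, and accepts the best word only if its peak reaches the threshold. The function name and data layout are hypothetical.

```python
from typing import Dict, Optional, Sequence, Tuple


def discriminate_by_submodels(
    sub_outputs: Dict[str, Sequence[float]],
    period: Tuple[int, int],
    threshold: float,
) -> Optional[str]:
    """Return the similar word whose sub-model responds most strongly.

    Assumption: the peak output inside the half-open period [start, end)
    is the value compared across sub-models.
    """
    start, end = period
    peaks = {
        word: max(output[start:end], default=0.0)  # empty slice scores zero
        for word, output in sub_outputs.items()
    }
    best = max(peaks, key=peaks.get)
    return best if peaks[best] >= threshold else None
```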

12. A similar word discrimination apparatus for discriminating words that may be misrecognized because of their similarity having a learning voice model that performs recognition processing to obtain specified output showing a level of correctness in response to voice data of input words, comprising:
- a word detection signal output means that outputs the level of correctness above a predetermined level by means of the voice model that reacts to the vocal data of the input words, when there is vocal input of some words; and
  
  a processing means that, when the word detection signal output means generates an output showing a level of correctness above a predetermined level, establishes a specified period that includes characteristic components of the vocal data of the input words, examines the characteristics of the vocal data of the input words during the specified period, and performs discrimination of the input words and the words that are similar to the input words on the basis of the examination results.

13. A similar word discrimination apparatus for discriminating words that may be misrecognized because of their similarity having a learning dynamic recurrent neural networks (DRNN) voice model that performs recognition processing to obtain specified output showing a level of correctness in response to voice data of input words, comprising:
- a word detection signal output means that generates a DRNN output corresponding to the input word vocal data using the DRNN voice model, at the time of the vocal input of some words; and
  
  a codification means that codifies the input word vocal data using a code book; and
  
  a processing means that, when the word detection signal output means generates a DRNN output showing a level of correctness above a predetermined level, establishes a specified period that includes characteristic components of the vocal data of the input words, examines the data encoded by the codification means during the specified period, and performs discrimination of the input words and words that are similar to the input words on the basis of the examination results.
- - 14. The similar word discrimination apparatus of claim 13, wherein:
    - the code data is examined for vowel sounds among the code data during the specified period, and processing is performed that discriminates among the input words and the words that are similar to the input words on the basis of the examination results.
  - 15. The similar word discrimination apparatus of claim 13, wherein:
    - the code book is created from five vowels.
  - 16. The similar word discrimination apparatus of claim 13, wherein:
    - the DRNN voice model creates correspondence between groups of words having similar word groups and generates a DRNN output showing the level of correctness at a predetermined level or greater for each of the similar word groups of the group of words.

17. A similar word discrimination apparatus for discriminating words that may be misrecognized because of their similarity having a learning dynamic recurrent neural networks (DRNN) voice model that performs recognition processing to obtain specified output showing a level of correctness in response to voice data of input words, comprising:
- a word detection signal output means that generates a DRNN output corresponding to the input word vocal data using the DRNN voice model, at the time of the vocal input of some words; and
  
  a codification means that codifies the input word vocal data using a code book;
  
  a standard histogram storage means that preserves the histogram data created for each word from the code data during a specified period as standard histogram data, the histogram data includes the characteristic components of the respective similar words from among the code data obtained from the speech of multiple speakers for the respective similar words; and
  
  a processing means that, when the word detection signal output means generates a DRNN output showing a level of correctness above a predetermined level, establishes the specified period that includes characteristic components of the vocal data of the input words, creates a code histogram for the specified period using code data encoded by the codification means during the specified period, and performs discrimination of the input words and words that are similar to the input words by comparing the histogram data for each word with the standard histogram data.
- - 18. The similar word discrimination apparatus of claim 17, wherein:
    - the histogram data and the standard histogram data are standardized, a comparison is made between the histogram data created from the input words and the standard histogram data, the differential between the respective histogram data is computed, and discrimination of the input words and the words that are similar to the input words is made based on the size of the differential.
  - 19. The similar word discrimination apparatus of claim 17, wherein:
    - the DRNN voice model creates correspondence between groups of words having similar word groups and generates a DRNN output showing the level of correctness at a predetermined level or greater for each of the similar word groups of the group of words.

20. A similar word discrimination apparatus for discriminating words that may be misrecognized because of their similarity having a learning dynamic recurrent neural networks (DRNN) voice model that performs recognition processing to obtain specified output showing a level of correctness in response to voice data of input words, comprising:
- a DRNN sub-voice storage means for storing a learning DRNN sub-voice model that generates DRNN output showing a level of correctness for characteristic components of the respective similar words that may be misrecognized;
  
  a word detection signal output means that outputs the level of correctness at a predetermined level or greater from the DRNN voice model and from the DRNN sub-voice model in response to the voice data of the input words, when there is vocal input of some words; and
  
  a processing means that, when the word detection signal output means generates a DRNN output showing a level of correctness above a predetermined level, establishes a specified period that includes characteristic components of the vocal data of the input words, uses the DRNN sub-voice model to examine the DRNN output characteristics of the vocal data of the input words during the specified period, and performs discrimination of the input words and the words that are similar to the input words on the basis of the examination results.
- - 21. The similar word discrimination apparatus of claim 20, wherein:
    - the DRNN output is examined by the DRNN sub-voice model relating to the input words during the specified period, and on the basis of the examination results, the process for accomplishing discrimination of the input words and words that are similar to the input words by the DRNN sub-voice model compares the DRNN output to determine which value shows the level of correctness of a predetermined level or greater.
  - 22. The similar word discrimination apparatus of claim 20, wherein:
    - the DRNN voice model creates correspondence between groups of words having similar word groups and generates a DRNN output showing the level of correctness at a predetermined level or greater for each of the similar word groups of the group of words.

23. A similar word discrimination apparatus, comprising:
- means for receiving voice data of input words;
  
  means for using a learning voice model to obtain a specified output that shows a level of correctness in response to the voice data of the input words;
  
  means for processing the output to establish a specified period in which the characteristic components of the input words are included in the output, when the output shows a level of correctness of a predetermined amount or greater;
  
  means for examining the characteristics of the voice data of said input words during the specified period; and
  
  means for discriminating between the input words and words that are similar to the input words on the basis of the examination.

24. A similar word discrimination apparatus, comprising:
- means for receiving voice data of input words;
  
  means for using a learning dynamic recurrent neural networks (DRNN) voice model to obtain a specified DRNN output showing a level of correctness in response to the voice data of input words;
  
  means for processing the DRNN output to establish a specified period in which the characteristic components of the input words are included in the DRNN output, when the DRNN output shows a level of correctness of a predetermined amount or greater;
  
  means for encoding the input word voice data into code data by using a code book;
  
  means for examining the characteristics of the code data of said input words during the specified period; and
  
  means for discriminating between the input words and words that are similar to the input words on the basis of the examination.
- - 25. The similar word discrimination apparatus of claim 24, further comprising:
    - means for examining the code data corresponding to vowel sounds from among the code data in the established specified period; and
      
      means for discriminating between the input words and words that are similar to the input words depending upon the vowel sounds from among the code data.
  - 26. The similar word discrimination apparatus of claim 24, further comprising:
    - means for creating the code book from five vowel sounds.
  - 27. The similar word discrimination apparatus of claim 24, wherein:
    - the DRNN voice model creates correspondence between groups of words having similar word groups and generates a DRNN output showing the level of correctness at a predetermined level or greater for each of the similar word groups of the group of words.

28. A similar word discrimination apparatus, comprising:
- means for successively receiving voice data of input words from the speech of multiple speakers;
  
  means for using a learning dynamic recurrent neural networks (DRNN) voice model to obtain a specified DRNN output showing a level of correctness in response to the voice data of each input word;
  
  means for processing the DRNN output to establish a specified period in which the characteristic components of each input words are included in the DRNN output, when the DRNN output shows a level of correctness of a predetermined amount or greater;
  
  means for encoding each input word voice data into code data by using a code book;
  
  means for creating histogram data, including a code histogram, from the coded data that includes characteristics for the specified period of each input word;
  
  means for accumulating standard histogram data by storing histogram data for each input word;
  
  means for comparing the histogram data of each input word with the standard histogram data; and
  
  means for discriminating between the input words and words that are similar to the input words on the basis of the comparison.
- - 29. The similar word discrimination apparatus of claim 28, further comprising:
    - means for standardizing the respective histograms and calculating the differential between the respective histograms, and
      
      means for discriminating the input words and the words that are similar to the input words based on the size of the differential.
  - 30. The similar word discrimination apparatus of claim 28, wherein:
    - the DRNN voice model creates correspondence between groups of words having similar word groups and generates a DRNN output showing the level of correctness at a predetermined level or greater for each of the similar word groups of the group of words.

31. A similar word discrimination apparatus, comprising:
- means for receiving voice data of input words;
  
  means for creating a learning dynamic recurrent neural networks (DRNN) sub-voice model, that uses a DRNN voice model, to obtain a specified DRNN output for the characteristic components of respective similar words showing a level of correctness in response to the voice data of input words;
  
  means for processing the DRNN output to establish a specified period in which the characteristic components of the input words are included in the DRNN output, when the DRNN output shows a level of correctness of a predetermined amount or greater;
  
  means for examining the characteristics of the voice data of said input words during the specified period; and
  
  means for discriminating between the input words and words that are similar to the input words on the basis of the examination.
- - 32. The similar word discrimination apparatus of claim 31, wherein:
    - the discrimination of the input words and the words that are similar to the input words is accomplished based on the value of the DRNN output that shows the level of correctness above a predetermined level in accordance with the DRNN sub-voice model.
  - 33. The similar word discrimination apparatus of claim 31, wherein:
    - the DRNN voice model creates correspondence between groups of words having similar word groups and generates a DRNN output showing the level of correctness at a predetermined level or greater for each of the similar word groups of the group of words.


Current Assignee
Seiko Epson Corporation (Seiko Group)
Original Assignee
Seiko Epson Corporation (Seiko Group)
Inventors
Miyazawa, Yasunaga; Inazumi, Mitsuhiro; Hasegawa, Hiroshi; Aizawa, Tadashi
Primary Examiner(s)
Dorvil, Richemond

Application Number
US09/010,621
Time in Patent Office
782 Days
Field of Search
704/232, 704/256, 704/231, 704/241, 704/236, 704/255
US Class Current
704/232
CPC Class Codes
G10L 15/16 using artificial neural net...
