Database annotation and retrieval
Abstract
A system is provided for determining a sequence of sub-word units representative of at least two words output by a word recognition unit in response to an input word to be recognized. In a preferred embodiment, the word alternatives output by the recognition unit are converted into sequences of phonemes. An optimum alignment between these sequences is then determined using a dynamic programming alignment technique. The sequence of phonemes representative of the input sequences is then determined using this optimum alignment.
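The dynamic programming alignment mentioned in the abstract can be illustrated with a minimal sketch. The function name `align`, the unit substitution and gap costs, and the example phoneme labels are assumptions made for illustration only; the abstract does not state a cost function, and the technique described in the specification may differ.

```python
from typing import List, Optional, Tuple

# An aligned pair; None marks a position where one sequence has no counterpart.
Pair = Tuple[Optional[str], Optional[str]]


def align(first: List[str], second: List[str],
          sub_cost: float = 1.0, gap_cost: float = 1.0) -> List[Pair]:
    """Align two phoneme sequences with a minimum-cost dynamic programme.

    The unit costs are illustrative assumptions, not values from the source.
    """
    n, m = len(first), len(second)
    # cost[i][j] is the minimum cost of aligning first[:i] with second[:j].
    cost = [[0.0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        cost[i][0] = cost[i - 1][0] + gap_cost
    for j in range(1, m + 1):
        cost[0][j] = cost[0][j - 1] + gap_cost
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            step = 0.0 if first[i - 1] == second[j - 1] else sub_cost
            cost[i][j] = min(cost[i - 1][j - 1] + step,   # pair the two units
                             cost[i - 1][j] + gap_cost,   # unit only in first
                             cost[i][j - 1] + gap_cost)   # unit only in second

    # Trace back from the end of both sequences to recover the aligned pairs.
    pairs: List[Pair] = []
    i, j = n, m
    while i > 0 or j > 0:
        if i > 0 and j > 0:
            step = 0.0 if first[i - 1] == second[j - 1] else sub_cost
            if cost[i][j] == cost[i - 1][j - 1] + step:
                pairs.append((first[i - 1], second[j - 1]))
                i, j = i - 1, j - 1
                continue
        if i > 0 and (j == 0 or cost[i][j] == cost[i - 1][j] + gap_cost):
            pairs.append((first[i - 1], None))
            i -= 1
        else:
            pairs.append((None, second[j - 1]))
            j -= 1
    pairs.reverse()
    return pairs
```

For example, `align(["k", "ae", "t"], ["k", "ah", "t"])` pairs `ae` with `ah` and leaves the other two units matched with themselves.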
58 Claims
1. An apparatus for determining a sequence of sub-word units representative of at least two words output by a word recognition unit in response to a common input word to be recognised, the apparatus comprising:
means for receiving a first sequence of sub-word units representative of a first one of said at least two words and for receiving a second sequence of sub-word units representative of a second one of said at least two words;
means for aligning and for comparing sub-word units of the first sequence with sub-word units of the second sequence to form a number of aligned pairs of sub-word units; and
means for determining a sequence of sub-word units representative of the received words in dependence upon the aligned pairs of sub-word units determined by said aligning and comparing means.
Dependent claims: 2-24, 30-52
25. An apparatus for determining a sequence of sub-word units representative of at least two words, the apparatus comprising:
means for receiving a first sequence of sub-word units representative of a first word and for receiving a second sequence of sub-word units representative of a second word;
means for aligning sub-word units of the first sequence with sub-word units of the second sequence to form a number of aligned pairs of sub-word units; and
means for determining a sequence of sub-word units representative of the first and second sequences of sub-word units by determining, for each aligned pair of sub-word units, a sub-word unit that is confusingly similar to the first and second sub-word units of the aligned pair.
26. An apparatus for determining a sequence of sub-word units representative of at least two words output by a word recognition unit in response to a common input word to be recognised, the apparatus comprising:
means for receiving the words output by the word recognition unit;
means for generating a sequence of sub-word units representative of each of the received words;
means for aligning and comparing the sub-word units from each generated sequence of sub-word units to identify a number of aligned groups of sub-word units; and
means for determining a sequence of sub-word units representative of the received words in dependence upon the aligned groups of sub-word units determined by said aligning and comparing means.
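The generating and grouping elements recited above can be sketched as follows. This is only an illustration under stated assumptions: `LEXICON` is a hypothetical pronunciation dictionary, `to_subword_units` and `aligned_groups` are hypothetical helper names, and the grouping uses Python's standard `difflib.SequenceMatcher` as a convenient stand-in rather than the dynamic programming alignment named in the abstract. The sketch handles two sequences only, whereas the claim covers each generated sequence.

```python
from difflib import SequenceMatcher
from itertools import zip_longest
from typing import Dict, List, Optional, Tuple

# Hypothetical pronunciation dictionary; entries are illustrative only.
LEXICON: Dict[str, List[str]] = {
    "cat": ["k", "ae", "t"],
    "cut": ["k", "ah", "t"],
    "cats": ["k", "ae", "t", "s"],
}

Group = Tuple[Optional[str], Optional[str]]


def to_subword_units(words: List[str]) -> List[List[str]]:
    """Generate one phoneme sequence per received word by dictionary lookup."""
    return [LEXICON[word] for word in words]


def aligned_groups(first: List[str], second: List[str]) -> List[Group]:
    """Group units of two sequences position by position.

    SequenceMatcher finds matching runs; the units between matching runs are
    paired in order, and None pads whichever side runs out first.
    """
    groups: List[Group] = []
    matcher = SequenceMatcher(None, first, second, autojunk=False)
    for _tag, i1, i2, j1, j2 in matcher.get_opcodes():
        groups.extend(zip_longest(first[i1:i2], second[j1:j2]))
    return groups


# Two word alternatives, as a recogniser might output for one spoken word.
sequences = to_subword_units(["cat", "cut"])
groups = aligned_groups(sequences[0], sequences[1])
```

With these hypothetical entries, the alternatives "cat" and "cut" yield three groups, the middle one holding the differing vowels.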
27. An apparatus for determining a sequence of sub-word units representative of at least two words output by a word recognition unit in response to a common input word to be recognised, the apparatus comprising:
means for receiving the words output by the word recognition unit;
means for generating a sequence of sub-word units representative of each of the received words;
means for aligning and comparing the sub-word units from each generated sequence of sub-word units to identify a number of aligned groups of sub-word units; and
means for determining a sequence of sub-word units representative of the first and second sequences of sub-word units by determining, for each of the sub-word units in an aligned group, a sub-word unit that is confusingly similar to the sub-word units of the group.
28. An apparatus for determining a sequence of sub-word units representative of at least two words output by a word recognition unit in response to a common input word to be recognised, the apparatus comprising:
means for receiving a first sequence of sub-word units representative of a first one of said at least two words and for receiving a second sequence of sub-word units representative of a second one of said at least two words;
means for aligning and for comparing sub-word units of the first sequence with sub-word units of the second sequence to form a number of aligned pairs of sub-word units;
first comparing means for comparing, for each aligned pair, the first sequence sub-word unit in the aligned pair with each of a plurality of sub-word units taken from a set of predetermined sub-word units, to provide a corresponding plurality of comparison scores representative of the similarities between the first sequence sub-word unit and the respective sub-word units of the set;
second comparing means for comparing, for each aligned pair, the second sequence sub-word unit in the aligned pair with each of said plurality of sub-word units from the set, to provide a further corresponding plurality of comparison scores representative of the similarities between said second sequence sub-word unit and the respective sub-word units of the set;
means for combining the comparison scores obtained when comparing the first and second sequence sub-word units in the aligned pair with the same sub-word unit from the set, to generate a plurality of combined comparison scores;
third comparing means for comparing, for each aligned pair, the combined comparison scores generated by said combining means for the aligned pair; and
means for determining, for each aligned pair of sub-word units, a sub-word unit representative of the sub-word units in the aligned pair in dependence upon a comparison result output by said third comparing means for the aligned pair.
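The three comparing stages and the combining stage recited above can be sketched for a single aligned pair as follows. The unit set, the similarity values in `SIMILARITY`, the use of log scores combined by addition, and the selection of the highest combined score are all assumptions made for illustration; the claim itself does not fix any of them. Pairs containing a gap are not handled here.

```python
import math
from typing import Dict, List, Tuple

# Hypothetical set of predetermined sub-word units.
UNIT_SET: List[str] = ["ae", "ah", "eh"]

# Hypothetical similarity values: SIMILARITY[x][p] is how similar an
# observed unit x is to the candidate unit p. The numbers are invented.
SIMILARITY: Dict[str, Dict[str, float]] = {
    "ae": {"ae": 0.7, "ah": 0.2, "eh": 0.1},
    "ah": {"ae": 0.3, "ah": 0.6, "eh": 0.1},
    "eh": {"ae": 0.2, "ah": 0.1, "eh": 0.7},
}


def representative_unit(first: str, second: str) -> str:
    """Pick one unit to stand for both units of an aligned pair."""
    # First comparison: first-sequence unit against every unit in the set.
    first_scores = {p: math.log(SIMILARITY[first][p]) for p in UNIT_SET}
    # Second comparison: second-sequence unit against the same units.
    second_scores = {p: math.log(SIMILARITY[second][p]) for p in UNIT_SET}
    # Combine the two scores obtained for the same candidate unit.
    combined = {p: first_scores[p] + second_scores[p] for p in UNIT_SET}
    # Third comparison: compare the combined scores and keep the best one.
    return max(combined, key=combined.get)


def representative_sequence(pairs: List[Tuple[str, str]]) -> List[str]:
    """Apply representative_unit to every aligned pair in order."""
    return [representative_unit(a, b) for a, b in pairs]
```

With the invented values above, the pair (`ae`, `ah`) resolves to `ae`, because the combined score for `ae` (0.7 × 0.3) exceeds that for `ah` (0.2 × 0.6) and for `eh` (0.1 × 0.1).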
29. A method of determining a sequence of sub-word units representative of at least two words output by a word recognition unit in response to a common input word to be recognised, the method comprising:
receiving a first sequence of sub-word units representative of a first one of said at least two words;
receiving a second sequence of sub-word units representative of a second one of said at least two words;
aligning and comparing sub-word units of the first sequence with sub-word units of the second sequence to form a number of aligned pairs of sub-word units; and
determining a sequence of sub-word units representative of the received sequences of sub-word units in dependence upon the aligned pairs of sub-word units determined in said aligning and comparing step.
53. A method of determining a sequence of sub-word units representative of at least two words, the method comprising the steps of:
receiving a first sequence of sub-word units representative of a first word;
receiving a second sequence of sub-word units representative of a second word;
aligning sub-word units of the first sequence with sub-word units of the second sequence to form a number of aligned pairs of sub-word units; and
determining a sequence of sub-word units representative of the first and second sequences of sub-word units by determining, for each aligned pair of sub-word units, a sub-word unit that is confusingly similar to the first and second sub-word units of the aligned pair.
54. A method of determining a sequence of sub-word units representative of at least two words output by a word recognition unit in response to a common input word to be recognised, the method comprising the steps of:
receiving the words output by the word recognition unit;
generating a sequence of sub-word units representative of each of the received words;
aligning and comparing the sub-word units from each generated sequence of sub-word units to identify a number of aligned groups of sub-word units; and
determining a sequence of sub-word units representative of the received words in dependence upon the aligned groups of sub-word units determined in said aligning and comparing step.
55. A method of determining a sequence of sub-word units representative of at least two words output by a word recognition unit in response to a common input word to be recognised, the method comprising the steps of:
receiving the words output by the word recognition unit;
generating a sequence of sub-word units representative of each of the received words;
aligning and comparing the sub-word units from each generated sequence of sub-word units to identify a number of aligned groups of sub-word units; and
determining a sequence of sub-word units representative of the first and second sequences of sub-word units by determining, for each of the sub-word units in an aligned group, a sub-word unit that is confusingly similar to the sub-word units of the group.
56. A method of determining a sequence of sub-word units representative of at least two words output by a word recognition unit in response to a common input word to be recognised, the method comprising:
receiving a first sequence of sub-word units representative of a first one of said at least two words;
receiving a second sequence of sub-word units representative of a second one of said at least two words;
aligning and comparing sub-word units of the first sequence with sub-word units of the second sequence to form a number of aligned pairs of sub-word units;
a first comparing step of comparing, for each aligned pair, the first sequence sub-word unit in the aligned pair with each of a plurality of sub-word units taken from a set of predetermined sub-word units, to provide a corresponding plurality of comparison scores representative of the similarities between the first sequence sub-word unit and the respective sub-word units of the set;
a second comparing step of comparing, for each aligned pair, the second sequence sub-word unit in the aligned pair with each of said plurality of sub-word units from the set, to provide a further corresponding plurality of comparison scores representative of the similarities between said second sequence sub-word unit and the respective sub-word units of the set;
combining the comparison scores obtained when comparing the first and second sequence sub-word units in the aligned pair with the same sub-word unit from the set, to generate a plurality of combined comparison scores;
a third comparing step of comparing, for each aligned pair, the combined comparison scores generated in said combining step for the aligned pair; and
determining, for each aligned pair, a sub-word unit representative of the sub-word units in the aligned pair in dependence upon a comparison result output from said third comparing step for the aligned pair.
57. A computer readable medium storing computer executable process steps to perform a method of determining a sequence of sub-word units representative of at least two words output by a word recognition unit in response to a common input word to be recognised, the process steps comprising the steps of:
receiving a first sequence of sub-word units representative of a first one of said at least two words;
receiving a second sequence of sub-word units representative of a second one of said at least two words;
aligning and comparing sub-word units of the first sequence with sub-word units of the second sequence to form a number of aligned pairs of sub-word units; and
determining a sequence of sub-word units representative of the received sequences of sub-word units in dependence upon the aligned pairs of sub-word units determined in said aligning and comparing step.
58. A computer executable program for controlling a processor to perform a method of determining a sequence of sub-word units representative of at least two words output by a word recognition unit in response to a common input word to be recognised, the program comprising:
code for receiving a first sequence of sub-word units representative of a first one of said at least two words;
code for receiving a second sequence of sub-word units representative of a second one of said at least two words;
code for aligning and comparing sub-word units of the first sequence with sub-word units of the second sequence to form a number of aligned pairs of sub-word units; and
code for determining a sequence of sub-word units representative of the received sequences of sub-word units in dependence upon the aligned pairs of sub-word units determined in said aligning and comparing step.
Specification