Database annotation and retrieval
First Claim
1. An apparatus for determining a sequence of sub-word unit labels representative of at least two word alternatives output by a word recognition unit in response to a common input word to be recognised, the apparatus comprising:
- a receiver operable to receive a first sequence of sub-word unit labels representative of a first one of said at least two word alternatives output by said word recognition unit and operable to receive a second sequence of sub-word unit labels representative of a second one of said at least two word alternatives output by said word recognition unit;
an aligner operable to align and to compare sub-word unit labels of the first sequence with sub-word unit labels of the second sequence to form a number of aligned pairs of sub-word unit labels to determine a best aligned pair of sub-word unit labels; and
a processor operable to process the best aligned pair of sub-word unit labels formed by said aligner, to determine an output sequence of sub-word unit labels which is different from the first and second sequences of sub-word unit labels and which is representative of the at least two word alternatives,wherein said processor is operable to determine said output sequence of sub-word unit labels by determining, for each aligned pair of sub-word unit labels, a sub-word unit label that is confusingly similar to the first and second sub-word unit labels of the aligned pair.
1 Assignment
0 Petitions
Accused Products
Abstract
A system is provided for determining a sequence of sub-word units representative of at least two words output by a word recognition unit in response to an input word to be recognized. In a preferred embodiment, the word alternatives output by the recognition unit are converted into sequences of phonemes. An optimum alignment between these sequences is then determined using a dynamic programming alignment technique. The sequence of phonemes representative of the input sequences is then determined using this optimum alignment.
-
Citations
61 Claims
-
1. An apparatus for determining a sequence of sub-word unit labels representative of at least two word alternatives output by a word recognition unit in response to a common input word to be recognised, the apparatus comprising:
-
a receiver operable to receive a first sequence of sub-word unit labels representative of a first one of said at least two word alternatives output by said word recognition unit and operable to receive a second sequence of sub-word unit labels representative of a second one of said at least two word alternatives output by said word recognition unit; an aligner operable to align and to compare sub-word unit labels of the first sequence with sub-word unit labels of the second sequence to form a number of aligned pairs of sub-word unit labels to determine a best aligned pair of sub-word unit labels; and a processor operable to process the best aligned pair of sub-word unit labels formed by said aligner, to determine an output sequence of sub-word unit labels which is different from the first and second sequences of sub-word unit labels and which is representative of the at least two word alternatives, wherein said processor is operable to determine said output sequence of sub-word unit labels by determining, for each aligned pair of sub-word unit labels, a sub-word unit label that is confusingly similar to the first and second sub-word unit labels of the aligned pair. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23)
-
-
24. An apparatus for determining a sequence of sub-word unit labels representative of at least two words, the apparatus comprising:
-
a receiver operable to receive a first sequence of sub-word unit labels representative of a first word and a second sequence of sub-word unit labels representative of a second word output by a word recognition unit in response to a common input word; an aligner operable to align sub-word unit labels of the first sequence with sub-word unit labels of the second sequence to form a number of aligned pairs of sub-word unit labels to determine a best aligned pair of sub-word unit labels; and a processor operable to process the best aligned pair of sub-word unit labels formed by said aligner to determine an output sequence of sub-word unit labels which is different from the first and second sequences of sub-word unit labels and which is representative of the first and second sequences of sub-word unit labels by determining, for each aligned pair of sub-word unit labels, a sub-word unit label that is confusingly similar to the sub-word unit labels of the aligned pair.
-
-
25. An apparatus for determining a sequence of sub-word unit labels representative of at least two word alternatives output by a word recognition unit in response to a common input word to be recognised, the apparatus comprising:
-
a receiver operable to receive the at least word alternatives output by the word recognition unit; a generator operable to generate, for each received word alternative, a sequence of sub-word unit labels representative of the word alternative; an aligner operable to align and compare the sub-word unit labels of each generated sequence of sub-word unit labels to form a number aligned groups of sub-word unit labels to determine a best aligned group of sub-word unit labels; and a processor operable to process the best aligned group of sub-word unit labels formed by said aligner, to determine an output sequence of sub-word unit labels which is different from the first and second sequences of sub-word unit labels and which is representative of the received word alternatives, wherein said processor is operable to determine said output sequence of sub-word unit labels by determining, for each aligned group of sub-word unit labels, a sub-word unit label that is confusingly similar to the first and second sub-word unit labels of the aligned pair.
-
-
26. An apparatus for determining a sequence of sub-word unit labels representative of at least two word alternatives output by a word recognition unit in response to a common input word to be recognised, the apparatus comprising:
-
a receiver operable to receive the at least two word alternatives output by the word recognition unit; a generator operable to generate, for each received word alternative, a sequence of sub-word unit labels representative of the received word alternative; an aligner operable to align and compare the sub-word unit labels of each generated sequence of sub-word unit labels to form a number aligned groups of sub-word unit labels to determine a best aligned group of sub-word unit labels; and a processor operable to process the best aligned group of sub-word unit labels formed by said aligner, to determine an output sequence of sub-word unit labels which is different from the first and second sequences of sub-word unit labels and which is representative of the at least two word alternatives by determining, for each aligned group of sub-word unit labels, a sub-word unit label that is confusingly similar to the sub-word unit labels of the group.
-
-
27. An apparatus for determining a sequence of sub-word unit labels representative of at least two word alternatives output by a word recognition unit in response to a common input word to be recognised, the apparatus comprising:
-
a receiver operable to receive a first sequence of sub-word unit labels representative of a first one of said at least two word alternatives output by said word recognition unit and operable to receive a second sequence of sub-word unit labels representative of a second one of said at least two word alternatives output by said word recognition unit; an aligner operable to align and compare sub-word unit labels of the first sequence with sub-word unit labels of the second sequence to form a number of aligned pairs of sub-word unit labels; a first comparator operable to compare, for each aligned pair, the first sequence sub-word unit label in the aligned pair with each of a plurality of sub-word unit labels taken from a set of predetermined sub-word unit labels, to provide a corresponding plurality of comparison scores representative of the similarities between the first sequence sub-word unit label and the respective sub-word unit labels of the set; a second comparator operable to compare, for each aligned pair, the second sequence sub-word unit label in the aligned pair with each of said plurality of sub-word unit labels from the set, to provide a further corresponding plurality of comparison scores representative of the similarities between said second sequence sub-word unit label and the respective sub-word unit labels of the set; a combiner operable to combine the comparison scores obtained when comparing the first and second sequence sub-word unit labels in the aligned pair with the same sub-word unit label from the set, to generate a plurality of combined comparison scores; a third comparator operable to compare, for each aligned pair, the combined comparison scores generated by said combiner for the aligned pair; and a determiner operable to determine, for each aligned pair of sub-word unit labels, a sub-word unit label representative of the sub-word unit labels in the aligned pair in dependence upon a comparison result output by said third comparator for the aligned pair.
-
-
28. A method of determining a sequence of sub-word unit labels representative of at least two word alternatives output by a word recognition unit in response to a common input word to be recognised, the method comprising:
-
receiving a first sequence of sub-word unit labels representative of a first one of said at least two word alternatives output by said word recognition unit; receiving a second sequence of sub-word unit labels representative of a second one of said at least two word alternatives output by said word recognition unit; aligning and comparing sub-word unit labels of the first sequence with sub-word unit labels of the second sequence to form a number of aligned pairs of sub-word unit labels to determine a best aligned pair of sub-word unit labels; and processing the best aligned pair of sub-word unit labels formed in said aligning step, to determine an output sequence of sub-word unit labels which is different from the first and second sequences of sub-word unit labels and which is representative of the at least two word alternatives, wherein said processing step determines said output sequence of sub-word unit labels by determining, for each aligned pair of sub-word unit labels, a sub-word unit label that is confusingly similar to the first and second sub-word unit labels of the aligned pair. - View Dependent Claims (29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50)
-
-
51. A method of determining a sequence of sub-word unit labels representative of at least two words output by a word recognition unit in response to a common input word to be recognised, the method comprising:
-
receiving a first sequence of sub-word unit labels representative of a first word of said at least two words; receiving a second sequence of sub-word unit labels representative of a second word of said at least two word; aligning sub-word unit labels of the first sequence with sub-word unit labels of the second sequence to form a number of aligned pairs of sub-word unit labels to determine a best aligned pair of sub-word unit labels; and processing the best aligned pair of sub-word unit labels formed in said aligning step, to determine an output sequence of sub-word unit labels which is different from the first and second sequences of sub-word unit labels and which is representative of the first and second sequences of sub-word unit labels by determining, for each aligned pair of sub-word unit labels, a sub-word unit label that is confusingly similar to the sub-word unit labels of the aligned pair.
-
-
52. A method of determining a sequence of sub-word unit labels representative of at least two word alternatives output by a word recognition unit in response to a common input word to be recognised, the method comprising:
-
receiving the at least two word alternatives output by the word recognition unit; generating, for each received word alternative, a sequence of sub-word unit labels representative of the word alternative; aligning and comparing the sub-word unit labels of each generated sequence of sub-word unit labels to form a number of aligned groups of sub-word unit labels to determine a best aligned group of sub-word unit labels; and processing the best aligned group of sub-word unit labels formed in said aligning and comparing step to determine an output sequence of sub-word unit labels which is different from the first and second sequences of sub-word unit labels and which is representative of the received word alternatives, wherein said processing step determines said output sequence of sub-word unit labels by determining, for each aligned group of sub-word unit labels, a sub-word unit label that is confusingly similar to the first and second sub-word unit labels of the aligned pair.
-
-
53. A method of determining a sequence of sub-word unit labels representative of at least two word alternatives output by a word recognition unit in response to a common input word to be recognised, the method comprising:
-
receiving the at least two word alternatives output by the word recognition unit; generating, for each received word alternative, a sequence of sub-word unit labels representative of the word alternative; aligning and comparing the sub-word unit labels of each generated sequence of sub-word unit labels to form a number of aligned groups of sub-word unit labels to determine a best aligned group of sub-word unit labels; and processing the best aligned group of sub-word labels formed in said aligning and comparing step to determine an output sequence of sub-word unit labels which is different from the first and second sequences of sub-word unit labels and which is representative of the at least two word alternatives by determining, for each aligned group, a sub-word unit label that is confusingly similar to the sub-word unit labels of the group.
-
-
54. A method of determining a sequence of sub-word unit labels representative of at least two word alternatives output by a word recognition unit in response to a common input word to be recognised, the method comprising:
-
receiving a first sequence of sub-word unit labels representative of a first one of said at least two word alternatives output by said word recognition unit; receiving a second sequence of sub-word unit labels representative of a second one of said at least two word alternative output by said word recognition unit; aligning and comparing sub-word unit labels of the first sequence with sub-word unit labels of the second sequence to form a number of aligned pairs of sub-word unit labels; a first comparing step of comparing, for each aligned pair, the first sequence sub-word unit label in the aligned pair with each of a plurality of sub-word unit labels taken from a set of predetermined sub-word unit labels, to provide a corresponding plurality of comparison scores representative of the similarities between the first sequence sub-word unit label and the respective sub-word unit labels of the set; a second comparing step of comparing, for each aligned pair, the second sequence sub-word unit label in the aligned pair with each of said plurality of sub-word unit labels from the set, to provide a further corresponding plurality of comparison scores representative of the similarities between said second sequence sub-word unit label and the respective sub-word unit labels of the set; combining the comparison scores obtained when comparing the first and second sequence sub-word unit labels in the aligned pair with the same sub-word unit label from the set, to generate a plurality of combined comparison scores; a third comparing step of comparing, for each aligned pair, the combined comparison scores generated in said combining step for the aligned pair; and determining, for each aligned pair, a sub-word unit representative of the sub-word unit labels in the aligned pair in dependence upon a comparison result output from said third comparing step for the aligned pair.
-
-
55. A computer readable medium storing computer executable process steps to perform a method of determining a sequence of sub-word unit labels representative of at least two word alternatives output by a word recognition unit in response to a common input word to be recognised, the process steps comprising:
-
steps for receiving a first sequence of sub-word unit labels representative of a first one of said at least two word alternatives output by said word recognition unit; steps for receiving a second sequence of sub-word unit labels representative of a second one of said at least two word alternatives output by the word recognition unit; steps for aligning and comparing sub-word unit labels of the first sequence with sub-word unit labels of the second sequence to form a number of aligned pairs of sub-word unit labels to determine a best aligned pair of sub-word unit labels; and steps for processing the best aligned pair of sub-word unit labels formed in said aligning and comparing step to determine an output sequence of sub-word unit labels which is different from the first and second sequences of sub-word unit labels and which is representative of the at least two word alternatives, wherein said steps for processing determines said output sequence of sub-word unit labels by determining, for each aligned pair of sub-word unit labels, a sub-word unit label that is confusingly similar to the first and second sub-word unit labels of the aligned pair.
-
-
56. A computer executable program stored on a computer readable medium, the computer executable program for controlling a processor to perform a method of determining a sequence of sub-word unit labels representative of at least two word alternatives output by a word recognition unit in response to a common input word to be recognised, the program comprising:
-
code for receiving a first sequence of sub-word unit labels representative of a first one of said at least two word alternatives output by the word recognition unit; code for receiving a second sequence of sub-word unit labels representative of a second one of said at least two word alternative output by the word recognition unit; code for aligning and comparing sub-word unit labels of the first sequence with sub-word unit labels of the second sequence to form a number of aligned pairs of sub-word unit labels to determine a best aligned pair of sub-word unit labels, and code for processing the best aligned pair of sub-word unit labels formed by said aligning and comparing code to determine an output sequence of sub-word unit labels which is different from the first and second sequences of sub-word unit labels and which is representative of the at least two word alternatives, wherein said code for processing determines said output sequence of sub-word unit labels by determining, for each aligned pair of sub-word unit labels, a sub-word unit label that is confusingly similar to the first and second sub-word unit labels of the aligned pair.
-
-
57. An apparatus for determining a sequence of sub-word unit labels representative of at least two word alternatives output by a word recognition unit in response to a common input word to be recognised, the apparatus comprising:
-
means for receiving a first sequence of sub-word unit labels representative of a first one of said at least two word alternatives output by said word recognition unit and for receiving a second sequence of sub-word unit labels representative of a second one of said at least two word alternatives output by said word recognition unit; means for aligning and for comparing sub-word unit labels of the first sequence with sub-word labels of the second sequence to form a number of aligned pairs of sub-word unit labels to determine a best aligned pair of sub-word unit labels; and means for processing the best aligned pair of sub-word unit labels formed by said aligning means to determine an output sequence of sub-word unit labels which is different from the first and second sequences of sub-word unit labels and which is representative of the least two word alternatives, wherein said means for processing determines said output sequence of sub-word unit labels by determining, for each aligned pair of sub-word unit labels, a sub-word unit label that is confusingly similar to the first and second sub-word unit labels of the aligned pair.
-
-
58. An apparatus for determining a sequence of sub-word unit labels representative of at least two words output by a word recognition unit in response to a common input word to be recognised, the apparatus comprising:
-
means for receiving a first sequence of sub-word unit labels representative of a first word and for receiving a second sequence of sub-word unit labels representative of a second word; means for aligning sub-word unit labels of the first sequence with sub-word unit labels of the second sequence to form a number aligned pairs of sub-word unit labels to determine a best aligned pair of sub-word unit labels; and means for processing the best aligned pair of sub-word unit labels formed by said aligning means to determine an output sequence of sub-word unit labels which is different from the first and second sequences of sub-word unit labels and which is representative of the first and second sequence of sub-word unit labels by determining, for each aligned pair of sub-word unit labels, a sub-word unit label that is confusingly similar to the sub-word unit labels of the aligned pair.
-
-
59. An apparatus for determining a sequence of sub-word unit labels representative of at least two word alternatives output by a word recognition unit in response to a common input word to be recognised, the apparatus comprising:
-
means for receiving the at least two word alternatives output by the word recognition unit; means for generating for each received word alternative, a sequence of sub-word unit labels representative of the word alternative; means for aligning and comparing the sub-word unit labels of each generated sequence of sub-word unit labels to form a number aligned groups of sub-word unit labels to determine a best aligned group of sub-word unit labels; and means for processing the best aligned group of sub-word unit labels formed by said aligning means, to determine an output sequence of sub-word unit labels which is different from the first and second sequences of sub-word unit labels and which is representative of the received word alternatives, wherein said means for processing determines said output sequence of sub-word unit labels by determining, for each aligned group of sub-word unit labels, a sub-word unit label that is confusingly similar to the first and second sub-word unit labels of the aligned pair.
-
-
60. An apparatus for determining a sequence of sub-word unit labels representative of at least two word alternatives output by a word recognition unit in response to a common input word to be recognised, the apparatus comprising:
-
means for receiving the at least two word alternatives output by the word recognition unit; means for generating, for each received word alternative, a sequence of sub-word unit labels representative of the received word alternative; means for aligning and comparing the sub-word unit labels of each generated sequence of sub-word unit labels to form a number aligned groups of sub-word unit labels to determine a best aligned group of sub-word unit labels; and means for processing the best aligned group of sub-word unit labels formed by said aligning means to determine an output sequence of sub-word unit labels which is different from the first and second sequences of sub-word unit labels and which is representative of the at least two word alternatives by determining, for each aligned group of sub-word unit labels, a sub-word unit label that is confusingly similar to the sub-word unit labels of the group.
-
-
61. An apparatus for determining a sequence of sub-word unit labels representative of at least two word alternatives output by a word recognition unit in response to a common input word to be recognised, the apparatus comprising:
-
means for receiving a first sequence of sub-word unit labels representative of a first one of said at least two word alternatives output by said word recognition unit and for receiving a second sequence of sub-word unit labels representative of a second one of said at least two word alternatives output by said word recognition unit; means for aligning and for comparing sub-word unit labels of the first sequence with sub-word unit labels of the second sequence to form a number of aligned pairs of sub-word unit labels; first comparing means for comparing, for each aligned pair, the first sequence sub-word unit label in the aligned pair with each of a plurality of sub-word unit labels taken from a set of predetermined sub-word unit labels, to provide a corresponding plurality of comparison scores representative of the similarities between the first sequence sub-word unit label and the respective sub-word unit labels of the set; second comparing means for comparing, for each aligned pair, the second sequence sub-word unit label in the aligned pair with each of said plurality of sub-word unit labels from the set, to provide a further corresponding plurality of comparison scores representative of the similarities between said second sequence sub-word unit label and their respective sub-word unit labels of the set; means for combining the comparison scores obtained when comparing the first and second sequence sub-word unit labels in the aligned pair with the same sub-word unit labels from the set, to generate a plurality of combined comparison scores; third comparing means for comparing, for each aligned pair, the combined comparison scores generated by said combining means for the aligned pair; and means for determining, for each aligned pair of sub-word unit labels, a sub-word unit label representative of the sub-word unit labels in the aligned pair in dependence upon a comparison result output by said third comparing means for the aligned pair.
-
Specification