Character and phoneme recognition based on probability clustering
First Claim
1. A method comprising steps of:
- obtaining a set of probability values for a character candidate, the set of probability values including a respective probability value for each of a set of character types, each character type'"'"'s probability value indicating a probability that the character candidate is an instance of the character type; and
using the set of probability values to identify which of a number of character probability clusters is nearest to the candidate;
the character probability clusters being regions of a space;
the space having a respective dimension for each of the character types;
the character candidate'"'"'s respective probability value for each character type indicating the character candidate'"'"'s position in the character type'"'"'s dimension so that the set of probability values define the character candidate'"'"'s position in the space.
4 Assignments
0 Petitions
Accused Products
Abstract
Prior to character or phoneme recognition, a classifier provides a respective probability list for each of a sequence of sample characters or phonemes, each probability list indicating the respective sample'"'"'s probability for each character or phoneme type. These probability lists are clustered in character or phoneme probability space, in which each dimension corresponds to the probability that a character or phoneme candidate is an instance of a specific character or phoneme type. For each resulting cluster, data is stored indicating its cluster ID and a probability list indicating the probability of each type at the cluster'"'"'s center. Then, during recognition, a probability cluster identifier compares the probability list for each candidate with the probability list for each cluster to find the nearest cluster. The cluster identifier then provides the nearest cluster'"'"'s cluster ID to a constraint satisfier that attempts to recognize the candidate based on rules, patterns, or a combination of rules and patterns. If necessary, the constraint satisfier uses the cluster ID to retrieve the stored probability list of the cluster to assist it in recognition.
277 Citations
26 Claims
-
1. A method comprising steps of:
-
obtaining a set of probability values for a character candidate, the set of probability values including a respective probability value for each of a set of character types, each character type'"'"'s probability value indicating a probability that the character candidate is an instance of the character type; and using the set of probability values to identify which of a number of character probability clusters is nearest to the candidate;
the character probability clusters being regions of a space;
the space having a respective dimension for each of the character types;
the character candidate'"'"'s respective probability value for each character type indicating the character candidate'"'"'s position in the character type'"'"'s dimension so that the set of probability values define the character candidate'"'"'s position in the space. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A system comprising:
-
a classifier for providing a set of probability values for a character candidate;
the set of probability values including a respective probability value for each of a set of character types, each character type'"'"'s probability value indicating a probability that the character candidate is an instance of the character type;a probability cluster identifier for using the set of probability values to produce data identifying which of a number of character probability clusters is nearest to the character candidate;
the character probability clusters being regions of a space;
the space having a respective dimension for each of the character types;
the character candidate'"'"'s respective probability value for each character type indicating the character candidate'"'"'s position in the character type'"'"'s dimension so that the set of probability values define the character candidate'"'"'s position in the space; anda constraint satisfier for providing data identifying a character type of the character candidate based on the cluster identifying data from the probability cluster identifier. - View Dependent Claims (9, 10, 11, 12, 13)
-
-
14. A method comprising steps of:
-
obtaining a set of probability values for a phoneme candidate, the set of probability values including a respective probability value for each of a set of phoneme types, each phoneme type'"'"'s probability value indicating a respective probability that the phoneme candidate is an instance of the phoneme type; and using the set of probability values to identify which of a number of phoneme probability clusters is nearest to the candidate;
the phoneme probability clusters being regions of a space;
the space having a respective dimension for each of the phoneme types;
the phoneme candidate'"'"'s respective probability value for each phoneme type indicating the phoneme candidate'"'"'s position in the phoneme type'"'"'s dimension so that the set of probability values define the phoneme candidate'"'"'s position in the space. - View Dependent Claims (15, 16, 17, 18, 19, 20)
-
-
21. A system comprising:
-
a classifier for providing a set of probability values for a phoneme candidate;
the set of probability values including a respective probability value for each of a set of phoneme types, each phoneme type'"'"'s probability value indicating a probability that the phoneme candidate is an instance of the phoneme type;a probability cluster identifier for using the set of probability values to produce data identifying which of a number of phoneme probability clusters is nearest to the phoneme candidate;
the phoneme probability clusters being regions of a space;
the space having a respective dimension for each of the phoneme types;
the phoneme candidate'"'"'s respective probability value for each phoneme type indicating the phoneme candidate'"'"'s position in the phoneme type'"'"'s dimension so that the set of probability values define the phoneme candidate'"'"'s position in the space; anda constraint satisfier for providing data identifying a phoneme type of the phoneme candidate based on the cluster identifying data from the probability cluster identifier. - View Dependent Claims (22, 23, 24, 25, 26)
-
Specification