Means for resolving ambiguities in text based upon character context
First Claim
Patent Images
1. A method for processing a reference sequence of occurrences of elements, having an order, comprising the steps of:
- assigning each of said elements to a plurality of groups of elements, said groups of elements being formed into a plurality of sets of groups of elements such that an element is assigned to at most one group belonging to a given set, said groups of elements being formed such that all elements contained in a group share a common contextual property such that for each selected set of said plurality of sets of groups, each selected group within said selected set of said plurality of sets of groups is distinguishable from the groups other than said selected group within said selected set of said plurality of sets of groups based on the order of occurrences of elements in said reference sequence of occurrences of elements;
selecting a set of positive integer values;
defining, for each selected set of groups of elements, and for each positive integer value N in said set of positive integer values, an N-gram table, each table containing one or more entries, each entry associated with a specified sequence of N groups of elements contained in said selected set;
examining said reference sequence of occurrences of elements in order to determine the number of occurrences of each of said specified sequence of N groups of elements; and
associating with each entry of each of said N-gram tables a value related to the number of occurrences of the specified sequences of groups of elements associated with said entry.
8 Assignments
0 Petitions
Accused Products
Abstract
A method of identifying an object within a set of object candidates includes the steps of:
calculating the probability of occurrence of each member of a set of string candidates, wherein each string candidate contains one member of the set of object candidates, the calculating employing formulae using a method of groups and projections; and
identifying one of the objects based on the calculated probability.
225 Citations
38 Claims
-
1. A method for processing a reference sequence of occurrences of elements, having an order, comprising the steps of:
-
assigning each of said elements to a plurality of groups of elements, said groups of elements being formed into a plurality of sets of groups of elements such that an element is assigned to at most one group belonging to a given set, said groups of elements being formed such that all elements contained in a group share a common contextual property such that for each selected set of said plurality of sets of groups, each selected group within said selected set of said plurality of sets of groups is distinguishable from the groups other than said selected group within said selected set of said plurality of sets of groups based on the order of occurrences of elements in said reference sequence of occurrences of elements; selecting a set of positive integer values; defining, for each selected set of groups of elements, and for each positive integer value N in said set of positive integer values, an N-gram table, each table containing one or more entries, each entry associated with a specified sequence of N groups of elements contained in said selected set; examining said reference sequence of occurrences of elements in order to determine the number of occurrences of each of said specified sequence of N groups of elements; and associating with each entry of each of said N-gram tables a value related to the number of occurrences of the specified sequences of groups of elements associated with said entry. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 34)
-
-
33. A method as in claim 64 wherein one of said selected sets includes a group containing all letters.
-
35. A method for processing a reference sequence of occurrences of elements comprising the steps of:
-
assigning each of said elements to one of a plurality of groups of elements, said groups of elements being formed such that all elements contained in a group share a common contextual property, all elements having the contextual property associated with a group are contained in that group, and no element is assigned to more than one group; selecting a positive integer N; defining an N-gram table containing one or more entries, each entry associated with a specified sequence of groups of elements; examining said reference sequence of occurrences of elements in order to determine the number of occurrences of each of said specified sequences of groups of elements; and associating with each entry of said table a first value indicating there are few occurrences of the specified sequence of groups of elements associated with said entry in the reference sequence of occurrences of elements and otherwise associating a second value. - View Dependent Claims (36, 37, 38)
-
Specification