System and method for performing analysis on word variants
First Claim
1. A computer-readable medium having stored thereon a first lexicon data structure for each of a plurality of lexicon words, the first lexicon data structure comprising:
- a host form variant field containing data representing a host form variant;
a host form word field containing data representing a host form word for the host form variant represented by the data of the host form variant field; and
a verification field containing data representing a property of the host form variant represented by the data of the host form variant field, the property being indicative of whether the host form variant is itself a valid word or whether the host form variant must be combined with another entry in the lexicon.
3 Assignments
0 Petitions
Accused Products
Abstract
A computer-readable medium stores a first lexicon data structure for lexicon words. The first data structure includes a host form variant field containing a host form variant such as a clitic host form variant, a host form field containing the host form of the host form variant (only present if the forms differ) such as a clitic host verbal form, and a verification field indicative of whether the host form variant is a valid word. The first data structure also includes a segment association field containing data or segmentation bits associating the host form variant with certain types of attachment entries in the lexicon, which also contain data or segmentation bits, to define valid combinations between the host form variant and at least one of the attachment entries in the lexicon. A second lexicon data structure for each of the attachment entries in the lexicon is also stored.
-
Citations
21 Claims
-
1. A computer-readable medium having stored thereon a first lexicon data structure for each of a plurality of lexicon words, the first lexicon data structure comprising:
-
a host form variant field containing data representing a host form variant;
a host form word field containing data representing a host form word for the host form variant represented by the data of the host form variant field; and
a verification field containing data representing a property of the host form variant represented by the data of the host form variant field, the property being indicative of whether the host form variant is itself a valid word or whether the host form variant must be combined with another entry in the lexicon. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14)
-
-
15. A method of annotating verb-clitic form segments in a lexicon, the method comprising:
defining, for a segment, final segment data indicative of whether the segment must appear in a final position of any verb-clitic words formed using the segment. - View Dependent Claims (16, 17, 18, 19)
-
20. A method of combining first and second verb-clitic form segments from a lexicon to form a verb-clitic word, the method comprising:
-
determining whether absence of final segment data associated with the first verb-clitic form segment indicates that the first verb-clitic form segment cannot be a final segment of the verb-clitic word;
determining whether final segment data associated with the second verb-clitic form segment indicates that the second verb-clitic form segment must be the final segment of the verb-clitic word; and
combining the first and second verb-clitic form segments from the lexicon to form the verb-clitic word only if it is determined that the first verb-clitic form segment cannot be the final segment and that the second verb-clitic form segment must be the final segment. - View Dependent Claims (21)
-
Specification