Automated word-form transformation and part of speech tag assignment
First Claim
Patent Images
1. A method of creating a data structure for use with a morphological algorithm, comprising:
- creating a data structure having a plurality of paths that maps a plurality of words into a set of classes;
modifying the data structure to remove a portion of one or more of the paths that is not necessary to unambiguously map the words to the set of classes; and
storing the data structure on a tangible computer readable medium.
2 Assignments
0 Petitions
Accused Products
Abstract
A method of creating a data structure for use with a morphological algorithm is discussed. The method includes creating a data structure having a plurality of paths. The data structure maps a plurality of words into a set of classes. The method further includes modifying the data structure to remove a portion of one or more of the paths that is not necessary to unambiguously map the words to the set of classes and storing the data structure on a tangible computer readable medium.
68 Citations
20 Claims
-
1. A method of creating a data structure for use with a morphological algorithm, comprising:
-
creating a data structure having a plurality of paths that maps a plurality of words into a set of classes; modifying the data structure to remove a portion of one or more of the paths that is not necessary to unambiguously map the words to the set of classes; and storing the data structure on a tangible computer readable medium. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. A method of providing morphological information to a computer implemented application, comprising:
-
receiving an input signal indicative of a word; selecting a piece of morphological class data from the finite state automaton-based data structure that is mapped to a location in the data structure associated with at least a portion of the word; and providing the piece of morphological class data to the application without accessing a dictionary. - View Dependent Claims (12, 13, 14, 15, 16, 17)
-
-
18. A tangible computer medium for storing a system adapted to perform automated morphological operations on an input, comprising:
-
a first finite state automaton-based data structure having a plurality of paths that maps a dictionary of words into a set of classes, wherein at least one of the paths is shorter than the word that is mapped to it; a second finite state automaton-based data structure having a plurality of paths that maps a dictionary of words into a set of classes; and an algorithm configured to access at least one of the first and second data structures to retrieve data related to the sets of classes. - View Dependent Claims (19, 20)
-
Specification