KEYWORD CLASSIFICATION AND DETERMINATION IN LANGUAGE MODELLING
First Claim
1. A computer-implemented method for defining a keyword class vector, comprising:
- determining a set of seed keywords from a set of keywords;
determining first and second most similar keywords from the set of seed keywords; and
determining a class vector from first and second keyword vectors associated with the first and second most similar keywords.
1 Assignment
0 Petitions
Accused Products
Abstract
A computer-implemented method and apparatus defines a keyword class vector. A set of seed keywords is determined from a set of keywords and first and second most similar keywords from the set of seed keywords are then determined. A class vector is determined from first and second keyword vectors associated with the first and second most similar keywords. The method and apparatus also classifies a keyword in a keyword class. A similarity for a keyword vector associated with the keyword is determined with reference to a plurality of class vectors, each class vector having an associated class and determines a most similar class vector of the plurality of class vectors from the similarity determination. The keyword is then classified in a most similar class associated with the most similar class vector.
57 Citations
28 Claims
-
1. A computer-implemented method for defining a keyword class vector, comprising:
-
determining a set of seed keywords from a set of keywords; determining first and second most similar keywords from the set of seed keywords; and determining a class vector from first and second keyword vectors associated with the first and second most similar keywords. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. A computer-implemented method for classifying a keyword in a keyword class, the method comprising:
-
determining a similarity for a keyword vector associated with the keyword with reference to a plurality of class vectors, each class vector having an associated class; determining a most similar class vector of the plurality of class vectors from the similarity determination; and classifying the keyword in a most similar class associated with the most similar class vector. - View Dependent Claims (12, 13, 14, 15)
-
-
16. A computer-implemented method for determining a keyword in a set of words, the method comprising:
-
assigning a distance parameter for a first word in the word set, the distance parameter designating a first word distance from the word set; parsing a document for an occurrence of the first word in the document; upon identification of an occurrence of the first word in the document, modifying the distance parameter; and upon determination the modified distance parameter satisfies a threshold criterion, designating the word as a keyword. - View Dependent Claims (17, 18, 19, 20)
-
-
21. Apparatus for defining a keyword class vector, the apparatus being configured to:
-
determine a set of seed keywords from a set of keywords; determine first and second most similar keywords from the set of seed keywords; and determine a class vector from first and second keyword vectors associated with the first and second most similar keywords.
-
-
22. Apparatus for classifying a keyword in a keyword class, the apparatus being configured to:
-
determine a similarity for a keyword vector associated with the keyword with reference to a plurality of class vectors, each class vector having an associated class; determine a most similar class vector of the plurality of class vectors from the similarity determination; and classifying the keyword in a most similar class associated with the most similar class vector.
-
-
23. Apparatus for determining a keyword in a set of words, the apparatus being configured to:
-
assign a distance parameter for a first word in the word set, the distance parameter designating a first word distance from the word set; parse a document for an occurrence of the first word in the document; upon identification of an occurrence of the first word in the document, modify the distance parameter; and upon determination the modified distance parameter satisfies a threshold criterion, designate the word as a keyword.
-
-
24. (canceled)
-
25. A computer program product having computer code stored thereon for defining a keyword class, the computer code being configured to:
-
determine a set of seed keywords from a set of keywords; determine first and second most similar keywords from the set of seed keywords; and determine a class vector from first and second keyword vectors associated with the first and second most similar keywords.
-
-
26. A computer program product having computer code stored thereon for classifying a keyword in a keyword class, the computer code being configured to:
-
determine a similarity for a keyword vector associated with the keyword with reference to a plurality of class vectors, each class vector having an associated class; determine a most similar class vector of the plurality of class vectors from the similarity determination; and
classifying the keyword in a most similar class associated with the most similar class vector.
-
-
27. A computer program product having computer code stored thereon for classifying a keyword in a keyword class, the computer code being configured to:
-
assign a distance parameter for a first word in the word set, the distance parameter designating a first word distance from the word set; parse a document for an occurrence of the first word in the document; upon identification of an occurrence of the first word in the document, modify the distance parameter; and upon determination the modified distance parameter satisfies a threshold criterion, designate the word as a keyword.
-
-
28. (canceled)
Specification