Method and system of ranking and clustering for document indexing and retrieval
First Claim
1. One or more computer readable media storing computer executable instructions to perform a method for vectorizing a set of document predicate structures, the method comprising:
- identifying at least one predicate and argument in said set of document predicate structures by a predicate key that is an integer representation;
estimating conceptual nearness of two of said document predicate structures in said set of document predicate structures by subtracting corresponding ones of said predicate keys; and
outputting at least one document based upon the estimated conceptual nearness.
5 Assignments
0 Petitions
Accused Products
Abstract
A relevancy ranking and clustering method and system that determines the relevance of a document relative to a user'"'"'s query using a similarity comparison process. Input queries are parsed into one or more query predicate structures using an ontological parser. The ontological parser parses a set of known documents to generate one or more document predicate structures. A comparison of each query predicate structure with each document predicate structure is performed to determine a matching degree, represented by a real number. A multilevel modifier strategy is implemented to assign different relevance values to the different parts of each predicate structure match to calculate the predicate structure'"'"'s matching degree. The relevance of a document to a user'"'"'s query is determined by calculating a similarity coefficient, based on the structures of each pair of query predicates and document predicates. Documents are autonomously clustered using a self-organizing neural network that provides a coordinate system that makes judgments in a non-subjective fashion.
-
Citations
19 Claims
-
1. One or more computer readable media storing computer executable instructions to perform a method for vectorizing a set of document predicate structures, the method comprising:
-
identifying at least one predicate and argument in said set of document predicate structures by a predicate key that is an integer representation; estimating conceptual nearness of two of said document predicate structures in said set of document predicate structures by subtracting corresponding ones of said predicate keys; and outputting at least one document based upon the estimated conceptual nearness. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. One or more computer readable media storing computer executable instructions to perform a method for vectorizing a set of document predicate structures, the method comprising:
-
identifying at least one predicate in said set of document predicate structures by a predicate key that is an integer representation; estimating conceptual nearness of two of said document predicate structures in said set of document predicate structures by subtracting corresponding ones of said predicate keys; and outputting at least one document based upon the estimated conceptual nearness. - View Dependent Claims (9, 10, 11, 12, 13)
-
-
14. One or more computer readable media storing computer executable instructions to perform a method for constructing multi-dimensional vector representations for each document of a set of documents, the method comprising:
-
determining each predicate structure of one or more predicate structures M in each document of the set of documents, said M predicate structures including a predicate and at least one argument; identifying the predicate and the at least one argument in each of said M predicate structures by a predicate key that is an integer representation; determining a fixed number of arguments q for vector construction; constructing an N-dimensional vector representation of each document based upon the predicate and q arguments; and outputting at least one document of the set of documents based upon the constructed N-dimensional vector representation of the at least one document, wherein any predicate structure of said M predicate structures that includes less than q arguments fills unfilled argument positions with a numerical zero. - View Dependent Claims (15, 16, 17, 18, 19)
-
Specification