Retrieving and ranking of documents from database description
First Claim
1. A method for retrieving and/or ranking documents in a database, the documents including attribute data, said method comprising steps of, providing a document matrix from said documents, said matrix including numerical elements derived from said attribute data;
- providing covariance matrix from said document matrix;
executing singular value decomposition of said covariance matrix so as to obtain the following formula;
K−
v┘
©
┘
VT, wherein K represents said covariance matrix, V represents the matrix consisting of eigenvectors, ©
represents a diagonal matrix, and VT represents a transpose of the matrix V;
reducing a dimension of said matrix V using predetermined numbers of eigenvectors included in said matrix V, said eigenvectors including an eigenvector corresponding to the largest singular value;
reducing a dimension of said document matrix using said dimension reduced matrix V; and
retrieving and/or ranking said documents in said database by computing a scalar product between said dimension reduced document matrix and a query vector.
3 Assignments
0 Petitions
Accused Products
Abstract
A method, a computer system, and a program product for retrieving and/or ranking documents in a database. The method comprising steps of, providing a document matrix derived from the documents, the matrix including numerical elements derived from the attributes; providing a covariance matrix derived from the document matrix; executing singular value decomposition of the covariance matrix so as to obtain the following formula:K=V┘©┘VT, wherein K represents the covariance matrix, V represents the matrix consisting of eigenvectors, © represents a diagonal matrix, and VT represents a transpose of the matrix V; reducing a dimension of the matrix V using a predetermined number of eigenvectors included in the matrix V, the eigenvectors including an eigenvector corresponding to the largest singular value; reducing a dimension of the document matrix using the dimension reduced matrix V; and retrieving and/or ranking the documents in the database by computing the scalar product between the dimension reduced document matrix and a query vector.
-
Citations
18 Claims
-
1. A method for retrieving and/or ranking documents in a database, the documents including attribute data, said method comprising steps of,
providing a document matrix from said documents, said matrix including numerical elements derived from said attribute data; -
providing covariance matrix from said document matrix;
executing singular value decomposition of said covariance matrix so as to obtain the following formula;
K−
v┘
©
┘
VT,wherein K represents said covariance matrix, V represents the matrix consisting of eigenvectors, ©
represents a diagonal matrix, and VT represents a transpose of the matrix V;
reducing a dimension of said matrix V using predetermined numbers of eigenvectors included in said matrix V, said eigenvectors including an eigenvector corresponding to the largest singular value;
reducing a dimension of said document matrix using said dimension reduced matrix V; and
retrieving and/or ranking said documents in said database by computing a scalar product between said dimension reduced document matrix and a query vector. - View Dependent Claims (2, 3, 4, 5, 6)
-
-
7. A computer system for executing a method for retrieving and/or ranking documents in a database, the documents including attribute data, said computer system having apparatus for:
-
providing a document matrix from said documents, said matrix including numerical elements derived from said attribute data;
providing a covariance matrix from said document matrix;
executing singular value decomposition of said covariance matrix so as to obtain the following formula;
K=V┘
©
┘
VT,wherein K represents said covariance matrix, V represents the matrix consisting of eigenvectors, ©
represents a diagonal matrix, and VT represents a transpose of the matrix V;
reducing a dimension of said matrix V using predetermined numbers of eigenvectors included in said matrix V, said eigenvectors including an eigenvector corresponding to the largest singular value;
reducing a dimension of said document matrix using said dimension reduced matrix V; and
retrieving and/or ranking said documents in said database by computing a scalar product between said dimension reduced document matrix and a query vector. - View Dependent Claims (8, 9, 10, 11, 12, 14, 15, 16, 17, 18)
-
-
13. A program product including a computer readable computer program for executing a method for retrieving and/or ranking documents in a database, the documents including attribute data, said computer program including computer readable code for causing said apparatus to execute the steps of:
-
providing a document matrix from said documents, said matrix including numerical elements derived from said attribute data;
providing covariance matrix from said document matrix;
executing singular value decomposition of said covariance matrix so as to obtain the following formula;
K=V┘
©
┘
VT,wherein K represents said covariance matrix, V represents the matrix consisting of eigenvectors, ©
represents a diagonal matrix, and VT represents a transpose of the matrix V;
reducing a dimension of said matrix V using predetermined numbers of eigenvectors included in said matrix V said eigenvectors including an eigenvector corresponding to the largest singular value;
reducing a dimension of said document matrix using said dimension reduced matrix V; and
retrieving and/or ranking said documents in said database by computing scalar product between said dimension reduced document matrix and a query vector.
-
Specification