Retrieving, detecting and identifying major and outlier clusters in a very large database
First Claim
1. A method for retrieving, detecting and identifying documents in a database, said documents in said database being constructed as a document matrix from attributes included in said documents, said method comprising steps of;
- creating said document matrix from said documents using at least one attribute;
creating a scaled residual matrix based on said document matrix from a predetermined function;
executing singular value decomposition to obtain a basis vector corresponding to the largest singular value;
re-constructing said residual matrix and scaling dynamically said re-constructed residual matrix to obtain another basis vector;
repeating said singular value decomposition step to said re-constructing step to create a predetermined set of basis vectors; and
executing dimension reduction of said document matrix to perform detection, retrieval and identification of said documents in said database.
1 Assignment
0 Petitions
Accused Products
Abstract
The present invention discloses a method, a computer system, a computer readable medium and a sever. The method of the present invention comprises steps of; creating said document matrix from said documents using at least one attribute; creating a scaled residual matrix based on said document matrix using a predetermined function; executing singular value decomposition to obtain a basis vector corresponding to the largest singular value; re-constructing said residual matrix and scaling dynamically said re-constructed residual matrix to obtain another basis vector; repeating said singular value decomposition step to said re-constructing step to create a predetermined set of basis vector; and executing reduction of said document matrix to perform detection, retrieval and identification of said documents in said database.
77 Citations
23 Claims
-
1. A method for retrieving, detecting and identifying documents in a database, said documents in said database being constructed as a document matrix from attributes included in said documents, said method comprising steps of;
-
creating said document matrix from said documents using at least one attribute;
creating a scaled residual matrix based on said document matrix from a predetermined function;
executing singular value decomposition to obtain a basis vector corresponding to the largest singular value;
re-constructing said residual matrix and scaling dynamically said re-constructed residual matrix to obtain another basis vector;
repeating said singular value decomposition step to said re-constructing step to create a predetermined set of basis vectors; and
executing dimension reduction of said document matrix to perform detection, retrieval and identification of said documents in said database. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A computer system for retrieving, detecting, and identifying documents in a database, said documents in said database being constructed as a document matrix from attributes included in said documents, said computer system comprises;
-
means for creating said document matrix from said documents using at least one attribute;
means for scaling said residual matrix based on said document matrix from a predetermined function;
means for executing singular value decomposition to obtain a basis vector corresponding to the largest singular value;
means for re-constructing said residual matrix and scaling dynamically said re-constructed residual matrix to obtain another basis vectors;
means for repeating said singular value decomposition step to said re-constructing step to create a predetermined set of basis vector; and
means for executing dimension reduction of said document matrix to perform detection, retrieval, and identification of said documents in said database. - View Dependent Claims (9, 10, 11, 12, 13, 14)
-
-
15. A computer readable medium storing a computer program for retrieving, detecting, and identifying documents in a database, said documents in said database being constructed as a document matrix from attributes included in said documents, said program executing steps of;
-
creating said document matrix from said documents using at least one attribute;
creating a scaled residual matrix based on said document matrix from a predetermined function;
executing singular value decomposition to obtain a basis vector corresponding to the largest singular value;
re-constructing said residual matrix and scaling dynamically said re-constructed residual matrix to obtain another basis vector;
repeating said singular value decomposition step to said re-constructing step to create a predetermined set of basis vectors; and
executing dimension reduction of said document matrix to perform detection, retrieval and identification of said documents in said database. - View Dependent Claims (16, 17, 18, 19, 20, 21)
-
-
22. A server for retrieving, detecting, and identifying documents in a database, said documents in said database being constructed as a document matrix from attributes included in said documents, said server communicating to a client through a network, said server comprising;
-
means for receiving a request for retrieving and detecting said document though said network;
means for receiving another request for selecting a method for singular value decomposition from said client;
means for creating said document matrix from said documents using at least one attribute;
means for scaling said residual matrix based on said document matrix from a predetermined function;
means for executing singular value decomposition to obtain a basis vector corresponding to the largest singular value in response to said another request;
means for re-constructing said residual matrix and scaling dynamically said re-constructed residual matrix to obtain another basis vector;
means for repeating said singular value decomposition step to said re-constructing step to create a predetermined set of basis vectors; and
means for executing dimension reduction of said document matrix to perform detection, retrieval and identification of said documents in said database; and
means for returning at least one result of said detection, said retrieval and said identification to said client. - View Dependent Claims (23)
-
Specification