Data retrieval method, system and program product
First Claim
1. A data retrieval system for causing a computer to retrieve data being stored in a database, said retrieval system comprising:
- a database storing data as a vector digitized based on a keyword;
a means for generating a residual vector from said data, said residual vector corresponding to a vector in which an element corresponding to a contribution component in a direction of a basic vector calculated is subtracted from a previously obtained residual vector, the basic vector and the residual vector a newly generated residual vector lie in an orthogonal relationship, to compute and store a covariance matrix and an eigenvector of said covariance matrix, and for generating and storing a set of basic vectors from a set of computed eigenvectors;
a means for reading out said data and at least one of said eigenvectors from the database, and for computing and storing a contribution vector of said eigenvector to said data, and for contracting or enlarging a residual vector by reading out said contribution vector to compute and store a new eigenvector; and
a means for selecting a keyword to be used for labeling clusters according to a similarity between said stored basic vector and said data, and a weight on said similarity so as to store the keyword in the database;
means for classifying data into clusters of documents having the same or similar keywords and depending on a similarity between the stored basic vector and the data; and
a means for outputing cluster data of a cluster to a graphical user interface system for displaying the cluster data.
1 Assignment
0 Petitions
Accused Products
Abstract
A data retrieval system having a database storing data as a vector that has been digitized based on a keyword. The system has capability of generating a residual vector from the data to compute a covariance matrix and an eigenvector of the covariance matrix. The system further has the capability for generating and storing a set of basic vectors from the set of the computed eigenvectors, and for reading out the data and at least one of the eigenvectors from a memory. This is utilized to compute a contribution of the eigenvector to the data, and for contracting or enlarging a residual vector to store. The system selects a keyword to be used for labeling according to a similarity between the stored basic vector and the data, and a weight on the similarity so as to store the keyword in a memory.
-
Citations
9 Claims
-
1. A data retrieval system for causing a computer to retrieve data being stored in a database, said retrieval system comprising:
- a database storing data as a vector digitized based on a keyword;
a means for generating a residual vector from said data, said residual vector corresponding to a vector in which an element corresponding to a contribution component in a direction of a basic vector calculated is subtracted from a previously obtained residual vector, the basic vector and the residual vector a newly generated residual vector lie in an orthogonal relationship, to compute and store a covariance matrix and an eigenvector of said covariance matrix, and for generating and storing a set of basic vectors from a set of computed eigenvectors;
a means for reading out said data and at least one of said eigenvectors from the database, and for computing and storing a contribution vector of said eigenvector to said data, and for contracting or enlarging a residual vector by reading out said contribution vector to compute and store a new eigenvector; and
a means for selecting a keyword to be used for labeling clusters according to a similarity between said stored basic vector and said data, and a weight on said similarity so as to store the keyword in the database;
means for classifying data into clusters of documents having the same or similar keywords and depending on a similarity between the stored basic vector and the data; and
a means for outputing cluster data of a cluster to a graphical user interface system for displaying the cluster data. - View Dependent Claims (2, 3)
- a database storing data as a vector digitized based on a keyword;
-
4. A data retrieval method for causing a computer to retrieve data stored in a database, said data retrieval method comprising the steps of:
-
reading out data from a database storing data as a vector digitized based on a keyword; computing and storing a covariance matrix and an eigenvector of said covariance matrix, using said data; generating and storing a set of basic vectors from a set of said computed eigenvectors; reading out said data and at least one eigenvector from the database, and computing and storing a contribution vector of said eigenvector to said data; and
computing a residual vector from said data and said eigenvector, and contracting or enlarging a residual vector by reading out said contribution vector to compute and store a new eigenvector; and
classifying data into clusters of documents having the same or similar keywords and depending on a similarity between the stored basic vector and the data; and
outputting cluster data of a cluster to a graphical user interface system for displaying the cluster data. - View Dependent Claims (5, 6)
-
-
7. A computer executable program product comprising a computer readable media having computer executable program thereon for implementing a data retrieval method for causing a computer to retrieve data stored in a database, said program comprising the steps of:
- reading out data from a database storing data as a vector digitized based on a keyword;
computing and storing a covariance matrix and an eigenvector of said covariance matrix, using said data;
generating and storing a set of basic vectors from a set of said computed eigenvectors;
reading out said data and at least one eigenvector from the database, and computing and storing a contribution vector of said eigenvector to said data, and computing a residual vector from said data and said eigenvector, and contracting or enlarging a residual vector by reading out said contribution vector to compute and store a new eigenvector in the database; and
classifying data into clusters of documents having the same or similar keywords and depending on a similarity between the stored basic vector and the data; and
outputting cluster data of a cluster to a graphical user interface system for displaying the cluster data. - View Dependent Claims (8, 9)
- reading out data from a database storing data as a vector digitized based on a keyword;
Specification