Modified collection frequency ranking method
First Claim
1. A computer implemented method for ranking records of a database located during a search of an index to the database, comprising:
indexing the records of the database by storing index entries in a memory to create the index, each index entry including a word entry representing a unique portion of information of the database and one or more location entries indicating where the unique portion of information represented by the word entry occurs in the records of the database;
assigning a weight to each index entry according to a relative frequency of occurrence of the portion of information in the database;
parsing a query into terms and operators, each term associated with a corresponding index entry;
sequentially searching index entries to locate records of the database which are qualified by the terms and operators of the query;
scoring each located record according to the number of times portions of information corresponding to the terms of the query occur in each record and their associated weights;
storing the scores and identities of the located records in entries of a ranking list, the ranking list having a predetermined number of entries; and
in response to the ranking list becoming full, determining if any unlocated records of the database can receive a score higher than one of the records stored of the ranking list using index entries having a lowest weight, and if not, searching the index using index entries having weights higher than index entries having the lowest weight.
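The steps recited in claim 1 can be illustrated with a small sketch. This is only one possible reading of the claim language, not the patentee's implementation. The function names (`build_index`, `assign_weights`, `rank`), the logarithmic weight formula, and the treatment of the query as a plain list of OR-ed terms (operators are ignored) are all assumptions made for illustration. Terms are processed from highest weight to lowest; once the ranking list is full and no not-yet-located record could outscore its lowest entry using the remaining lower-weight terms, those terms are used only to update records that are already located.

```python
import heapq
import math
from collections import defaultdict


def build_index(records):
    """Map each word to {record_id: number of occurrences in that record}."""
    index = defaultdict(dict)
    for rec_id, text in records.items():
        for word in text.lower().split():
            index[word][rec_id] = index[word].get(rec_id, 0) + 1
    return index


def assign_weights(index, num_records):
    """Give rarer words higher weights (assumed formula, not from the claim)."""
    return {
        word: math.log(1 + num_records / len(postings))
        for word, postings in index.items()
    }


def rank(index, weights, terms, list_size):
    """Return up to list_size (record_id, score) pairs, best first."""
    if list_size <= 0:
        return []
    # Query terms that exist in the index, highest weight first.
    ordered = sorted(
        (t for t in set(terms) if t in index),
        key=lambda t: weights[t],
        reverse=True,
    )
    # Largest contribution each term can make to any single record.
    best = [weights[t] * max(index[t].values()) for t in ordered]

    scores = {}       # located records and their partial scores
    admit_new = True  # whether unlocated records may still be added
    for i, term in enumerate(ordered):
        if admit_new and len(scores) >= list_size:
            # Lowest score currently held in the (full) ranking list.
            lowest_kept = heapq.nlargest(list_size, scores.values())[-1]
            # Upper bound on the score of a record not located so far.
            if sum(best[i:]) <= lowest_kept:
                admit_new = False
        for rec_id, count in index[term].items():
            if admit_new or rec_id in scores:
                scores[rec_id] = scores.get(rec_id, 0.0) + weights[term] * count
    return heapq.nlargest(list_size, scores.items(), key=lambda item: item[1])


if __name__ == "__main__":
    records = {
        "r1": "rare rare common",
        "r2": "rare common",
        "r3": "common",
        "r4": "common common",
    }
    index = build_index(records)
    weights = assign_weights(index, len(records))
    for rec_id, score in rank(index, weights, ["rare", "common"], 2):
        print(rec_id, round(score, 3))
```

Because partial scores only grow as further terms are processed, a record excluded by the bound can never overtake the lowest kept entry, so under these assumptions the pruned ranking list contains the same records as an exhaustive scan whenever scores are not tied.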
Abstract
There is provided an information processing system in which files on plural file servers having different file management manners can be efficiently accessed by a single client program, a control method for the information processing system, and a storage medium for storing programs for performing the control. A gateway program 20 is provided on a first server information processing device 120, and an agent program 30 which is operated in conformity with the gateway program 20 is provided on a second server information processing device 130, whereby a copy of the directory structure of files on the second server information processing device is created on the first server information processing device 120. In addition, at the time when the copy is created, data in a file are not copied, and the data copy is performed only when the file is opened by an application program 8a of a client information processing device 110.
3 Claims
Specification