Document retrieval system recognizing types and values of numeric search conditions
First Claim
1. A document retrieval system for retrieving a document from a set of documents according to one or more terms received as search conditions submitted by a user, the terms that can be received including both numeric terms, which include numeric values, and non-numeric terms, which do not include numeric values, the document retrieval system comprising:
a first processor for generating at least one item of index information for each document in the set of documents, the index information generated for the documents including both numeric term index information and non-numeric term index information, the non-numeric term index information including information about frequency of occurrence of non-numeric terms in the document, the numeric term index information including information about types and numeric values of numeric terms in the document;
a memory unit for storing the index information generated by the first processor;
a second processor for examining each numeric term received as a search condition, determining its type, and generating numeric search information indicating the type and numeric value of the numeric term;
a third processor for using the non-numeric term index information generated by the first processor to calculate, for each document in the set of documents including a non-numeric term received as a search condition, a first numeric quantity having a single numeric value indicating a goodness of fit between the non-numeric term and the document;
a fourth processor for using the numeric term index information generated by the first processor and the numeric search information generated by the second processor from a numeric term received as a search condition to calculate, for each document in the set of documents including a numeric term of the same type as the numeric term received as the search condition, by comparing numeric values of the numeric terms, a second numeric quantity having a single numeric value indicating a goodness of fit between the numeric term received as the search condition and the document, closer numeric values indicating better fits;
a fifth processor for calculating, for a document in the set of documents, a third numeric quantity indicating a goodness of fit between the document and the one or more terms received as search conditions, the third numeric quantity being calculated from the first and second numeric quantities; and
an output unit for sending information for displaying a list of documents from the set of documents to the user according to the third numeric quantity.
Abstract
A document retrieval system has an index information generator that generates index information about the frequency of occurrence of non-numeric terms in documents, and index information about the numeric values of numeric terms and the types of quantities they represent. Numeric terms received as search conditions are processed by a numeric information generator that generates search information indicating the type and numeric value of each numeric term. The index information generator and numeric information generator preferably convert numeric values to a standard notation. Numeric search conditions can then be used appropriately, flexibly, and conveniently to obtain useful retrieval results.
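As an illustration of the numeric information generator described in the abstract, the following is a minimal sketch of how a numeric term might be given a type and converted to a standard notation. The unit table, the choice of grams and millimetres as standard units, and the names `UNITS`, `NumericTerm`, and `parse_numeric_term` are assumptions made for this example; they are not taken from the patent.

```python
import re
from typing import NamedTuple, Optional

# Hypothetical unit table: unit -> (quantity type, factor to the standard unit).
# Standard units assumed here: grams for mass, millimetres for length.
UNITS = {
    "g": ("mass", 1.0),
    "kg": ("mass", 1000.0),
    "mm": ("length", 1.0),
    "cm": ("length", 10.0),
    "m": ("length", 1000.0),
    "km": ("length", 1000000.0),
}


class NumericTerm(NamedTuple):
    kind: str     # type of quantity the term represents, e.g. "mass"
    value: float  # numeric value expressed in the standard unit


# A number (optionally signed, optionally with a decimal part) followed by a unit.
_PATTERN = re.compile(r"^\s*([-+]?\d+(?:\.\d+)?)\s*([A-Za-z]+)\s*$")


def parse_numeric_term(term: str) -> Optional[NumericTerm]:
    """Return the type and standardized value of a term, or None if it is not numeric."""
    match = _PATTERN.match(term)
    if match is None:
        return None
    unit = match.group(2).lower()
    if unit not in UNITS:
        return None
    kind, factor = UNITS[unit]
    return NumericTerm(kind, float(match.group(1)) * factor)
```

With this sketch, "2.5 kg" and "2500 g" both yield a mass of 2500.0 in the standard unit, so a search condition and an indexed term written in different units become directly comparable. A term such as "laptop" yields None and would be treated as a non-numeric term.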
16 Claims
13. A retrieval execution apparatus for use in a system for retrieving a document from a set of documents according to one or more terms received as search conditions submitted by a user, the terms that can be received including both numeric terms, which include numeric values, and non-numeric terms, which do not include numeric values, the retrieval execution apparatus having:
a first processor for generating at least one item of index information for each document in the set of documents, the index information generated for the documents including both numeric term index information and non-numeric term index information, the non-numeric term index information including information about frequency of occurrence of non-numeric terms in the document, the numeric term index information including information about types and numeric values of numeric terms in the document;
a second processor for examining each numeric term received as a search condition, determining its type, and generating numeric search information indicating the type and numeric value of the numeric term;
a third processor for using the non-numeric term index information generated by the index information generator to calculate, for each document in the set of documents including a non-numeric term received as a search condition, a first numeric quantity having a single numeric value indicating a goodness of fit between the non-numeric term and the document;
a fourth processor for using the numeric term index information generated by the first processor and the numeric search information generated by the second processor from a numeric term received as a search condition to calculate, for each document in the set of documents including a numeric term of the same type as the numeric term received as the search condition, by comparing numeric values of the numeric terms, a second numeric quantity having a single numeric value indicating a goodness of fit between the numeric term received as the search condition and the document, closer numeric values indicating better fits;
a fifth processor for calculating, for a document in the set of documents, a third numeric quantity indicating a goodness of fit between the document and the one or more terms received as search conditions, the third numeric quantity being calculated from the first and second numeric quantities; and
an output unit for sending information for displaying a list of documents from the set of documents to the user according to the third numeric quantity.
15. A method of retrieving a document from a set of documents according to one or more terms received as search conditions submitted by a user, the terms that can be received including both numeric terms, which include numeric values, and non-numeric terms, which do not include numeric values, the method comprising:
generating at least one item of index information for each document in the set of documents, the index information generated for the documents including both numeric term index information and non-numeric term index information, the non-numeric term index information including information about frequency of occurrence of non-numeric terms in the document, the numeric term index information including information about types and numeric values of numeric terms appearing in the document; and
examining each numeric term received as a search condition, determining its type, and generating numeric search information indicating the type and numeric value of the numeric term;
using the non-numeric term index information to calculate, for each document in the set of documents including a non-numeric term received as a search condition, a first numeric quantity having a single numeric value indicating a goodness of fit between the non-numeric term and the document; and
using the numeric term index information and the numeric search information generated from a numeric term received as a search condition to calculate, for each document in the set of documents including a numeric term of the same type as the numeric term received as the search condition, by comparing numeric values of the numeric terms, a second numeric quantity having a single numeric value indicating a goodness of fit between the numeric term received as the search condition and the document, closer numeric values indicating better fits;
calculating, for a document in the set of documents, a third numeric quantity having a single numeric value indicating a goodness of fit between the document and the one or more terms received as search conditions, the third numeric quantity being calculated from the first and second numeric quantities; and
sending information for displaying a list of documents from the set of documents to the user according to the third numeric quantity.
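The scoring steps of the method claim can be sketched roughly as follows. This is only an illustrative reading: the claim does not specify any formula, so the saturating frequency score, the relative-distance closeness score, and the simple sum used for the third quantity are all assumptions, as are the index layout and the names `text_score`, `numeric_score`, and `rank`.

```python
from typing import Dict, List, Tuple

# Assumed index layout (hypothetical): for each document id,
#   "tf":   {non-numeric term: frequency of occurrence}
#   "nums": [(type, standardized value), ...]
Index = Dict[str, dict]


def text_score(tf: Dict[str, int], term: str) -> float:
    """First quantity: grows with term frequency, saturating below 1 (assumed formula)."""
    count = tf.get(term, 0)
    return count / (count + 1)


def numeric_score(nums: List[Tuple[str, float]], kind: str, value: float) -> float:
    """Second quantity: compares only same-type values; closer values score higher."""
    same_type = [v for k, v in nums if k == kind]
    if not same_type:
        return 0.0
    nearest = min(abs(v - value) for v in same_type)
    scale = abs(value) or 1.0  # avoid dividing by zero for a query value of 0
    return 1.0 / (1.0 + nearest / scale)


def rank(index: Index, words: List[str],
         numeric_conditions: List[Tuple[str, float]]) -> List[Tuple[str, float]]:
    """Third quantity: here simply the sum of the first and second quantities."""
    scored = []
    for doc_id, entry in index.items():
        first = sum(text_score(entry["tf"], w) for w in words)
        second = sum(numeric_score(entry["nums"], k, v) for k, v in numeric_conditions)
        third = first + second
        if third > 0:
            scored.append((doc_id, third))
    # Best fit first, as in the list sent for display to the user.
    return sorted(scored, key=lambda pair: pair[1], reverse=True)
```

For example, given two documents that both mention "laptop" twice, one listing a mass of 2000 g and the other 5000 g, a search for "laptop" with a numeric condition of 2000 g ranks the first document higher, since its numeric value is closer. A document containing only a length of 2000 mm receives no numeric score, because its numeric term is of a different type.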
16. A machine-readable medium storing a program for retrieving a document from a set of documents according to one or more terms received as search conditions submitted by a user, the search terms that can be received including both numeric terms, which include numeric values, and non-numeric terms, which do not include numeric values, the program being executable by a computing device, the program including:
instructions for generating at least one item of index information for each document in the set of documents, the index information generated for the documents including both numeric term index information and non-numeric term index information, the non-numeric term index information including information about frequency of occurrence of non-numeric terms in the document, the numeric term index information including information about types and numeric values of numeric terms appearing in the document;
instructions for examining each numeric term received as a search condition, determining its type, and generating numeric search information indicating the type and numeric value of the numeric term;
instructions for using the non-numeric term index information to calculate, for each document in the set of documents including a non-numeric term received as a search condition, a first numeric quantity having a single numeric value indicating a goodness of fit between the non-numeric term and the document; and
instructions for using the numeric term index information and the numeric search information generated from a numeric term received as a search condition to calculate, for each document in the set of documents including a numeric term of the same type as the numeric term received as the search condition, by comparing numeric values of the numeric terms, a second numeric quantity having a single numeric value indicating a goodness of fit between the numeric term received as the search condition and the document, closer numeric values indicating better fits;
instructions for calculating, for a document in the set of documents, a third numeric quantity having a single numeric value indicating a goodness of fit between the document and the one or more terms received as search conditions, the third numeric quantity being calculated from the first and second numeric quantities; and
instructions for sending information for displaying a list of documents from the set of documents to the user according to the third numeric quantity.
Specification