INFORMATION SEARCH METHOD, APPARATUS, PROGRAM AND COMPUTER READABLE RECORDING MEDIUM
First Claim
1. An information search apparatus comprising:
- a character string input unit configured to obtain a character string from a client;
a character string information search unit configured to obtain information that includes the character string from an index DB;
a similarity calculation unit configured to calculate degree of similarity between the character string and searched information; and
an output unit configured to output the searched information in descending order of the degree of similarity, whereinthe character string information search unit includes a unit configured to, when the input character string contains a plurality of words, search an index DB, based on each word, that stores words and occurrence position information of the words to obtain a distance between occurrence positions of the words, andthe similarity calculation unit includes a unit configured to calculate the degree of similarity based on the distance between occurrence positions of the words.
1 Assignment
0 Petitions
Accused Products
Abstract
An information search apparatus is provided. The information search apparatus includes: a character string input unit configured to obtain a character string from a client; a character string information search unit configured to obtain information that includes the character string from an index DB; a similarity calculation unit configured to calculate degree of similarity between the character string and searched information; and an output unit configured to output the searched information in descending order of the degree of similarity. In the information search apparatus, the character string information search unit includes a unit configured to, when the input character string contains a plurality of words, search an index DB, based on each word, that stores words and occurrence position information of the words to obtain a distance between occurrence positions of the words, and the similarity calculation unit includes a unit configured to calculate the degree of similarity based on the distance between occurrence positions of the words.
32 Citations
16 Claims
-
1. An information search apparatus comprising:
-
a character string input unit configured to obtain a character string from a client; a character string information search unit configured to obtain information that includes the character string from an index DB; a similarity calculation unit configured to calculate degree of similarity between the character string and searched information; and an output unit configured to output the searched information in descending order of the degree of similarity, wherein the character string information search unit includes a unit configured to, when the input character string contains a plurality of words, search an index DB, based on each word, that stores words and occurrence position information of the words to obtain a distance between occurrence positions of the words, and the similarity calculation unit includes a unit configured to calculate the degree of similarity based on the distance between occurrence positions of the words. - View Dependent Claims (2, 3, 4, 5, 15, 16)
-
-
6. An information search apparatus comprising:
-
a character string input unit configured to obtain a character string from a client; a character string information search unit configured to obtain information on a document including the character string from an index DB that stores sentence-based word occurrence position information in the document for each word; a similarity calculation unit configured to calculate degree of similarity between the character string and the document; and an output unit configured to output information of the document in descending order of the degree of similarity, wherein the character string information search unit includes a unit configured to, when the input character string includes a plurality of words, search the index DB based on each word to obtain sentence-based occurrence position information of each word for each document, and the similarity calculation unit includes a unit configured to calculate the degree of similarity between each document and the character string based on degree of sentence-based co-occurrence of the plurality of words in each document. - View Dependent Claims (7)
-
-
8. An information search method in an apparatus for obtaining a character string from a client, obtaining information that includes the character string from an index DB, calculating degree of similarity between the character string and searched information, and outputting the searched information in descending order of the degree of similarity, the information search method comprising:
-
a character string information search step in which, when the input character string contains a plurality of words, a character string information search unit searches an index DB, based on each word, that stores words and occurrence position information of the words to obtain a distance between occurrence positions of the words, and a step in which a similarity calculation unit calculates the degree of similarity based on the distance between occurrence positions of the words. - View Dependent Claims (9, 10, 11, 12)
-
-
13. An information search method executed by an information search apparatus, the information search apparatus comprising:
-
a character string input unit configured to obtain a character string from a client; a character string information search unit configured to obtain information on a document including the character string from an index DB that stores sentence-based word occurrence position information in the document for each word; a similarity calculation unit configured to calculate degree of similarity between the character string and the document; and an output unit configured to output searched information of the document in descending order of the degree of similarity, the information search method comprising; a character string information search step in which, when the input character string includes a plurality of words, the character string information search unit searches the index DB based on each word to obtain sentence-based occurrence position information of each word for each document, and a similarity calculation step in which the similarity calculation unit calculates the degree of similarity between each document and the character string based on degree of sentence-based co-occurrence of the plurality of words in each document. - View Dependent Claims (14)
-
Specification