System and method for creating a data file for use in searching a database
First Claim
1. A processor-implemented database search system, comprising:
- a database for storing a document file;
a database control unit for controlling a transfer of a document file to and from said database;
a search engine for searching said database on the basis of a keyword comprised of a character string, and for returning a search result to said database control unit;
an index file for use in a search process by means of said search engine, for retaining information indicating a correspondence of said keyword to positional information of the keyword, based on a field in the document file that includes said keyword;
wherein the index file comprises a key file that includes a list of pointers to character strings and positional information;
wherein the character strings are included in the document file and are stored in the database by document area where a character string in the document file appears and by a positional information file;
wherein the positional information file includes a list of positional information containing information that specifies the document file where a character string exists and information that specifies a position of the character string in the document file regarding the character string in the key file; and
wherein for a variable-length chain included in the index file, a relational file is created by dividing each word in the variable-length chain into a plurality of fixed-length keywords.
1 Assignment
0 Petitions
Accused Products
Abstract
An index file for use in search processing employing a search engine, retains information that indicates a correspondence of a keyword to its positional information. The index file comprises a key file that includes a list of pointers to character strings. The character strings are included in document files that are stored in a document database and to positional information, by document area where a character string in a document file appears and a POS file. The POS file includes a list of positional information containing information that specifies a document file where a character string exists and information that specifies a position of the character string in a document file regarding the character strings in the key file.
-
Citations
12 Claims
-
1. A processor-implemented database search system, comprising:
-
a database for storing a document file; a database control unit for controlling a transfer of a document file to and from said database; a search engine for searching said database on the basis of a keyword comprised of a character string, and for returning a search result to said database control unit; an index file for use in a search process by means of said search engine, for retaining information indicating a correspondence of said keyword to positional information of the keyword, based on a field in the document file that includes said keyword; wherein the index file comprises a key file that includes a list of pointers to character strings and positional information; wherein the character strings are included in the document file and are stored in the database by document area where a character string in the document file appears and by a positional information file; wherein the positional information file includes a list of positional information containing information that specifies the document file where a character string exists and information that specifies a position of the character string in the document file regarding the character string in the key file; and wherein for a variable-length chain included in the index file, a relational file is created by dividing each word in the variable-length chain into a plurality of fixed-length keywords. - View Dependent Claims (2, 3, 4)
-
-
5. A processor-implemented computer program product being exacutable by a processor for use in searching database, comprising:
-
database control means for controlling the transfer of a document file to and from a database; and search means for referencing the index file retaining information that specifies a document file which includes a keyword containing a character string and information indicating a correspondence to the keyword according to a field in the document file, the document file including the keyword and transmitting information specifying the document file where the character string appears in the field to the database control means; wherein the index file comprises a key file that includes a list of pointers to character strings and positional information; wherein the character strings are included in the document file and are stored in the database by document area where a character string in the document file appears and by a positional information file; wherein the positional information file includes a list of positional information containing information that specifies the document file where a character string exists and information that specifies a position of the character string in the document file regarding the character string in the key file; and wherein for a variable-length chain included in the index file, a relational file is created by dividing each word in the variable-length chain into a plurality of fixed-length keywords. - View Dependent Claims (10, 11, 12)
-
-
6. A computer program product being executable by a processor having a plurality of executable instruction codes stored on a computer-usable medium, for use in searching a database, comprising:
-
a set of instruction codes for receiving an input of a search expression that contains a search term made of a given character string and field information specifying a field in a document file where the search term appears; a set of instruction codes for referencing an index file stored in memory and retaining information indicating a correspondence of information that specifies said document file to the keyword according to a field in said document file; a set of instruction codes for specifying a keyword corresponding to said search expression that comprises said field information; a set of instruction codes for acquiring information specifying a document file including the specified keyword; and a set of instruction codes for outputting the information acquired by referencing said index file as a search result; wherein the index file comDrises a key file that includes a list of pointers to character strings and positional information; wherein the character strings are included in the document file and are stored in the database by document area where a character string in the document file aroears and by a positional information file; wherein the positional information file includes a list of positional information containing information that soecifies the document file where a character string exists and information that srecifies a position of the character string in the document file regarding the character string in the key file; and wherein for a variable-length chain included in the index file, a set of instruction codes creates a relational file by dividing each word in the variable-length chain into a plurality of fixed-length keywords. - View Dependent Claims (7, 8, 9)
-
Specification