
Index to a semi-structured database

  • US 7,409,381 B1
  • Filed: 07/30/1999
  • Issued: 08/05/2008
  • Est. Priority Date: 07/30/1998
  • Status: Expired due to Fees
First Claim

1. Apparatus for accessing a semi-structured database in accordance with an input request for information, wherein the semi-structured database comprises a plurality of items, each item comprising one or more fields having a plurality of characters therein, at least one of the fields being a free text field, the apparatus comprising:

  • a processor;

    means for accessing a data store comprising a plurality of index entries, each index entry representing a concordance between an entry in a field of an item in the semi-structured database and that item in the semi-structured database;

    input means for receiving a request for information, the request comprising a natural language phrase;

    a parser for parsing the received request to determine components of the request;

    a slot filler for generating a slot-and-filler request having a plurality of slots, each slot corresponding to a group of index entries in said data store, the slot-filler being arranged to identify one or more object components representing an object of the received request from the parsed request, and allocate at least one object component to a respective slot of a slot-and-filler request; and

    a query constructor for accessing the data store, wherein the query constructor is arranged to compare the or each allocated object component in the slot-and-filler request with a group of index entries in said data store corresponding to the slot of the allocated component, so as to identify an index entry corresponding thereto, and to use the identified index entry to identify a corresponding item in the semi-structured database;

    an index generator comprising a processor arranged, in respect of each item in the semi-structured database, to analyze each field in accordance with a predetermined criterion so as to identify an entry within said field, and to generate at least one index entry representing a concordance between an identified entry and the item corresponding to the identified entry, and store the generated index entry in the data store;

    wherein for each of a plurality of predetermined formats, the processor is arranged to search said free text field to identify a sequence of characters having a format corresponding to the predetermined format, said identified sequence of characters being deemed to constitute an identified entry;

    wherein for the free text field, the processor is arranged to define any data not identified as an entry as a free text entry;

    wherein the free text entry comprises at least one free text word defined by a sequence of alphanumeric characters, the processor being arranged to identify at least one selected free text word for a field by comparing the free text entry with at least one selection criterion defining one or more predetermined characteristics of a selected free text word.
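
The elements of the claim can be illustrated with a small sketch. Everything in it is an assumption made for illustration and is not taken from the patent: the two "predetermined formats" (a date pattern and a phone pattern), the stop-word and length test used as the "selection criterion", the command-word list standing in for the parser, and all function and field names. It indexes each item, extracts formatted entries from the free text field, treats the remainder as free text words, then fills slots from a request and intersects the matching index entries.

```python
import re
from collections import defaultdict

# Hypothetical "predetermined formats": slot name -> pattern (assumed, not from the patent).
FORMATS = {
    "date": re.compile(r"\b\d{2}/\d{2}/\d{4}\b"),
    "phone": re.compile(r"\b\d{3}-\d{4}\b"),
}
STOP_WORDS = {"the", "a", "an", "of", "for", "on", "in", "and", "to", "with"}
# Words the toy "parser" treats as part of the command rather than the object.
COMMAND_WORDS = {"find", "show", "list", "about", "items", "dated"}


def select_free_text_words(text):
    """Assumed selection criterion: alphanumeric, at least 3 characters, not a stop word."""
    words = re.findall(r"[A-Za-z0-9]+", text.lower())
    return [w for w in words if len(w) >= 3 and w not in STOP_WORDS]


def build_index(items, free_text_fields=("notes",)):
    """Index generator: maps slot -> entry -> set of item ids (the concordance)."""
    index = defaultdict(lambda: defaultdict(set))
    for item_id, fields in items.items():
        for field_name, value in fields.items():
            if field_name not in free_text_fields:
                # Structured field: the whole value is the entry.
                index[field_name][value.lower()].add(item_id)
                continue
            remainder = value
            for slot, pattern in FORMATS.items():
                for match in pattern.findall(remainder):
                    index[slot][match].add(item_id)
                # Remove identified entries; what is left is the free text entry.
                remainder = pattern.sub(" ", remainder)
            for word in select_free_text_words(remainder):
                index["free_text"][word].add(item_id)
    return index


def fill_slots(request):
    """Toy parser plus slot filler: allocate object components to slots."""
    slots = defaultdict(list)
    remainder = request
    for slot, pattern in FORMATS.items():
        slots[slot].extend(pattern.findall(remainder))
        remainder = pattern.sub(" ", remainder)
    for word in select_free_text_words(remainder):
        if word not in COMMAND_WORDS:
            slots["free_text"].append(word)
    return {slot: comps for slot, comps in slots.items() if comps}


def run_query(index, slots):
    """Query constructor: compare each component with its slot's index entries."""
    result = None
    for slot, components in slots.items():
        for component in components:
            hits = index.get(slot, {}).get(component, set())
            result = set(hits) if result is None else result & hits
    return result or set()


ITEMS = {
    1: {"author": "Smith", "notes": "Budget review meeting on 07/30/1998, call 555-1234"},
    2: {"author": "Jones", "notes": "Budget forecast for the library, 08/05/1998"},
}
index = build_index(ITEMS)
matches = run_query(index, fill_slots("Find budget items dated 07/30/1998"))
```

Intersecting the hits requires every allocated component to match; a ranking scheme that tolerates partial matches would be an equally plausible reading, and the claim does not say which is intended.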
