Information recognition apparatus for recognizing recognition object information
First Claim
1. An information recognition apparatus for recognizing recognition object information in the form of a series of pieces of information which is composed of a plurality of information elements for each of which a predetermined number of element words each of which can make the information element are determined, comprising:
- a word storage section in which all element words which can make the information elements are stored;
a rule storage section in which rules representing a hierarchical relationship of the information elements are stored;
element word recognition means for recognizing words in recognition object information, detecting, for each of the information elements in the recognition object information, element word candidates based on a result of the recognition, the stored contents of said word storage section and the stored contents of said rule storage section and detecting likelihoods of the element word candidates;
a record storage section in which actually existing recognition object information which can be represented as combinations of element words is stored in the form of records each of which includes record items provided by the information elements of the recognition object information;
record number acquisition means for retrieving said record storage section using the element word candidates detected by said element word recognition means successively as a key to acquire, for each of the element word candidates, a record number of a record which includes the element word candidate;
likelihood calculation means for providing likelihood counters in a corresponding relationship to the individual record numbers acquired by said record number acquisition means and adding the likelihoods of the element word candidates detected by said element word recognition means to those of said likelihood counters which correspond to the record numbers of the records which include the element word candidates;
result discrimination means for discriminating a record to be determined as a recognition result based on the count values of said likelihood counters; and
result extraction means for extracting a record to be determined as a recognition result from said record storage section based on a result of the discrimination of said result discrimination means.
1 Assignment
0 Petitions
Accused Products
Abstract
The invention provides an information recognition apparatus for recognition of an address or the like which can recognize recognition object information, which is inputted in the form which does not have punctuations or element designations, at a high speed and with a high degree of accuracy. An element word recognition unit detects element word candidates of each information element of recognition element information and likelihoods of the element word candidates. A record number acquisition unit retrieves a record storage unit to acquire, for each element word candidate detected by the element word recognition unit, a record number of a record including the element word candidate. A likelihood calculation unit calculates likelihoods of the records using corresponding likelihood counters. A result discrimination unit discriminates a record to be determined as a recognition result of the recognition object information based on count values of the likelihood counters, and a result extraction unit extracts a record to be determined as a recognition result from the record storage section based on a result of the discrimination of the result discrimination unit.
-
Citations
8 Claims
-
1. An information recognition apparatus for recognizing recognition object information in the form of a series of pieces of information which is composed of a plurality of information elements for each of which a predetermined number of element words each of which can make the information element are determined, comprising:
-
a word storage section in which all element words which can make the information elements are stored; a rule storage section in which rules representing a hierarchical relationship of the information elements are stored; element word recognition means for recognizing words in recognition object information, detecting, for each of the information elements in the recognition object information, element word candidates based on a result of the recognition, the stored contents of said word storage section and the stored contents of said rule storage section and detecting likelihoods of the element word candidates; a record storage section in which actually existing recognition object information which can be represented as combinations of element words is stored in the form of records each of which includes record items provided by the information elements of the recognition object information; record number acquisition means for retrieving said record storage section using the element word candidates detected by said element word recognition means successively as a key to acquire, for each of the element word candidates, a record number of a record which includes the element word candidate; likelihood calculation means for providing likelihood counters in a corresponding relationship to the individual record numbers acquired by said record number acquisition means and adding the likelihoods of the element word candidates detected by said element word recognition means to those of said likelihood counters which correspond to the record numbers of the records which include the element word candidates; result discrimination means for discriminating a record to be determined as a recognition result based on the count values of said likelihood counters; and result extraction means for extracting a record to be determined as a recognition result from said record storage section based on a result of the discrimination of said result discrimination means. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
-
Specification