Method of and system for extracting predetermined elements from input document based upon model which is adaptively modified according to variable amount in the input document
Abstract
Elements are extracted from an input document image according to a predetermined extraction model. Based upon the variability in a predetermined set of layout characteristics of the extracted elements, the extraction model is adaptively modified to improve future performance in extracting the elements.
18 Claims
1. A method of extracting one or more elements from a document using model data, the model data including at least a template, comprising:
a) determining the template for a predetermined document type, the template having a set of predetermined characteristics for each of the elements;
b) inputting at least one input document;
c) extracting the elements having the predetermined characteristics from the input document according to the model data;
d) storing the extracted characteristics of the elements in the model data;
e) determining a distance value between the stored characteristics and a corresponding one in the model data;
f) determining a variable amount based upon the distance value for each of the elements; and
g) modifying the model data based upon the variable amount.
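Steps a) through g) can be read as a feedback loop between extraction and the model. The following is a minimal Python sketch of that loop, not the patent's implementation. It assumes each characteristic is a numeric layout position (`x`, `y`), that the distance value is an L1 distance to the template position, that the variable amount is the mean of the stored distances, and that the model is modified by widening a per-element `tolerance`. All names and values (`make_model`, `extract`, `learn`, `tolerance`) are hypothetical.

```python
from statistics import mean

def make_model():
    """Step a): a template with assumed characteristics for each element."""
    return {
        "template": {
            "title": {"x": 100.0, "y": 50.0, "tolerance": 10.0},
            "date": {"x": 400.0, "y": 50.0, "tolerance": 10.0},
        },
        # Step d) storage: extracted characteristics kept inside the model data.
        "history": {"title": [], "date": []},
    }

def extract(model, candidates):
    """Step c): per element, take the first candidate within tolerance."""
    found = {}
    for name, spec in model["template"].items():
        for cand in candidates:
            if (abs(cand["x"] - spec["x"]) <= spec["tolerance"]
                    and abs(cand["y"] - spec["y"]) <= spec["tolerance"]):
                found[name] = cand
                break
    return found

def learn(model, found):
    """Steps d) to g): store, measure distance, derive variability, update."""
    for name, cand in found.items():
        spec = model["template"][name]
        # d) store the extracted characteristics in the model data
        model["history"][name].append((cand["x"], cand["y"]))
        # e) distance between each stored value and the template (assumed: L1)
        distances = [abs(x - spec["x"]) + abs(y - spec["y"])
                     for x, y in model["history"][name]]
        # f) variable amount per element (assumed: mean distance)
        variable_amount = mean(distances)
        # g) modify the model (assumed: widen the tolerance, never shrink it)
        spec["tolerance"] = max(spec["tolerance"], 2.0 * variable_amount)
    return model
```

Under these assumptions, an element whose observed position varies widely across input documents ends up with a wider matching tolerance, while a stable element keeps its original one. Other readings of "variable amount" (for example a variance or a per-characteristic range) would fit the claim language equally well.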
10. A system for extracting one or more elements from a document using model data, comprising:
a model generation unit for generating model data for a predetermined document type, the model data including at least a template, the template having a set of predetermined characteristics for each of the elements;
a document input unit for inputting at least one input document;
an element extraction unit connected to said model generation unit and said document input unit for extracting the elements having the predetermined characteristics from the input document according to the model data;
a characteristics storage unit connected to said element extraction unit for storing the extracted characteristics of the elements in the model data;
a learning process unit connected to said characteristics storage unit for determining a distance value between the stored characteristics and a corresponding one in the model data and for determining a variable amount based upon the distance value; and
a model updating unit connected to said learning process unit and said model generation unit for modifying the model data based upon the variable amount.
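The connections among the units of claim 10 can be illustrated by wiring one small class per unit. This is a hedged sketch only: the class names mirror the claim, but the method names, the single `y` characteristic, the `spread` field, and the use of a mean absolute distance as the variable amount are all assumptions made for illustration. The learning unit is also given a reference to the model generation unit so that it can read the template, which the claim does not spell out.

```python
from statistics import mean

class ModelGenerationUnit:
    """Holds the model data; the template values are illustrative."""
    def __init__(self):
        self.model = {"template": {"title": {"y": 50.0, "spread": 0.0}}}

class DocumentInputUnit:
    """Supplies input documents, assumed pre-parsed as name -> {"y": value}."""
    def __init__(self, documents):
        self.documents = list(documents)

class CharacteristicsStorageUnit:
    """Stores extracted characteristics per element."""
    def __init__(self):
        self.stored = {}

    def store(self, name, characteristics):
        self.stored.setdefault(name, []).append(characteristics)

class ElementExtractionUnit:
    """Connected to the model generation, document input, and storage units."""
    def __init__(self, generator, doc_input, storage):
        self.generator, self.doc_input, self.storage = generator, doc_input, storage

    def run(self):
        for doc in self.doc_input.documents:
            for name in self.generator.model["template"]:
                if name in doc:
                    self.storage.store(name, doc[name])

class LearningProcessUnit:
    """Connected to the storage unit; computes distances and variable amounts."""
    def __init__(self, storage, generator):
        self.storage, self.generator = storage, generator

    def variable_amounts(self):
        template = self.generator.model["template"]
        return {
            name: mean([abs(c["y"] - template[name]["y"]) for c in items])
            for name, items in self.storage.stored.items()
        }

class ModelUpdatingUnit:
    """Connected to the learning and model generation units; updates the model."""
    def __init__(self, learner, generator):
        self.learner, self.generator = learner, generator

    def update(self):
        for name, amount in self.learner.variable_amounts().items():
            self.generator.model["template"][name]["spread"] = amount
```

Keeping the model inside `ModelGenerationUnit` and passing that object to the other units means an update made by `ModelUpdatingUnit` is visible to the next extraction pass, which is the adaptive behavior the claim describes.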
Specification