Method of learning associations between documents and data sets
First Claim
Patent Images
1. A method of learning associations between classes of documents and one or more structured data sets, said method comprising the steps of:
- classifying a document into a class selected from a predefined set of classes;
displaying one or more structured data sets, wherein the displayed structured data sets are dependent on association information for the class;
receiving one or more indications of changes to the displayed structured data sets;
amending the association information for the class based on the received indications.
1 Assignment
0 Petitions
Accused Products
Abstract
A method of learning associations between classes of documents and one or more structured data sets comprises a step of classifying an input document into a class selected from a predefined set of classes (step 115). One or more structured data sets are displayed (step 130), wherein the displayed structured data sets are dependent on association information for the class. One or more indications of changes to the displayed structured data sets are received (steps 815, 830, 845) and the association information for the class is amended (step 850) based on the received indications.
-
Citations
26 Claims
-
1. A method of learning associations between classes of documents and one or more structured data sets, said method comprising the steps of:
-
classifying a document into a class selected from a predefined set of classes;
displaying one or more structured data sets, wherein the displayed structured data sets are dependent on association information for the class;
receiving one or more indications of changes to the displayed structured data sets;
amending the association information for the class based on the received indications. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14)
-
-
15. A method of extracting information for processing a document;
- the method comprising the steps of;
classifying the document into a class selected from a predefined set of classes, wherein said classifying is dependent on first information in the document;
identifying a data set based on second information in the document, wherein said identifying is dependent on association information adaptively obtained through processing other documents in the class;
extracting data from the document and the data set to process the document according to one or more tasks associated with the class.
- the method comprising the steps of;
-
16. A method of verifying information for processing a document, the method comprising the steps of:
-
classifying the document into a class selected from a predefined set of classes, wherein said classifying is dependent on first information in the document;
identifying a data set based on second information in the document, wherein said identifying is dependent on association information adaptively obtained through processing other documents in the class;
verifying information in the document using the identified data set;
extracting information from the document to process the document according to one or more tasks associated with the class.
-
-
17. An apparatus for learning associations between classes of documents and one or more structured data sets, said apparatus comprising:
-
means for classifying a document into a class selected from a predefined set of classes;
means for displaying one or more structured data sets, wherein the displayed structured data sets are dependent on association information for the class;
means for receiving one or more indications of changes to the displayed structured data sets; and
means for amending the association information for the class based on the received indications.
-
-
18. An apparatus for extracting information for processing a document, said apparatus comprising:
-
means for classifying the document into a class selected from a predefined set of classes, wherein said classifying is dependent on first information in the document;
means for identifying a data set based on second information in the document, wherein said identifying is dependent on association information adaptively obtained through processing other documents in the class;
means for extracting data from the document and the data set to process the document according to one or more tasks associated with the class.
-
-
19. An apparatus for verifying information for processing a document, the apparatus comprising:
-
means for classifying the document into a class selected from a predefined set of classes, wherein said classifying is dependent on first information in the document;
means for identifying a data set based on second information in the document, wherein said identifying is dependent on association information adaptively obtained through processing other documents in the class;
means for verifying information in the document using the identified data set;
means for extracting information from the document to process the document according to one or more tasks associated with the class.
-
-
20. A computer program product comprising machine-readable program code recorded on a machine-readable recording medium, for controlling the operation of a data processing apparatus on which the program code executes to perform a method of learning associations between classes of documents and one or more structured data sets, said method comprising the steps of:
-
classifying a document into a class selected from a predefined set of classes;
displaying one or more structured data sets, wherein the displayed structured data sets are dependent on association information for the class;
receiving one or more indications of changes to the displayed structured data sets; and
amending the association information for the class based on the received indications.
-
-
21. A computer program product comprising machine-readable program code recorded on a machine-readable recording medium, for controlling the operation of a data processing apparatus on which the program code executes to perform a method of extracting information for processing a document;
- the method comprising the steps of;
classifying the document into a class selected from a predefined set of classes, wherein said classifying is dependent on first information in the document;
identifying a data set based on second information in the document, wherein said identifying is dependent on association information adaptively obtained through processing other documents in the class;
extracting data from the document and the data set to process the document according to one or more tasks associated with the class.
- the method comprising the steps of;
-
22. A computer program product comprising machine-readable program code recorded on a machine-readable recording medium, for controlling the operation of a data processing apparatus on which the program code executes to perform a method of verifying information for processing a document, the method comprising the steps of:
-
classifying the document into a class selected from a predefined set of classes, wherein said classifying is dependent on first information in the document;
identifying a data set based on second information in the document, wherein said identifying is dependent on association information adaptively obtained through processing other documents in the class;
verifying information in the document using the identified data set;
extracting information from the document to process the document according to one or more tasks associated with the class.
-
-
23. A computer program comprising machine-readable program code for controlling the operation of a data processing apparatus on which the program executes to perform a method of learning associations between classes of documents and one or more structured data sets, said method comprising the steps of:
-
classifying a document into a class selected from a predefined set of classes;
displaying one or more structured data sets, wherein the displayed structured data sets are dependent on association information for the class;
receiving one or more indications of changes to the displayed structured data sets;
amending the association information for the class based on the received indications.
-
-
24. A computer program comprising machine-readable program code for controlling the operation of a data processing apparatus on which the program executes to perform a method of extracting information for processing a document;
- the method comprising the steps of;
classifying the document into a class selected from a predefined set of classes, wherein said classifying is dependent on first information in the document;
identifying a data set based on second information in the document, wherein said identifying is dependent on association information adaptively obtained through processing other documents in the class;
extracting data from the document and the data set to process the document according to one or more tasks associated with the class.
- the method comprising the steps of;
-
25. A computer program comprising machine-readable program code for controlling the operation of a data processing apparatus on which the program executes to perform a method of verifying information for processing a document, the method comprising the steps of:
-
classifying the document into a class selected from a predefined set of classes, wherein said classifying is dependent on first information in the document;
identifying a data set based on second information in the document, wherein said identifying is dependent on association information adaptively obtained through processing other documents in the class;
verifying information in the document using the identified data set;
extracting information from the document to process the document according to one or more tasks associated with the class.
-
-
26. A system for learning associations between classes of documents and one or more structured data sets, said system comprising:
-
data storage for storing at least one document, association information for a predefined set of classes of documents, and one or more databases; and
a processor in communication with the data storage and adapted to;
classify a document into a corresponding class selected from the predefined set of classes;
display one or more structured data sets derived from the one or more databases based on the association information for the corresponding class;
receive one or more indications of changes to the displayed structured data sets; and
amend the association information for the corresponding class based on the received indications.
-
Specification