Method and system for extraction
Abstract
A system and method for extracting information from at least one document in at least one set of documents, the method comprising: generating, using at least one ranking and/or matching processor, at least one ranked possible match list comprising at least one possible match for at least one target entry on the at least one document, the at least one ranked possible match list based on at least one attribute score and at least one localization score.
39 Claims
1. A method for extracting information from at least one document in at least one set of documents, the method comprising:
generating, using at least one ranking and/or matching processor, at least one ranked possible match list comprising at least one possible match for at least one target entry on the at least one document, the at least one ranked possible match list based on at least one attribute score and at least one localization score;
determining, using at least one features processor, negative features and positive features based on N-gram statistics;
determining, using at least one negative features processor, whether negative features apply to the at least one possible match;
deleting, using at least one deleting processor, any possible match to which the negative feature applies from the at least one possible match list;
determining, using at least one positive features processor, whether any of the possible matches are positive features; and
re-ordering, using at least one re-ordering processor, the possible matches in the at least one possible match list based on the information learned from determining whether any of the possible matches are positive features.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12.
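The steps recited in claim 1 can be illustrated with a minimal Python sketch. It is only one possible reading of the claim language, not the patented implementation: the `Candidate` type, the equal-weight sum of the two scores, and the representation of features as sets of n-gram strings found near a candidate are all assumptions introduced for illustration.

```python
from dataclasses import dataclass


@dataclass
class Candidate:
    text: str                  # candidate value for the target entry
    attribute_score: float     # assumed to lie in [0, 1]
    localization_score: float  # assumed to lie in [0, 1]
    context: tuple = ()        # n-grams near the candidate (assumed representation)


def rank(candidates, weight=0.5):
    """Build the ranked possible match list from a weighted sum of both scores."""
    def combined(c):
        return weight * c.attribute_score + (1 - weight) * c.localization_score
    return sorted(candidates, key=combined, reverse=True)


def drop_negative(ranked, negative_features):
    """Delete every candidate to which at least one negative feature applies."""
    return [c for c in ranked if not negative_features.intersection(c.context)]


def reorder_by_positive(ranked, positive_features):
    """Move candidates with more positive features forward.

    sorted() is stable, so candidates with equal counts keep their rank order.
    """
    return sorted(ranked, key=lambda c: -len(positive_features.intersection(c.context)))


def extract(candidates, negative_features, positive_features):
    ranked = rank(candidates)
    kept = drop_negative(ranked, negative_features)
    return reorder_by_positive(kept, positive_features)


# Hypothetical example: looking for an invoice date among three date strings.
candidates = [
    Candidate("01/02/2020", 0.9, 0.8, ("delivery date",)),
    Candidate("03/04/2020", 0.7, 0.7),
    Candidate("05/06/2020", 0.6, 0.6, ("invoice date",)),
]
result = extract(candidates, {"delivery date"}, {"invoice date"})
print([c.text for c in result])  # ['05/06/2020', '03/04/2020']
```

In this sketch the highest-scoring candidate is removed because a negative feature applies to it, and the lowest-scoring one moves to the top because a positive feature applies to it.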
13. A method for extracting information from at least one document in at least one set of documents, the method comprising:
generating, using at least one ranking and/or matching processor, at least one ranked possible match list comprising at least one possible match for at least one target entry on the at least one document, the at least one ranked possible match list based on at least one attribute score and at least one localization score.
Dependent claims: 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24.
25. A method for extracting information from at least one document in at least one set of documents, the method comprising:
generating, using at least one ranking and/or matching processor, at least one ranked possible match list comprising at least one possible match for at least one target entry on the at least one document, the at least one ranked possible match list based on at least one attribute score and at least one localization score;
determining, using at least one features processor, positive features based on N-gram statistics; and
re-ordering, using at least one re-ordering processor, the possible matches in the at least one possible match list based on the information learned from determining whether any of the possible matches are positive features.
Dependent claims: 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36.
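Claim 25 does not state how positive features are derived from N-gram statistics. As a hedged illustration only, the following Python sketch counts word bigrams in text surrounding previously confirmed and previously rejected matches, and treats a bigram as a positive feature when it occurs often near confirmed matches and rarely near rejected ones. The function names, the thresholds `min_count` and `min_ratio`, and the use of labeled context strings are assumptions, not details taken from the patent.

```python
from collections import Counter


def word_ngrams(text, n=2):
    """Return the lowercase word n-grams of a whitespace-tokenized string."""
    tokens = text.lower().split()
    return [" ".join(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]


def positive_features(correct_contexts, incorrect_contexts,
                      n=2, min_count=2, min_ratio=0.8):
    """Select n-grams that are frequent near correct matches and rare near incorrect ones."""
    # Count each n-gram at most once per context string (document frequency).
    good = Counter(g for t in correct_contexts for g in set(word_ngrams(t, n)))
    bad = Counter(g for t in incorrect_contexts for g in set(word_ngrams(t, n)))
    return {
        g for g, k in good.items()
        if k >= min_count and k / (k + bad[g]) >= min_ratio
    }


# Hypothetical training contexts for an "invoice date" target entry.
correct = ["Invoice date : 05/06/2020", "invoice date 01/01/2021", "total due invoice date"]
incorrect = ["delivery date : 01/02/2020", "order date : 03/03/2020"]
print(positive_features(correct, incorrect))  # {'invoice date'}
```

The resulting set could then be used to move candidates that appear near such n-grams toward the front of the ranked possible match list.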
37. A computer system for extracting information from at least one document in at least one set of documents, the system comprising:
at least one processor;
wherein the at least one processor is configured to perform;
generating, using at least one ranking and/or matching processor, at least one ranked possible match list comprising at least one possible match for at least one target entry on the at least one document, the at least one ranked possible match list based on at least one attribute score and at least one localization score;
determining, using at least one features processor, negative features and positive features based on N-gram statistics;
determining, using at least one negative features processor, whether negative features apply to the at least one possible match;
deleting, using at least one deleting processor, any possible match to which the negative feature applies from the at least one possible match list;
determining, using at least one positive features processor, whether any of the possible matches are positive features; and
re-ordering, using at least one re-ordering processor, the possible matches in the at least one possible match list based on the information learned from determining whether any of the possible matches are positive features.
38. A computerized system for extracting information from at least one document in at least one set of documents, the system comprising:
at least one processor;
wherein the processor is configured to perform;
generating, using at least one ranking and/or matching processor, at least one ranked possible match list comprising at least one possible match for at least one target entry on the at least one document, the at least one ranked possible match list based on at least one attribute score and at least one localization score.
39. A computerized system for extracting information from at least one document in at least one set of documents, the system comprising:
at least one processor;
wherein the processor is configured to perform;
generating, using at least one ranking and/or matching processor, at least one ranked possible match list comprising at least one possible match for at least one target entry on the at least one document, the at least one ranked possible match list based on at least one attribute score and at least one localization score;
determining, using at least one features processor, positive features based on N-gram statistics; and
re-ordering, using at least one re-ordering processor, the possible matches in the at least one possible match list based on the information learned from determining whether any of the possible matches are positive features.
Specification