System and method for retrieving documents or sub-documents based on examples
First Claim
1. A system for extracting information comprising:
- a query input;
a database of documents;
a plurality of classifiers arranged in a hierarchical cascade of classifier layers, wherein each classifier comprises a set of weighted training data points comprising feature vectors representing any portion of a document, and wherein said classifiers are operable to retrieve documents from said database matching said query input; and
a terminal classifier weighing an output from said cascade according to a rate of success of query terms being matched by each layer of said cascade.
0 Assignments
0 Petitions
Accused Products
Abstract
Disclosed are a system, method, and program storage device implementing the method of extracting information, wherein the method comprises inputting a query; searching a database of documents based on the query; retrieving documents from the database matching the query using a plurality of classifiers arranged in a hierarchical cascade of classifier layers, wherein each classifier comprises a set of weighted training data points comprising feature vectors representing any portion of a document; and weighing an output from the cascade according to a rate of success of query terms being matched by each layer of the cascade, wherein the weighing is performed using a terminal classifier.
43 Citations
57 Claims
-
1. A system for extracting information comprising:
-
a query input;
a database of documents;
a plurality of classifiers arranged in a hierarchical cascade of classifier layers, wherein each classifier comprises a set of weighted training data points comprising feature vectors representing any portion of a document, and wherein said classifiers are operable to retrieve documents from said database matching said query input; and
a terminal classifier weighing an output from said cascade according to a rate of success of query terms being matched by each layer of said cascade. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 40, 41, 42, 43, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56)
-
-
20. A method of extracting information, said method comprising:
-
inputting a query;
searching a database of documents based on said query;
retrieving documents from said database matching said query using a plurality of classifiers arranged in a hierarchical cascade of classifier layers, wherein each classifier comprises a set of weighted training data points comprising feature vectors representing any portion of a document; and
weighing an output from said cascade according to a rate of success of query terms being matched by each layer of said cascade, wherein said weighing is performed using a terminal classifier. - View Dependent Claims (21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38)
-
-
39. A program storage device readable by computer, tangibly embodying a program of instructions executable by said computer to perform a program storage device of extracting information, said program storage device comprising:
-
inputting a query;
searching a database of documents based on said query;
retrieving documents from said database matching said query using a plurality of classifiers arranged in a hierarchical cascade of classifier layers, wherein each classifier comprises a set of weighted training data points comprising feature vectors representing any portion of a document; and
weighing an output from said cascade according to a rate of success of query terms being matched by each layer of said cascade, wherein said weighing is performed using a terminal classifier. - View Dependent Claims (44, 45, 46, 57)
-
Specification