Method and system for extracting web query interfaces
First Claim
1. A computer program product being embodied on a computer readable medium for extracting semantic information about a plurality of documents being accessible via a computer network, said computer program product comprising computer-executable instructions for:
- generating a plurality of tokens from at least one of the documents, each token being indicative of a displayed item and a corresponding position; and
, constructing at least one parse tree indicative of a semantic structure of the at least one document from said tokens dependently upon a grammar being indicative of presentation conventions.
3 Assignments
0 Petitions
Accused Products
Abstract
A computer program product being embodied on a computer readable medium for extracting semantic information about a plurality of documents being accessible via a computer network, the computer program product including computer-executable instructions for: generating a plurality of tokens from at least one of the documents, each token being indicative of a displayed item and a corresponding position; and, constructing at least one parse tree indicative of a semantic structure of the at least one document from the tokens dependently upon a grammar being indicative of presentation conventions.
28 Citations
21 Claims
-
1. A computer program product being embodied on a computer readable medium for extracting semantic information about a plurality of documents being accessible via a computer network, said computer program product comprising computer-executable instructions for:
-
generating a plurality of tokens from at least one of the documents, each token being indicative of a displayed item and a corresponding position; and
,constructing at least one parse tree indicative of a semantic structure of the at least one document from said tokens dependently upon a grammar being indicative of presentation conventions. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
-
10. A web querying device comprising:
-
a form extractor for generating a plurality of tokens from at least one of the documents, each token being indicative of a displayed item and a corresponding position; and
,a soft parser for constructing at least one parse tree indicative of a semantic structure of the at least one document from said tokens dependently upon a grammar being indicative of presentation conventions. - View Dependent Claims (11, 12, 13, 14, 15, 16, 17, 18)
-
-
19. A device for automatically querying a plurality of electronic query documents comprising:
-
means for generating a plurality of tokens from at least one of the documents, each token being indicative of a displayed item and a corresponding position; and
,means for constructing at least one parse tree indicative of a semantic structure of the at least one document from said tokens dependently upon a grammar being indicative of presentation conventions. - View Dependent Claims (20)
-
-
21. A method for extracting semantic information about a plurality of documents being accessible via a computer network, said method comprising:
-
generating a plurality of tokens from at least one of the documents, each token being indicative of a displayed item and a corresponding position; and
,constructing at least one parse tree indicative of a semantic structure of the at least one document from said tokens dependently upon a grammar being indicative of presentation conventions.
-
Specification