Method for extracting content from structured or unstructured text documents
First Claim
1. A method for extracting content from a document, comprising the step of:
creating at least one selection envelope based upon a plurality of selection commands for locating specific content within said document; and
selecting content from said document based upon said at least one selection envelope.
Abstract
A method for selecting textual content within a document. Text is selected by applying pattern recognition to the document's structure or to the content itself. A pattern recognition rule selects the desired text by identifying the start and/or end positions of the content in the document. The delineated content is then said to be enclosed in an envelope. A series of envelopes may be used to identify the desired content, with each successive envelope defined relative to the previous one. The contents of any envelope within a series, including the final envelope, may be extracted for use by other documents.
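The envelope mechanism described in the abstract can be illustrated with a minimal sketch. It is not the patented implementation: it assumes that each selection command is a pair of regular expressions (a start pattern and an end pattern, either of which may be omitted) and that each envelope is searched for only inside the previous one. The names `Envelope`, `narrow`, and `extract`, and the sample document, are hypothetical and chosen only for illustration.

```python
import re
from dataclasses import dataclass
from typing import List, Optional, Tuple


@dataclass
class Envelope:
    """A delineated span of the document, stored as absolute offsets."""
    start: int
    end: int

    def text(self, document: str) -> str:
        return document[self.start:self.end]


# Hypothetical command shape: (start_pattern, end_pattern); either may be None.
Command = Tuple[Optional[str], Optional[str]]


def narrow(document: str, outer: Envelope, command: Command) -> Optional[Envelope]:
    """Define a new envelope relative to `outer`.

    The new envelope starts just after the first match of the start pattern
    and ends just before the next match of the end pattern. An omitted
    pattern keeps the corresponding boundary of `outer`. Returns None when
    a supplied pattern is not found inside `outer`.
    """
    start_pattern, end_pattern = command
    start, end = outer.start, outer.end
    if start_pattern is not None:
        match = re.compile(start_pattern).search(document, start, end)
        if match is None:
            return None
        start = match.end()
    if end_pattern is not None:
        match = re.compile(end_pattern).search(document, start, end)
        if match is None:
            return None
        end = match.start()
    return Envelope(start, end)


def extract(document: str, commands: List[Command]) -> Optional[str]:
    """Apply the commands in order and return the final envelope's content."""
    envelope = Envelope(0, len(document))  # the whole document is the first envelope
    for command in commands:
        envelope = narrow(document, envelope, command)
        if envelope is None:
            return None
    return envelope.text(document)


document = (
    "<html><body><h1>Quotes</h1>"
    "<table><tr><td>ACME</td><td>42.10</td></tr></table>"
    "</body></html>"
)
commands = [
    ("<table>", "</table>"),  # envelope 1: the table body
    ("<tr>", "</tr>"),        # envelope 2: the first row, within envelope 1
    ("</td><td>", "</td>"),   # envelope 3: the second cell, within envelope 2
]
print(extract(document, commands))  # prints 42.10
```

Because every envelope is stored as a pair of offsets into the original document, the content of any intermediate envelope in the series can be read out as easily as that of the final one.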
Specification