USING ANCHOR POINTS IN DOCUMENT IDENTIFICATION
First Claim
1. A processor-implemented method for identifying a document file comprising:
- responsive to locating a recognized set of characters in a document file comprising a plurality of characters, using the recognized set of characters an anchor point and performing the steps comprising;
selecting an examination set of characters from the document file, the examination set being selected based upon proximity to the anchor point; and
searching the examination set for one or more indicators to assist in uniquely identifying the document file.
0 Assignments
0 Petitions
Accused Products
Abstract
Systems and methods are disclosed that allow for indexing, processing, or both of information from physical media or electronic media, which may be received from a plurality of sources. In embodiments, a document file may be matched using pattern matching methods and may include comparisons with a comparison reference database to improve or accelerate the indexing process. In embodiments, information may be presented to a user as potential matches thereby improving manual indexing processes. In embodiments, one or more additional actions may occur as part of the processing, including without limitation, association additional data with a document file, making observations from the document file, notifying individuals, creating composite messages, and billing events. In an embodiment, data from a document file may be associated with a key word, key phrase, or word frequency value that enables adaptive learning so that unindexed data may be automatically indexed based on user interaction history.
12 Citations
20 Claims
-
1. A processor-implemented method for identifying a document file comprising:
responsive to locating a recognized set of characters in a document file comprising a plurality of characters, using the recognized set of characters an anchor point and performing the steps comprising; selecting an examination set of characters from the document file, the examination set being selected based upon proximity to the anchor point; and searching the examination set for one or more indicators to assist in uniquely identifying the document file. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
8. A processor-implemented method for identifying a document comprising:
-
searching a document comprising a plurality of characters to identify an anchor point comprising a set of characters; and responsive to identifying an anchor point; assigning proximity weighting to at least some of the characters in the document based upon their position relative to the anchor point; selecting an examination set of characters from the document using the proximity weightings; and searching the examination set for one or more indicators to assist in uniquely identifying the document. - View Dependent Claims (9, 10, 11, 12)
-
-
13. A system comprising:
-
one or more processors; and a non-transitory computer-readable medium or media comprising one or more sequences of instructions which, when executed by at least one of the one or more processors, causes steps to be performed comprising; searching a document comprising a plurality of characters to identify an anchor point comprising a set of characters; and responsive to identifying an anchor point; assigning proximity weighting to at least some of the characters in the document based upon their position relative to the anchor point; selecting an examination set of characters from the document using the proximity weightings; and searching the examination set for one or more indicators to assist in uniquely identifying the document. - View Dependent Claims (14, 15, 16, 17, 18, 19, 20)
-
Specification