Methods and systems for annotating electronic documents
First Claim
1. A computer-implemented method of annotating an electronic document comprising:
- receiving annotation information corresponding to an annotation of a first electronic document file at an annotation location, wherein the first electronic document file is a text document;
generating annotation metadata associated with the annotation information by creating topographic patterns from text of the first electronic document file and the annotation information, wherein the topographic patterns include one or more document patterns, one or more anchor patterns, one or more neighborhood patterns, or combinations thereof;
storing the annotation information and associated annotation metadata in an annotation file that is separate from the first electronic document file;
generating topographic patterns from text of a target electronic document file, wherein;
the topographic patterns include document patterns, anchor patterns, neighborhood patterns, or combinations thereof, and the target electronic document file is a modified version of the first electronic document file such that an underlying text of the target electronic document file is different from an underlying text of the first electronic document file;
the document patterns include a document name, a document length, a document date, document shingles, or combinations thereof;
the anchor patterns of the first electronic document file include a word or a text associated with the annotation located in the first electronic document file, a first character of the word or the text, a last character of the word or the text, a word or text length, or combinations thereof;
the anchor patterns of the target electronic document file include a plurality of words associated with the target electronic document file, wherein characteristics of each word include a position of the word within the target electronic document file, a length of the word, a first character of the word, a first character of the word, or combinations thereof;
the neighborhood patterns of the first electronic document file comprise a plurality of document shingles associated with the annotation location in the first electronic document file; and
the neighborhood patterns of the target electronic document file comprise a plurality of document shingles associated with one or more potential anchor locations;
comparing the topographic patterns of the first electronic document file to the topographic patterns of the target electronic document file to determine an anchor location within the target electronic document file; and
anchoring the annotation information, by a computer device, to the target electronic document file at the anchor location within the text of the target electronic document file.
1 Assignment
0 Petitions
Accused Products
Abstract
A computer-implemented method of annotating an electronic document may include receiving annotation information corresponding to a first electronic document file and creating annotation metadata that is associated with the annotation information. The method may further include storing the annotation information and associated annotation metadata in an annotation file that is separate from the first electronic document file, and anchoring the annotation information to a target electronic document file at an anchor location corresponding to the annotation metadata. The annotation metadata may be generated by assigning a target offset value to individual neighboring tokens defining an annotation neighborhood, wherein the target offset values correspond to positions of the neighboring tokens with respect to an annotation location within the first electronic document file. The annotation metadata may also comprise topographic patterns that are compared between source and target documents to determine the anchor location.
56 Citations
10 Claims
-
1. A computer-implemented method of annotating an electronic document comprising:
-
receiving annotation information corresponding to an annotation of a first electronic document file at an annotation location, wherein the first electronic document file is a text document; generating annotation metadata associated with the annotation information by creating topographic patterns from text of the first electronic document file and the annotation information, wherein the topographic patterns include one or more document patterns, one or more anchor patterns, one or more neighborhood patterns, or combinations thereof; storing the annotation information and associated annotation metadata in an annotation file that is separate from the first electronic document file; generating topographic patterns from text of a target electronic document file, wherein; the topographic patterns include document patterns, anchor patterns, neighborhood patterns, or combinations thereof, and the target electronic document file is a modified version of the first electronic document file such that an underlying text of the target electronic document file is different from an underlying text of the first electronic document file; the document patterns include a document name, a document length, a document date, document shingles, or combinations thereof; the anchor patterns of the first electronic document file include a word or a text associated with the annotation located in the first electronic document file, a first character of the word or the text, a last character of the word or the text, a word or text length, or combinations thereof; the anchor patterns of the target electronic document file include a plurality of words associated with the target electronic document file, wherein characteristics of each word include a position of the word within the target electronic document file, a length of the word, a first character of the word, a first character of the word, or combinations thereof; the neighborhood patterns of the first electronic document file comprise a plurality of document shingles associated with the annotation location in the first electronic document file; and the neighborhood patterns of the target electronic document file comprise a plurality of document shingles associated with one or more potential anchor locations; comparing the topographic patterns of the first electronic document file to the topographic patterns of the target electronic document file to determine an anchor location within the target electronic document file; and anchoring the annotation information, by a computer device, to the target electronic document file at the anchor location within the text of the target electronic document file. - View Dependent Claims (2, 3)
-
-
4. A computer-implemented method of annotating an electronic document comprising:
-
receiving annotation information corresponding to a first electronic document file, wherein the first electronic document file is a text document; creating annotation metadata that is associated with the annotation information and comprises contextual information relating to an annotation location in the first electronic document file, wherein; the annotation metadata is created by assigning a target offset value to neighboring tokens defining an annotation neighborhood, wherein the target offset values correspond to positions of the neighboring tokens with respect to the annotation location within the first electronic document file; the neighboring tokens comprise words within the first electronic document file; the target offset value of a particular neighboring token represents a number of words from which the particular neighboring token is separated from the annotation location; and the target offset value of each neighboring token provides a candidate anchoring position; storing the annotation information and associated annotation metadata in an annotation file that is separate from the first electronic document file; and anchoring, by a computer device, the annotation information to a target electronic document file at an anchor location corresponding to the annotation metadata, wherein the target electronic document file is a modified version of the first electronic document file such that an underlying text of the target electronic document file is different from an underlying text of the first electronic document file, wherein anchoring the annotation information further comprises; evaluating the candidate anchoring positions such that a weight of each candidate anchoring position is determined at least in part on an influence function; selecting the candidate anchoring position having a maximum preference from the neighboring tokens as the anchor location; and anchoring the annotation information at the anchor location within the target electronic document file. - View Dependent Claims (5, 6)
-
-
7. A computer-implemented method of annotating an electronic document comprising:
-
receiving annotation information corresponding to a first electronic document file, wherein the first electronic document file is a text document; creating annotation metadata that is associated with the annotation information and comprises contextual information relating to an annotation location in the first electronic document file, wherein; the annotation metadata is created by assigning a target offset value to neighboring tokens defining an annotation neighborhood, wherein the target offset values correspond to positions of the neighboring tokens with respect to the annotation location within the first electronic document file; the target offset value of each neighboring token provides a candidate anchoring position; a base token is associated with the annotation location; a magnitude of the target offset value of the base token is less than the magnitude of each of the remaining target offset values associated with the remaining neighborhood tokens; and the magnitude of the target offset value of a neighboring token that is located relatively further from the annotation location is greater than the magnitude of the target offset value of a neighboring token that is located relatively closer to the annotation location; storing the annotation information and associated annotation metadata in an annotation file that is separate from the first electronic document file; and anchoring, by a computer device, the annotation information to a target electronic document file at an anchor location corresponding to the annotation metadata, wherein the target electronic document file is a modified version of the first electronic document file such that an underlying text of the target electronic document file is different from an underlying text of the first electronic document file, wherein anchoring the annotation information further comprises; evaluating the candidate anchoring positions such that a weight of each candidate anchoring position is determined at least in part on an influence function; selecting the candidate anchoring position having a maximum preference from the neighboring tokens as the anchor location; and anchoring the annotation information at the anchor location within the target electronic document file. - View Dependent Claims (8, 9, 10)
-
Specification