Automated identification and marking of new and changed content in a structured document
Abstract
A method for indicating changes in a structured document includes identifying a new topic or a modified topic in an updated version of a structured document having one or more topics, generating a table of contents having one or more topic entries associated with the one or more topics, and marking a topic entry associated with the new topic or the modified topic with a marker indicating that the associated topic is new or modified. A system includes an unmarked table of contents having an entry associated with a unit of updated topic content, and a marking module marking the entry if the unit of updated topic content has changed or the unit of updated topic content comprises a new topic.
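The marking step described in the abstract can be illustrated with a minimal sketch. It assumes the new and modified topics have already been identified and are passed in as sets of topic identifiers. The names `TocEntry`, `mark_toc`, `render`, and the marker strings `NEW` and `MODIFIED` are hypothetical and do not come from the patent.

```python
from dataclasses import dataclass


@dataclass
class TocEntry:
    topic_id: str      # identifier of the topic this entry points to
    title: str         # text shown in the table of contents
    marker: str = ""   # "" (unmarked), "NEW", or "MODIFIED" (assumed labels)


def mark_toc(entries, new_ids, modified_ids):
    """Return a marked copy of an unmarked table of contents."""
    marked = []
    for entry in entries:
        if entry.topic_id in new_ids:
            marker = "NEW"
        elif entry.topic_id in modified_ids:
            marker = "MODIFIED"
        else:
            marker = ""
        marked.append(TocEntry(entry.topic_id, entry.title, marker))
    return marked


def render(entries):
    """Format each entry as a line, appending the marker when present."""
    return [f"{e.title} [{e.marker}]" if e.marker else e.title for e in entries]


toc = [
    TocEntry("t1", "Installing"),
    TocEntry("t2", "Configuring"),
    TocEntry("t3", "Troubleshooting"),
]
lines = render(mark_toc(toc, new_ids={"t3"}, modified_ids={"t2"}))
```

The original entries are left untouched and a marked copy is returned, which mirrors the abstract's distinction between an unmarked table of contents and the output of the marking module.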
24 Citations
15 Claims
1. An automatic method of indicating changes in a structured document implemented via a computer, the method comprising:
determining that a unit of content in an updated version of a structured document and a related unit of content in a previous version of the structured document are associated with the same entry, wherein determining comprises:
accessing a base topic set having a single topic identifier associated with each unit of content in the previous version of the structured document;
accessing an updated topic set having a single topic identifier associated with each unit of content in the updated version of the structured document;
identifying a particular topic identifier in the updated topic set that corresponds to the same particular topic identifier in the base topic set;
comparing the unit of content associated with the particular topic identifier in the updated version of the structured document with the related unit of content associated with the particular topic identifier in the previous version of the structured document to determine whether the unit of content in the updated version of the structured document has been modified with respect to the related unit of content in the previous version of the structured document;
generating a table of contents associated with the updated version of the structured document, the table of contents having one entry associated with each unit of content in the updated version of the structured document; and
marking a first entry in the table of contents if the unit of content associated with the first entry has been modified a predetermined degree from a previous version of the content, wherein the predetermined degree is represented by a difference metric as determined by a content comparator, such that:
in an event that the difference is counted as a modification, the difference metric represents each change in words, tags, and formatting, wherein the changes comprise changes in font, color, size, inserted content, and deleted content between the respective units of content in the updated version of the structured document and the previous version of the structured document; and
in an event that not all differences are counted as modifications, changes are classified by type such that changes of meaning are classified in a different type than changes in form, wherein changes in form include rephrasing that does not change the meaning of the content, and the difference metric represents the number of changes of meaning within each unit of content.
Dependent claims: 2, 3, 4
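The two modes of the difference metric in claim 1 can be sketched as follows. This is an illustration under stated assumptions, not the patented comparator: content is tokenized into markup tags and words, each differing region reported by Python's `difflib.SequenceMatcher` counts as one change, and a change is treated as a change in form when it touches only tags or letter case. That heuristic is a crude stand-in; it does not detect rephrasing that preserves meaning, which the claim also classifies as a change in form. All function names and the `threshold` parameter are hypothetical.

```python
import difflib
import re

TAG = re.compile(r"<[^>]+>")


def tokens(text):
    """Split content into markup tags and words so both kinds of change are visible."""
    return re.findall(r"<[^>]+>|\w+", text)


def changes(old, new):
    """Yield (removed_tokens, added_tokens) for each differing region."""
    a, b = tokens(old), tokens(new)
    matcher = difflib.SequenceMatcher(a=a, b=b, autojunk=False)
    for op, i1, i2, j1, j2 in matcher.get_opcodes():
        if op != "equal":
            yield a[i1:i2], b[j1:j2]


def _words(toks):
    """Drop tags and fold case, leaving only the wording."""
    return [t.lower() for t in toks if not TAG.fullmatch(t)]


def is_form_only(removed, added):
    """Assumed heuristic: the change alters only tags or letter case."""
    return _words(removed) == _words(added)


def difference_metric(old, new, count_all=True):
    """Count every change (count_all=True) or only changes of meaning."""
    return sum(
        1
        for removed, added in changes(old, new)
        if count_all or not is_form_only(removed, added)
    )


def is_modified(old, new, threshold=1, count_all=True):
    """Mark the entry only when the metric reaches the predetermined degree."""
    return difference_metric(old, new, count_all) >= threshold


old = "<b>Press</b> the red button"
new = "<i>Press</i> the green button"
all_changes = difference_metric(old, new)                       # tags and words
meaning_changes = difference_metric(old, new, count_all=False)  # words only
```

With a threshold of 2, the example pair is flagged as modified when every change counts, but not when only changes of meaning count, because the tag swaps are classified as changes in form.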
5. A computer-readable storage medium having stored thereon computer-executable instructions that, when executed, cause a computer to perform a process comprising:
determining that a unit of content in an updated version of a structured document and a related unit of content in a previous version of the structured document are associated with the same entry, wherein determining comprises:
accessing a base topic set having a single topic identifier associated with each unit of content in the previous version of the structured document;
accessing an updated topic set having a single topic identifier associated with each unit of content in the updated version of the structured document;
identifying a particular topic identifier in the updated topic set that corresponds to the same particular topic identifier in the base topic set;
comparing the unit of content associated with the particular topic identifier in the updated version with the related unit of content associated with the particular topic identifier in the previous version to determine whether the unit of content in the updated version has been modified with respect to the related unit of content;
generating a table of contents associated with the updated version of the structured document, the table of contents having one entry associated with each unit of content in the updated version of the structured document; and
marking a first entry in the table of contents if the unit of content associated with the first entry has been modified a predetermined degree from a previous version of the content, wherein the predetermined degree is represented by a difference metric as determined by a content comparator, such that:
in an event that the difference is counted as a modification, the difference metric represents each change in words, tags, and formatting, wherein the changes comprise changes in font, color, size, inserted content, and deleted content between the respective units of content in the updated version and the previous version; and
in an event that not all differences are counted as modifications, changes are classified by type such that changes of meaning are classified in a different type than changes in form, wherein changes in form include rephrasing that does not change the meaning of the content, and the difference metric represents the number of changes of meaning within each unit of content.
Dependent claims: 6, 7, 8, 9, 10, 11, 12
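The pairing of topic identifiers between the base topic set and the updated topic set can be sketched as below. This is a hedged illustration: each topic set is modeled as a dictionary from topic identifier to unit of content, so every unit has exactly one identifier, and plain inequality stands in for the content comparator (the claim instead requires a difference metric and a predetermined degree). The function name `pair_topics` and the sample data are hypothetical.

```python
def pair_topics(base, updated):
    """Pair units of content by topic identifier.

    base, updated: dicts mapping topic identifier -> unit of content.
    Returns (shared, new, modified) lists of identifiers, in the order
    the identifiers appear in the updated topic set.
    """
    # Identifiers present in both versions refer to the same entry.
    shared = [tid for tid in updated if tid in base]
    # Identifiers absent from the base topic set are new topics.
    new = [tid for tid in updated if tid not in base]
    # Stand-in comparator: any inequality counts as a modification.
    modified = [tid for tid in shared if updated[tid] != base[tid]]
    return shared, new, modified


base_topics = {
    "install": "Run the installer.",
    "config": "Edit the settings file.",
}
updated_topics = {
    "install": "Run the installer.",
    "config": "Edit the settings file, then restart.",
    "faq": "Answers to common questions.",
}
shared, new, modified = pair_topics(base_topics, updated_topics)
```

Keying on the identifier rather than on title or position means a topic that is renamed or moved within the document is still compared against its own previous version.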
13. A system comprising:
means for storing one or more units of content included in an updated version of a structured document;
means for updating a table of contents having an entry associated with each unit of content;
means for automatically generating a list of modified topics comprising means for determining that a unit of content in the updated version of the structured document and a related unit of content in a previous version of the structured document are associated with the same entry, wherein the means for determining comprises:
means for accessing a base topic set having a single topic identifier associated with each unit of content in the previous version of the structured document;
means for accessing an updated topic set having a single topic identifier associated with each unit of content in the updated version of the structured document;
means for identifying a particular topic identifier in the updated topic set that corresponds to the same particular topic identifier in the base topic set;
means for comparing the unit of content associated with the particular topic identifier in the updated version of the structured document with the related unit of content associated with the particular topic identifier in the previous version of the structured document, to determine whether the unit of content in the updated version of the structured document has been modified with respect to the related unit of content in the previous version of the structured document;
means for generating a table of contents associated with the updated version of a structured document, the table of contents having one entry associated with each unit of content in the updated version of the structured document; and
means for automatically marking a first entry in the table of contents indicating that the unit of content associated with the first entry has been modified a predetermined degree from a previous version of the content, wherein the predetermined degree is represented by a difference metric as determined by a content comparator, such that:
in an event that each difference is counted as a modification, the difference metric represents each change in words, tags, and formatting, wherein the changes comprise changes in font, color, size, inserted content, and deleted content between the respective units of content in the updated version of the structured document and the previous version of the structured document; and
in an event that not all differences are counted as modifications, changes are classified by type such that changes of meaning are classified in a different type than changes in form, wherein changes in form include rephrasing that does not change the meaning of the content, and the difference metric represents the number of changes of meaning within each unit of content.
Dependent claims: 14, 15
Specification