AUTOMATED IDENTIFICATION OF RECURRING TEXT
First Claim
1. One or more computer-readable media having instructions stored thereon which, when executed by a processor of a computing device, cause the computing device to provide a recurring text identification service configured to:
- receive a request to identify recurring text within a plurality of documents;
analyze individual segments of the plurality of documents to generate segment identifiers respectively associated with the segments, wherein the segment identifiers are based at least in part on content of the segments, and wherein segments with the same content have equivalent segment identifiers;
generate a distribution of the segment identifiers; and
enable the distribution of segment identifiers to be used to streamline identification of recurring text within the plurality of documents.
6 Assignments
0 Petitions
Accused Products
Abstract
In embodiments, one or more computer-readable media may have instructions stored thereon which, when executed by a processor of a computing device, provide the computing device with a recurring text identification service. The recurring text identification service may be configured, in some embodiments, to receive a request to identify recurring text within a plurality of documents. The recurring text identification service may be further configured to analyze individual segments of the plurality of documents to generate segment identifiers respectively associated with the segments. In embodiments, the segment identifiers may be based on content of the segments. In embodiments, segments with the same content may have equivalent segment identifiers. The recurring text identification service may further be configured to generate a distribution of the segment identifiers and may enable the distribution of segment identifiers to be used to streamline identification of recurring text within the plurality of documents.
13 Citations
27 Claims
-
1. One or more computer-readable media having instructions stored thereon which, when executed by a processor of a computing device, cause the computing device to provide a recurring text identification service configured to:
-
receive a request to identify recurring text within a plurality of documents; analyze individual segments of the plurality of documents to generate segment identifiers respectively associated with the segments, wherein the segment identifiers are based at least in part on content of the segments, and wherein segments with the same content have equivalent segment identifiers; generate a distribution of the segment identifiers; and enable the distribution of segment identifiers to be used to streamline identification of recurring text within the plurality of documents. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
-
10. A system for identifying recurring text contained within one or more documents comprising:
-
a processor; and a recurring text identification service configured to cause the processor to; receive a request to identify recurring text within a plurality of documents; analyze individual segments of the plurality of documents to generate segment identifiers respectively associated with the segments, wherein the segment identifiers are based at least in part on content of the segments, and wherein segments with the same content have equivalent segment identifiers; generate a distribution of the segment identifiers; and enable the distribution of segment identifiers to be used to streamline identification of recurring text within the plurality of documents. - View Dependent Claims (11, 12, 13, 14, 15, 16, 17, 18)
-
-
19. A computer-implemented method for identifying recurring text in one or more documents comprising:
-
receiving, by a recurring text identification service of a computing device, a request to identify recurring text within a plurality of documents; analyzing, by the recurring text identification service, individual segments of the plurality of documents to generate segment identifiers respectively associated with the segments, wherein the segment identifiers are based at least in part on content of the segments, and wherein segments with the same content have equivalent segment identifiers; generating, by the recurring text identification service, a distribution of the segment identifiers; and enabling, by the recurring text identification service, the distribution of segment identifiers to be used in streamlining identification of recurring text within the plurality of documents. - View Dependent Claims (20, 21, 22, 23, 24, 25, 26, 27)
-
Specification