IDENTIFYING CONTENT OF INTEREST
0 Assignments
0 Petitions
Accused Products
Abstract
Methods of identifying content of interest within a corpus are disclosed. The methods may comprise the step of applying a first marker set to the corpus, where the first marker set comprises at least one marker identifying a first type of text. For a first textual unit included in the corpus, the methods may comprise generating a score for the first marker set and comparing the score to a reference score. The score may indicate a number of instances of the at least one marker in the first textual unit.
-
Citations
59 Claims
-
1-22. -22. (canceled)
-
23. A computer-implemented method of identifying content of interest within a corpus, the method comprising:
-
identifying with a computer a textual unit in the corpus that includes an instance of an anchor marker set, wherein the computer comprises a processor circuit and operatively associated memory; generating with the computer a plurality of scores for the textual unit, wherein each of the plurality of scores indicates a number of instances in the textual unit of one of a plurality of marker sets; comparing with the computer the plurality of scores to a plurality of reference scores; calculating with the computer an offset between the instance of the anchor marker set and an instance of an instance of one of the plurality of marker sets; and determining with the computer whether the textual unit comprises content of interest considering the comparing and the offset. - View Dependent Claims (24, 25, 26, 27, 28, 29, 38, 39, 40, 41, 42, 43, 44, 45)
-
-
30. A system for identifying content of interest within a corpus, the system comprising a processor circuit and an operatively associated memory, wherein the processor circuit is programmed to:
-
identify a textual unit in the corpus that includes an instance of an anchor marker set; generate a plurality of scores for the textual unit, wherein each of the plurality of scores indicates a number of instances in the textual unit of one of a plurality of marker sets; compare the plurality of scores to a plurality of reference scores; calculate an offset between the instance of the anchor marker set and an instance of an instance of one of the plurality of marker sets; and determine whether the textual unit comprises content of interest considering the comparing and the offset. - View Dependent Claims (46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59)
-
-
31. A computer readable medium comprising instructions that when executed by a processor, cause the processor to perform the steps of:
-
identifying a textual unit in the corpus that includes an instance of an anchor marker set; generating a plurality of scores for the textual unit, wherein each of the plurality of scores indicates a number of instances in the textual unit of one of a plurality of marker sets; comparing the plurality of scores to a plurality of reference scores; calculating an offset between the instance of the anchor marker set and an instance of an instance of one of the plurality of marker sets; and determining whether the textual unit comprises content of interest considering the comparing and the offset.
-
-
32-37. -37. (canceled)
Specification