Media Content Assessment and Control Systems
First Claim
Patent Images
1. A computer-implemented method of adapting a characterized textual corpus state to a target state comprising:
- deriving from a textual corpus an assessed textual corpus state on a physical computing device comprising;
parsing the textual corpus and filtering the parsed textual corpus yielding the assessed textual corpus state, the assessed textual corpus comprising;
a set of derived keywords;
each derived keyword including a subset comprising an associated derived keyword frequency of occurrence within the defined textual corpus;
a set of high-frequency words;
each high-frequency word including an associated high-frequency word frequency of occurrence within the defined textual corpus;
a set of frequencies of within-sentence co-occurrence of pairs of words within the defined textual corpus, the pairs of words selected from a combined set of words comprising the set of derived keywords and the set of high-frequency words; and
a set of frequencies of within-paragraph co-occurrence of pairs of words within the defined textual corpus, the pairs of words selected from the combined set of words; and
generating on the physical device a difference based on differencing at least one of;
(a) the derived keyword frequency subset and the initial keyword frequency subset;
(b) the set of within-sentence co-occurrence of pairs of derived keywords and the set of within-sentence co-occurrence of pairs of initial keywords; and
(c) the set of within-paragraph co-occurrence of pairs of derived keywords and the set of within-paragraph co-occurrence of pairs of initial keywords.
1 Assignment
0 Petitions
Accused Products
Abstract
Computer implemented methods, computing devices, and computing systems, wherein relationships of words or phrases within a textual corpus are assessed via frequencies of occurrence of particular words or phrases and via frequencies of co-occurrence of particular pairs of words or phrases within defined tracts of text from within the textual corpus.
-
Citations
15 Claims
-
1. A computer-implemented method of adapting a characterized textual corpus state to a target state comprising:
-
deriving from a textual corpus an assessed textual corpus state on a physical computing device comprising;
parsing the textual corpus and filtering the parsed textual corpus yielding the assessed textual corpus state, the assessed textual corpus comprising;a set of derived keywords;
each derived keyword including a subset comprising an associated derived keyword frequency of occurrence within the defined textual corpus;a set of high-frequency words;
each high-frequency word including an associated high-frequency word frequency of occurrence within the defined textual corpus;a set of frequencies of within-sentence co-occurrence of pairs of words within the defined textual corpus, the pairs of words selected from a combined set of words comprising the set of derived keywords and the set of high-frequency words; and a set of frequencies of within-paragraph co-occurrence of pairs of words within the defined textual corpus, the pairs of words selected from the combined set of words; and generating on the physical device a difference based on differencing at least one of;
(a) the derived keyword frequency subset and the initial keyword frequency subset;
(b) the set of within-sentence co-occurrence of pairs of derived keywords and the set of within-sentence co-occurrence of pairs of initial keywords; and
(c) the set of within-paragraph co-occurrence of pairs of derived keywords and the set of within-paragraph co-occurrence of pairs of initial keywords. - View Dependent Claims (2, 3, 4, 5)
-
-
6. A computer-implemented method of adapting a characterized textual corpus state to a target state comprising:
-
deriving from a textual corpus an assessed textual corpus state on a first physical computing device comprising;
parsing the textual corpus and filtering the parsed textual corpus yielding the assessed textual corpus state, the assessed textual corpus comprising;a set of derived keywords;
each derived keyword including a subset comprising an associated derived keyword frequency of occurrence within the defined textual corpus;a set of high-frequency words;
each high-frequency word including an associated high-frequency word frequency of occurrence within the defined textual corpus;a set of frequencies of within-sentence co-occurrence of pairs of words within the defined textual corpus, the pairs of words selected from a combined set of words comprising the set of derived keywords and the set of high-frequency words; and a set of frequencies of within-paragraph co-occurrence of pairs of words within the defined textual corpus, the pairs of words selected from the combined set of words; and generating on a second physical device a difference based on differencing at least one of;
(a) the derived keyword frequency subset and the initial keyword frequency subset;
(b) the set of within-sentence co-occurrence of pairs of derived keywords and the set of within-sentence co-occurrence of pairs of initial keywords; and
(c) the set of within-paragraph co-occurrence of pairs of derived keywords and the set of within-paragraph co-occurrence of pairs of initial keywords. - View Dependent Claims (7, 8, 9, 10)
-
-
11. A computing device comprising:
-
a processing unit and addressable memory, wherein the processing unit is configured to execute one or more instructions to; derive from a textual corpus an assessed textual corpus state on a physical computing device comprising;
the execution of one or more instructions to parse the textual corpus, filter the parsed textual corpus, and yield the assessed textual corpus state, the assessed textual corpus comprising;a set of derived keywords;
each derived keyword including a subset comprising an associated derived keyword frequency of occurrence within the defined textual corpus;a set of high-frequency words;
each high-frequency word including an associated high-frequency word frequency of occurrence within the defined textual corpus;a set of frequencies of within-sentence co-occurrence of pairs of words within the defined textual corpus, the pairs of words selected from a combined set of words comprising the set of derived keywords and the set of high-frequency words; and a set of frequencies of within-paragraph co-occurrence of pairs of words within the defined textual corpus, the pairs of words selected from the combined set of words; and generate a difference based on differencing at least one of;
(a) the derived keyword frequency subset and the initial keyword frequency subset;
(b) the set of within-sentence co-occurrence of pairs of derived keywords and the set of within-sentence co-occurrence of pairs of initial keywords; and
(c) the set of within-paragraph co-occurrence of pairs of derived keywords and the set of within-paragraph co-occurrence of pairs of initial keywords. - View Dependent Claims (12, 13, 14, 15)
-
Specification