×

Computer implemented semantic search methodology, system and computer program product for determining information density in text

  • US 8,880,389 B2
  • Filed: 12/09/2011
  • Issued: 11/04/2014
  • Est. Priority Date: 12/09/2011
  • Status: Active Grant
First Claim
Patent Images

1. A computer program product comprising a non-transitory computer readable medium having computer usable program code executable to perform operations for determining an informative score of textualized digital web media, the operations of the computer program product comprising:

  • compiling a list of web media sources for analysis in computer readable memory;

    querying the web media sources for a block of text;

    storing the block of text in volatile computer readable memory;

    identifying sentences within the block of text;

    storing the sentences as strings within an array;

    parsing each sentence to quantify one or more of the following;

    a number of words in the sentence;

    a number of prepositions, postpositions, adjectives, adverbs, verbs, nouns, and grammatical conjunctions, by referencing words within the sentence with a dictionary in computer readable memory;

    a number of dependent clauses in the sentence;

    a number of independent clauses in the sentence;

    a number of ellipsis, a number of dashes (both en dashes and em dashes), and a number of commas, semicolons, and colons;

    a number of subjects and predicates in the sentence;

    a number of appositions in the sentence;

    a number of syllables in each word of the sentence by cross-referencing each word with the dictionary in persistent storage; and

    a number of alphanumeric characters in the sentence;

    storing each quantified number in a persistent computer readable database with a time-stamp identifying the date the number(s) were quantified;

    calculating a semantic density score for each web media source in the list of web media sources, wherein the semantic density score is a function of the quantified numbers for each sentence in the web media source; and

    storing the semantic density score in a persistent computer readable database, the score exclusively associated with the web media content from which the score was derived.

View all claims
  • 0 Assignments
Timeline View
Assignment View
    ×
    ×