×

Filtering phrases for an identifier

  • US 8,423,349 B1
  • Filed: 01/13/2009
  • Issued: 04/16/2013
  • Est. Priority Date: 01/13/2009
  • Status: Active Grant
First Claim
Patent Images

1. One or more computer-readable media storing computer-executable instructions that, when executed on one or more processors, perform acts comprising:

  • analyzing multiple corpuses of text to generate a first corpus of phrases, each of the phrases comprising at least two grammatically-correct words;

    determining, for each phrase of the first corpus of phrases;

    (i) a commonality of the phrase;

    (ii) a commonality of each word of the phrase;

    (iii) a number of syllables in the phrase, and (iv) a number of words in the phrase, the commonality of the phrase indicating a frequency of an occurrence of the phrase;

    scoring each phrase of the first corpus of phrases based on the commonality of the phrase, the commonality of each word of the phrase, the number of syllables in the phrase, and the number of words in the phrase;

    filtering the first corpus of phrases to define a second corpus of phrases comprising fewer phrases than the first corpus of phrases, wherein the filtering comprises;

    removing phrases of the first corpus of phrases based at least in part on scoring of the phrases;

    removing phrases of the first corpus of phrases that include words appearing on a predetermined blacklist; and

    removing phrases of the first corpus of phrases that comprise specified part-of-speech combinations;

    providing phrases of the second corpus of phrases to a device for selection; and

    responsive to receiving a selection of a phrase, associating the selected phrase with the user.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×