×

Calculating reliability scores from word splitting

  • US 8,706,728 B2
  • Filed: 09/30/2010
  • Issued: 04/22/2014
  • Est. Priority Date: 02/19/2010
  • Status: Active Grant
First Claim
Patent Images

1. A method, comprising:

  • A) storing, by one or more server computers communicatively coupled to a network, an electronic dictionary comprising one or more dictionary words;

    B) receiving, by said one or more server computers, a text string without spaces;

    C) identifying, by said one or more server computers, a plurality of keywords comprising a plurality of substrings of said text string;

    D) generating, by said one or more server computers, from said text string, one or more keyword strings, each comprising a combination of said plurality of keywords;

    E) for each of said one or more keyword strings;

    i) identifying by said one or more server computers, within said plurality of keywords, one or more dictionary keywords comprising one or more of said one or more dictionary words;

    ii) calculating, by said one or more server computers, a dictionary keyword percentage variable comprising a quotient dividing a quantity of said one or more dictionary keywords by a quantity of said plurality of keywords;

    iii) calculating, by said one or more server computers, a dictionary keyword percentage weighting variable by determining a quantity of said one or more keyword strings wherein said dictionary keyword percentage variable comprises a value of 100%;

    iv) calculating, by said one or more server computers, a dictionary character percentage variable comprising a quotient dividing a quantity of characters in said one or more dictionary keywords by a quantity of characters in said text string;

    v) calculating, by said one or more server computers, a keyword string rank variable comprising a numerical rank assigned to each of said one or more keyword strings according to a quantity of said plurality of keywords within each of said one or more keyword strings; and

    vi) calculating, by said one or more server computers, a keyword count uniqueness variable calculated by identifying said one or more keyword strings with an equal number of said plurality of keywords; and

    F) calculating, by said one or more server computers, for each of said one or more keyword strings, a reliability score comprising a sum of said dictionary keyword percentage variable, said dictionary keyword percentage weighting variable, said dictionary character percentage variable, said keyword string rank variable and said keyword count uniqueness variable.

View all claims
  • 4 Assignments
Timeline View
Assignment View
    ×
    ×