
Usage based query response

  • US 8,428,948 B1
  • Filed: 07/20/2010
  • Issued: 04/23/2013
  • Est. Priority Date: 12/17/2009
  • Status: Active Grant
First Claim

1. A system comprising:

  • (a) a database storing a plurality of records;

    (b) a computer readable medium storing data comprising a dictionary list comprising a plurality of word groups identified as corresponding to an invented part of speech;

    (c) a computer configured via a set of data to perform a set of tasks comprising:

    (i) receiving an input string, the input string comprising a plurality of words comprising an input word group corresponding to the invented part of speech;

    (ii) calculating a set of part of speech scores for the input string wherein calculating the set of part of speech scores comprises, for each word group from a set of word groups from the dictionary list, calculating a measure of similarity between:

    (1) the input word group corresponding to the invented part of speech; and

    (2) the word group from the set of word groups from the dictionary list for which the part of speech score is being calculated;

    (iii) determining a result set comprising a set of records retrieved from the database, wherein each record from the result set comprises an identifying word group corresponding to the invented part of speech; and

    (iv) for each record in a subset of records from the result set, determining a match score based on a relevant part of speech score from the previously calculated set of part of speech scores, wherein the relevant part of speech score corresponds to the identifying word group from the record in the subset of records for which the match score is being determined; and

    wherein the cardinality of the subset of records is less than or equal to the cardinality of the set of records from the result set;

    wherein:

    (A) each record from the plurality of records stored in the database corresponds to a class from a plurality of classes;

    (B) the data stored on the computer readable medium further comprises a class probability index, wherein the class probability index comprises, for each class in a subset of the plurality of classes:

    (i) general probability data that words appear in records corresponding to the class; and

    (ii) specialized probability data that word groups in the dictionary list are used as the invented part of speech in records corresponding to the class;

    (C) calculating a set of class scores for the input string, wherein the set of class scores comprises, for each class from the subset of the plurality of classes, a probability that the input string corresponds to that class;

    (D) each record from the result set corresponds to a class from the subset of the plurality of classes; and

    (E) the match score for each record from the subset of records from the result set is further based on a relevant class score from the previously calculated set of class scores, wherein the relevant class score corresponds to the class corresponding to the record from the subset of records for which the match score is being determined;

    wherein the cardinality of the subset of the plurality of classes is less than or equal to the cardinality of the plurality of classes.
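
For illustration only, the scoring flow recited in the claim might be sketched as follows. This is a minimal sketch under stated assumptions, not the patented implementation: the claim does not name a similarity measure, a probability model, or a way of combining scores, so the `difflib` ratio, the naive-Bayes-style class score, the smoothing floor, and the product used for the match score are all assumptions. The function names, the record fields (`word_group`, `cls`), and the layout of `class_index` are likewise hypothetical.

```python
import math
from difflib import SequenceMatcher

FLOOR = 1e-6  # assumed smoothing value for words or word groups missing from the index


def similarity(a, b):
    """Assumed similarity measure in [0, 1]; the claim does not specify one."""
    return SequenceMatcher(None, a.lower(), b.lower()).ratio()


def part_of_speech_scores(input_group, dictionary_list):
    """Step (ii): one score per dictionary word group, compared with the input word group."""
    return {group: similarity(input_group, group) for group in dictionary_list}


def class_scores(words, input_group, class_index):
    """Element (C): a probability per class, here a normalized naive-Bayes-style score.

    class_index maps each class to {"general": {word: p}, "specialized": {word_group: p}},
    mirroring elements (B)(i) and (B)(ii). The model itself is an assumption.
    """
    log_scores = {}
    for cls, index in class_index.items():
        log_p = sum(math.log(index["general"].get(w, FLOOR)) for w in words)
        log_p += math.log(index["specialized"].get(input_group, FLOOR))
        log_scores[cls] = log_p
    # Normalize in a numerically stable way so the scores sum to 1.
    top = max(log_scores.values())
    weights = {cls: math.exp(v - top) for cls, v in log_scores.items()}
    total = sum(weights.values())
    return {cls: w / total for cls, w in weights.items()}


def match_scores(result_set, pos_scores, cls_scores):
    """Steps (iv) and (E): combine the relevant part of speech score and class score.

    Multiplying the two scores is an assumed combination rule.
    """
    scored = [
        (record, pos_scores.get(record["word_group"], 0.0) * cls_scores.get(record["cls"], 0.0))
        for record in result_set
    ]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)
```

In this sketch the part of speech scores and class scores are computed once per input string and then looked up for each record, which matches the claim's wording that the match score uses "previously calculated" scores rather than recomputing them per record.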

  • 16 Assignments