Error model formation
First Claim
1. A method of forming a target error model to facilitate spell checking input text related to a target data collection comprising steps of:
- a) providing a source query log containing user queries to at least one source data collection;
b) generating target relational data based on the source query log including corrective substring suggestions that relate to the target data collection and corresponding misspelled substrings for the corrective substring suggestions extracted from the source query log, by applying a source error model to the source query log to thereby generate source relational data including corrective substring suggestions for misspelled substrings of the source query log, and selecting a subset of the source relational data that relate to the target data collection as the target relational data;
c) building a target error model using the target relational data including target statistical occurrence data for the substrings of the target relational data derived from the source query log; and
d) storing the target error model on a computer readable medium.
2 Assignments
0 Petitions
Accused Products
Abstract
In a method of forming a target error model to facilitate correcting or suggesting corrections to misspelled input text related to a target data collection, a source query log containing user queries to at least one source data collection is provided. Next, target relational data is generated based on the source query log including corrective substring suggestions that relate to the target data collection and corresponding misspelled substrings extracted from the source query log. A target error model is then built using the target relational data. The target error model includes target statistical occurrence data for the substrings of the target relational data derived from the source query log. Finally, the target error model is stored on a computer readable medium. Additional embodiments of the invention are directed to a system configured to implement the method.
-
Citations
18 Claims
-
1. A method of forming a target error model to facilitate spell checking input text related to a target data collection comprising steps of:
-
a) providing a source query log containing user queries to at least one source data collection; b) generating target relational data based on the source query log including corrective substring suggestions that relate to the target data collection and corresponding misspelled substrings for the corrective substring suggestions extracted from the source query log, by applying a source error model to the source query log to thereby generate source relational data including corrective substring suggestions for misspelled substrings of the source query log, and selecting a subset of the source relational data that relate to the target data collection as the target relational data; c) building a target error model using the target relational data including target statistical occurrence data for the substrings of the target relational data derived from the source query log; and d) storing the target error model on a computer readable medium. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
-
-
9. A method of forming a target error model to facilitate spell checking input text related to a target data collection comprising steps of:
-
a) providing first and second source query logs comprising query data corresponding to user queries to at least one source data collection by dividing a primary source query log corresponding to user queries to the source data collection into the first and second source query logs; b) generating source statistical occurrence data for substrings of the first source query log; c) generating source relational data including corrective substring suggestions for misspelled substrings of the second source query log based on an application of the statistical occurrence data to the second source query log; d) selecting a subset of the corrective substring suggestions and their corresponding misspelled substrings of the source relational data that relate to substrings of the target data collection as target relational data; e) building a target error model using the target relational data including target statistical occurrence data for the substrings of the target relational data; and f) storing the target error model on a computer readable medium. - View Dependent Claims (10, 11, 12, 13, 14)
-
-
15. A system for forming a target error model to facilitate spell checking input text related to a target data collection comprising:
-
a source query log comprising query data to at least one source data collection; a target relational data generator having an output of target relational data including corrective substring suggestions that relate to the target data collection and corresponding misspelled substrings for the corrective substring suggestions extracted from the source query log; a target error model generator including an output of target statistical occurrence data including at least one of unigram statistics and bigram statistics for the substrings of the target relational data; a computer storage medium containing a target error model comprising the target statistical occurrence data; and a computer processor being a functional component of the system and facilitating generating the output of the target relational data generator and the target error model generator. - View Dependent Claims (16, 17, 18)
-
Specification