Systems and methods for improved spell checking
First Claim
1. A system that facilitates spell checking, comprising:
- a component that receives input data containing text; and
a spell checking component that identifies a set of potentially misspelled substrings in the text and proposes at least one alternative spelling for the substring set based on at least one query log;
the query log comprising data utilized by users to query a data collection over a time frame.
2 Assignments
0 Petitions
Accused Products
Abstract
The present invention leverages iterative transformations of search query strings along with statistics extracted from search query logs and/or web data to provide possible alternative spellings for the search query strings. This provides a spell checking means that can be influenced to provide individualized suggestions for each user. By utilizing search query logs, the present invention can account for substrings not found in a lexicon but still acceptable as a search query of interest. This allows a means to provide a higher quality proposal for alternative spellings, beyond the content of the lexicon. One instance of the present invention operates at a substring level by utilizing word unigram and/or bigram statistics extracted from query logs combined with an iterative search. This provides substantially better spelling alternatives for a given query than employing only substring matching. Other instances can receive input data from sources other than a search query input.
189 Citations
42 Claims
-
1. A system that facilitates spell checking, comprising:
-
a component that receives input data containing text; and
a spell checking component that identifies a set of potentially misspelled substrings in the text and proposes at least one alternative spelling for the substring set based on at least one query log;
the query log comprising data utilized by users to query a data collection over a time frame. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 40, 42)
-
-
32. A method of facilitating spell checking, comprising:
-
receiving input data containing text;
identifying a set of potentially misspelled substrings in the text; and
proposing at least one alternative spelling for the substring set based on at least one query log;
the query log comprising data utilized by users to query a data collection over a time frame. - View Dependent Claims (33, 34, 35, 36, 37, 41)
-
-
38. A system that facilitates spell checking queries to a search engine, comprising:
-
means for receiving input data containing text; and
means for identifying a set of potentially misspelled substrings in the text and proposing at least one alternative spelling for the substring set based on at least one query log;
the query log comprising data utilized by users to query a data collection over a time frame.
-
-
39. A data packet, transmitted between two or more computer components, that facilitates spell checking, the data packet comprising, at least in part, information relating to a spell checking system that provides, at least in part, at least one alternative spelling for a string set based, at least in part, on at least one query log.
Specification