Method of spell-checking search queries
First Claim
1. A method comprising:
receiving, by one or more processors of one or more server devices, a target text-string in a search query from a user device;
identifying, by one or more processors of one or more server devices, a set of contexts, from within a corpus of contexts, that include the target text-string;
determining, by one or more processors of one or more server devices, a quantity of correct spelling contexts in the set of contexts, where the correct spelling contexts include a correct spelling of the target text-string, where determining the quantity of correct spelling contexts, further comprises;
identifying a number of contexts in the set of contexts in which occurrences of the target text-string are equal to or greater than a first threshold and in which a ratio of target text-string occurrences to occurrences of a reference text-string is equal to or greater than a second threshold, and determining the quantity of correct spelling contexts in the set of contexts that include a correct spelling of the target text-string based on the number of identified contexts;
determining, by one or more processors of one or more server devices, a quantity of incorrect spelling contexts in the set of contexts, where the incorrect spelling contexts include an incorrect spelling of the reference text-string;
determining, by one or more processors of one or more server devices, that the target text-string is a misspelling of the reference text-string based on the quantity of correct spelling contexts and the quantity of incorrect spelling contexts; and
forming, by one or more processors of one or more server devices, an alternative search query in response to determining that the target text-string is a misspelling of the reference text-string.
Abstract
A computer-implemented method for determining whether a target text-string is correctly spelled is provided. The target text-string is compared to a corpus to determine a set of contexts which each include an occurrence of the target text-string. Using heuristics, each context of the set is characterized based on occurrences in the corpus of the target text-string and a reference text-string. Contexts are characterized as including a correct spelling of the target text-string, an incorrect spelling of the reference text-string, or including an indeterminate usage of the target text-string. A likelihood that the target text-string is a misspelling of the reference text-string is computed as a function of the quantity of contexts including a correct spelling of the target text-string and the quantity of contexts including an incorrect spelling of a reference text-string. In one application, the target text-string is received in a search query, the search executed following a spell-check.
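The characterization step described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: it assumes each context is a plain string of whitespace-separated words, and the names `Label`, `classify_context`, `tally`, and both threshold values are hypothetical choices made for illustration. The two tests mirror the occurrence threshold and ratio threshold recited in claims 1 and 5.

```python
from collections import Counter
from enum import Enum


class Label(Enum):
    CORRECT = "correct spelling of the target"
    INCORRECT = "incorrect spelling of the reference"
    INDETERMINATE = "indeterminate usage"


# Hypothetical values; the source does not state concrete thresholds.
MIN_OCCURRENCES = 2   # "first threshold"
MIN_RATIO = 3.0       # "second threshold"


def count_occurrences(context: str, term: str) -> int:
    """Count whole-word, case-insensitive occurrences of term in context."""
    return Counter(context.lower().split())[term.lower()]


def classify_context(context: str, target: str, reference: str) -> Label:
    t = count_occurrences(context, target)
    r = count_occurrences(context, reference)
    # t >= MIN_RATIO * r is the ratio test t / r >= MIN_RATIO,
    # written without division so that r == 0 is handled.
    if t >= MIN_OCCURRENCES and t >= MIN_RATIO * r:
        return Label.CORRECT
    if r >= MIN_OCCURRENCES and r >= MIN_RATIO * t:
        return Label.INCORRECT
    return Label.INDETERMINATE


def tally(contexts, target: str, reference: str) -> Counter:
    """Label every context that contains the target and count each label."""
    hits = [c for c in contexts if count_occurrences(c, target) > 0]
    return Counter(classify_context(c, target, reference) for c in hits)
```

Splitting on whitespace ignores punctuation and word boundaries inside markup, so a real corpus would need a proper tokenizer.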
42 Citations
20 Claims
-
1. A method comprising:
-
receiving, by one or more processors of one or more server devices, a target text-string in a search query from a user device; identifying, by one or more processors of one or more server devices, a set of contexts, from within a corpus of contexts, that include the target text-string; determining, by one or more processors of one or more server devices, a quantity of correct spelling contexts in the set of contexts, where the correct spelling contexts include a correct spelling of the target text-string, where determining the quantity of correct spelling contexts, further comprises; identifying a number of contexts in the set of contexts in which occurrences of the target text-string are equal to or greater than a first threshold and in which a ratio of target text-string occurrences to occurrences of a reference text-string is equal to or greater than a second threshold, and determining the quantity of correct spelling contexts in the set of contexts that include a correct spelling of the target text-string based on the number of identified contexts; determining, by one or more processors of one or more server devices, a quantity of incorrect spelling contexts in the set of contexts, where the incorrect spelling contexts include an incorrect spelling of the reference text-string; determining, by one or more processors of one or more server devices, that the target text-string is a misspelling of the reference text-string based on the quantity of correct spelling contexts and the quantity of incorrect spelling contexts; and forming, by one or more processors of one or more server devices, an alternative search query in response to determining that the target text-string is a misspelling of the reference text-string. - View Dependent Claims (2, 3, 4)
-
-
5. A method comprising:
-
receiving, by one or more processors of one or more server devices, a target text-string in a search query from a user device; identifying, by one or more processors of one or more server devices, a set of contexts, from within a corpus of contexts, that include the target text-string; determining, by one or more processors of one or more server devices, a quantity of correct spelling contexts in the set of contexts, where the correct spelling contexts include a correct spelling of the target text-string, determining, by one or more processors of one or more server devices, a quantity of incorrect spelling contexts in the set of contexts, where the incorrect spelling contexts include an incorrect spelling of a reference text-string, where determining the quantity of incorrect spelling contexts, further comprises; identifying a number of contexts in the set of contexts in which occurrences of the reference text-string are equal to or greater than a first threshold and in which a ratio of reference text-string occurrences to target text-string occurrences is equal to or greater than a second threshold, and determining the quantity of incorrect spelling contexts in the set of contexts that include an incorrect spelling of the reference text-string based on the number of identified contexts, determining, by one or more processors of one or more server devices, that the target text-string is a misspelling of the reference text-string based on the quantity of correct spelling contexts and the quantity of incorrect spelling contexts; and forming, by one or more processors of one or more server devices, an alternative search query when the target text-string is a misspelling of the reference text-string. - View Dependent Claims (6, 7)
-
-
8. A method comprising:
-
receiving, by one or more processors of one or more server devices, a target text-string in a search query from a user device; identifying, by one or more processors of one or more server devices, a set of contexts, from within a corpus of contexts, that include the target text-string; determining, by one or more processors of one or more server devices, a quantity of correct spelling contexts in the set of contexts, where the correct spelling contexts include a correct spelling of the target text-string, determining, by one or more processors of one or more server devices, a quantity of incorrect spelling contexts in the set of contexts, where the incorrect spelling contexts include an incorrect spelling of a reference text-string; determining, by one or more processors of one or more server devices, that the target text-string is a misspelling of the reference text-string based on the quantity of correct spelling contexts and the quantity of incorrect spelling contexts, where determining that the target text-string is the misspelling of the reference text-string is based on a function of the quantity of correct spelling contexts relative to a sum that includes the quantity of incorrect spelling contexts and the quantity of correct spelling contexts; and forming, by one or more processors of one or more server devices, an alternative search query when the target text-string is a misspelling of the reference text-string.
-
-
9. A method comprising:
-
receiving, by one or more processors of one or more server devices, a target text-string in a search query from a user device; identifying, by one or more processors of one or more server devices, a set of contexts, from within a corpus of contexts, that include the target text-string; determining, by one or more processors of one or more server devices, a quantity of correct spelling contexts in the set of contexts, where the correct spelling contexts include a correct spelling of the target text-string, determining, by one or more processors of one or more server devices, a quantity of incorrect spelling contexts in the set of contexts, where the incorrect spelling contexts include an incorrect spelling of a reference text-string; determining, by one or more processors of one or more server devices, that the target text-string is a misspelling of the reference text-string based on the quantity of correct spelling contexts and the quantity of incorrect spelling contexts, where determining that the target text-string is the misspelling of the reference text-string is based on a function of the quantity of incorrect spelling contexts relative to a sum that includes the quantity of incorrect spelling contexts and the quantity of correct spelling contexts; and forming, by one or more processors of one or more server devices, an alternative search query when the target text-string is a misspelling of the reference text-string.
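Claims 8 and 9 base the decision on one quantity relative to a sum of both quantities. The sketch below shows one way that could look, assuming the sum contains only the two counts (the claims say only that the sum "includes" them) and assuming a hypothetical cutoff of 0.5. The function names and the token-replacement rule for forming the alternative query are illustrative, not taken from the source.

```python
from typing import Optional


def misspelling_likelihood(num_correct: int, num_incorrect: int) -> Optional[float]:
    """Fraction of decided contexts that point to a misspelling (claim 9 style).

    With only two terms in the sum, the claim 8 quantity
    num_correct / total is simply 1 minus this value.
    """
    total = num_correct + num_incorrect
    if total == 0:
        return None  # no evidence either way
    return num_incorrect / total


def alternative_query(query: str, target: str, reference: str,
                      num_correct: int, num_incorrect: int,
                      cutoff: float = 0.5) -> Optional[str]:
    """Return a rewritten query if the target looks misspelled, else None."""
    likelihood = misspelling_likelihood(num_correct, num_incorrect)
    if likelihood is None or likelihood < cutoff:
        return None
    # Swap every occurrence of the target token for the reference.
    return " ".join(reference if tok == target else tok for tok in query.split())
```

For example, with 1 correct-spelling context and 3 incorrect-spelling contexts the likelihood is 0.75, so the query "recieve mail" would be rewritten as "receive mail".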
-
-
10. A method comprising:
-
receiving, by one or more processors of one or more server devices, a target text-string, where the target text-string is received in a search query; selecting, by one or more processors of one or more server devices, a reference text-string having characteristics corresponding to the target text-string; determining, by one or more processors of one or more server devices, a first ratio of occurrences of the reference text-string relative to occurrences of the target text-string in a first memory; determining, by one or more processors of one or more server devices, a second ratio of occurrences of the reference text-string relative to occurrences of the target text-string in a second memory; determining, by one or more processors of one or more server devices, a likelihood that the target text-string is a misspelling of the reference text-string based on the first ratio and the second ratio; and providing, by one or more processors of one or more server devices, the reference text-string in an alternative search query. - View Dependent Claims (11, 12, 13, 14, 15)
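Claim 10 computes the same reference-to-target ratio in two separate memories and derives a likelihood from both. The claim does not say what the memories hold or how the ratios are combined, so the sketch below makes two labeled assumptions: each memory is a dictionary of occurrence counts, and the ratios are combined by squashing each into the range 0 to 1 and averaging. All names are hypothetical.

```python
import math


def occurrence_ratio(store: dict, reference: str, target: str) -> float:
    """Occurrences of reference relative to occurrences of target in one store."""
    ref = store.get(reference, 0)
    tgt = store.get(target, 0)
    if tgt == 0:
        # Target never seen: the ratio is unbounded if the reference was seen.
        return math.inf if ref > 0 else 0.0
    return ref / tgt


def likelihood_from_ratios(first_ratio: float, second_ratio: float) -> float:
    """Assumed combination rule: map each ratio r to r / (1 + r), then average."""
    def squash(r: float) -> float:
        return 1.0 if math.isinf(r) else r / (1.0 + r)
    return (squash(first_ratio) + squash(second_ratio)) / 2.0


# Stand-ins for the "first memory" and "second memory" (made-up counts).
first_memory = {"receive": 900, "recieve": 100}
second_memory = {"receive": 300, "recieve": 100}

first = occurrence_ratio(first_memory, "receive", "recieve")
second = occurrence_ratio(second_memory, "receive", "recieve")
likelihood = likelihood_from_ratios(first, second)
```

Averaging is only one option; requiring both ratios to exceed a threshold, or weighting one memory more heavily, would fit the claim language equally well.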
-
-
16. A storage device including a computer-readable medium containing computer-executable instructions, comprising:
-
one or more instructions for receiving a search query from a user, where the search query includes a target text-string; one or more instructions for selecting a reference text-string having characteristics corresponding to the target text-string; one or more instructions for determining a first ratio of occurrences of the reference text-string relative to occurrences of the target text-string in a first memory; one or more instructions for determining a second ratio of occurrences of the reference text-string relative to occurrences of the target text-string in a second memory; one or more instructions for determining that the target text-string is a misspelling of the reference text-string based on the first ratio and the second ratio; and one or more instructions for providing an alternative search query. - View Dependent Claims (17, 18)
-
-
19. A storage device including a computer-readable medium containing computer-executable instructions, comprising:
-
one or more instructions for receiving a target text-string in a search query from a user device; one or more instructions for identifying at least one reference text-string based on the target text-string; one or more instructions for identifying a set of contexts, from within a corpus of contexts, that include the target text-string, one or more instructions for determining a quantity of correct spelling contexts in the set of contexts, where the correct spelling contexts include a correct spelling of the target text-string, where the one or more instructions for determining the quantity of correct spelling contexts includes; one or more instructions for identifying a number of contexts in the set of contexts in which occurrences of the target text-string are equal to or greater than a first threshold and in which a ratio of target text-string occurrences to occurrences of the at least one reference text-string is equal to or greater than a second threshold, and one or more instructions for determining the quantity of correct spelling contexts in the set of contexts that include a correct spelling of the target text-string based on the number of identified contexts; one or more instructions for determining a quantity of incorrect spelling contexts in the set of contexts, where the incorrect spelling contexts include an incorrect spelling of the at least one reference text-string; one or more instructions for determining that the target text-string is a misspelling of the at least one reference text-string based on the quantity of correct spelling contexts and the quantity of incorrect spelling contexts; and one or more instructions for providing an alternative search query when the target text-string is a misspelling of the at least one reference text-string.
-
-
20. A storage device including a computer-readable medium containing computer-executable instructions, comprising:
-
one or more instructions for receiving a target text-string in a search query from a user device; one or more instructions for identifying at least one reference text-string based on the target text-string; one or more instructions for identifying a set of contexts, from within a corpus of contexts, that include the target text-string; one or more instructions for determining a quantity of correct spelling contexts in the set of contexts, where the correct spelling contexts include a correct spelling of the target text-string; one or more instructions for determining a quantity of incorrect spelling contexts in the set of contexts, where the incorrect spelling contexts include an incorrect spelling of the at least one reference text-string, where the one or more instructions for determining that the target text-string is a misspelling of the reference text-string includes; one or more instructions for determining that the target text-string is the misspelling of the at least one reference text-string based on a function of the quantity of correct spelling contexts relative to a sum that includes the quantity of incorrect spelling contexts and the quantity of correct spelling contexts; and one or more instructions for providing an alternative search query when the target text-string is a misspelling of the at least one reference text-string.
-
Specification