ACQUISITION OF SEMANTIC CLASS LEXICONS FOR QUERY TAGGING
Abstract
A user's search experience may be enhanced by providing additional content based upon an understanding of the user's intent. Query tagging, the assigning of semantic labels to terms within a query, is one technique that may be utilized to determine the context of a user's search query. Accordingly, as provided herein, a query tagging model may be updated using one or more stratified lexicons. A list data structure (e.g., lists of phrases obtained from web pages) and seed distribution data (e.g., pre-labeled probability data) may be used by a graph learning technique to obtain an expanded set of phrases and their respective probabilities of corresponding with particular lexicons (e.g., semantic class lexicons). The expanded set of phrases may be used to group phrases into stratified lexicons. The stratified lexicons may be used as features for updating and/or executing the query tagging model.
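The graph learning step described in the abstract can be illustrated with a minimal sketch. It assumes a simple averaging form of label propagation over a bipartite phrase/list graph; the abstract does not name a specific algorithm, so this is only one plausible instance. The function name `propagate`, its parameters, and the example data are hypothetical.

```python
from collections import defaultdict


def propagate(lists, seeds, lexicons, iterations=10):
    """Spread lexicon probability distributions from seed phrases to
    unlabeled phrases that share a list with them.

    lists:    dict list_id -> list of phrases (the list data structure)
    seeds:    dict phrase -> dict lexicon -> probability (seed distribution data)
    lexicons: iterable of lexicon names
    Returns dict phrase -> dict lexicon -> probability (expanded set of phrases).
    NOTE: plain neighbor averaging is an assumption, not the patented method.
    """
    lexicons = list(lexicons)
    # Map every phrase to the lists it appears in (phrase-to-list edges).
    membership = defaultdict(list)
    for list_id, phrases in lists.items():
        for phrase in phrases:
            membership[phrase].append(list_id)

    phrase_dist = {p: dict(d) for p, d in seeds.items()}
    for _ in range(iterations):
        # A list's distribution is the mean over its currently labeled phrases.
        list_dist = {}
        for list_id, phrases in lists.items():
            labeled = [phrase_dist[p] for p in phrases if p in phrase_dist]
            if labeled:
                list_dist[list_id] = {
                    lex: sum(d.get(lex, 0.0) for d in labeled) / len(labeled)
                    for lex in lexicons
                }
        # A phrase's distribution is the mean over its lists; seeds stay clamped.
        updated = {p: dict(d) for p, d in seeds.items()}
        for phrase, list_ids in membership.items():
            if phrase in seeds:
                continue
            neighbors = [list_dist[l] for l in list_ids if l in list_dist]
            if neighbors:
                updated[phrase] = {
                    lex: sum(d[lex] for d in neighbors) / len(neighbors)
                    for lex in lexicons
                }
        phrase_dist = updated
    return phrase_dist
```

In this sketch, a phrase such as "tacoma" that shares a list with the seed phrase "seattle" inherits the seed's distribution, while a phrase that appears only in lists with no seed phrases stays unlabeled.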
20 Claims
1. A method for updating a query tagging model using one or more stratified lexicons comprising:
creating a list data structure comprising a set of lists, wherein a list comprises one or more phrases;
creating seed distribution data comprising one or more phrases, wherein a phrase comprises a lexicon probability distribution;
creating a sub-graph based upon the list data structure and the seed distribution data;
executing a graph learning technique upon the sub-graph to create an expanded set of phrases, wherein a phrase comprises an updated lexicon probability distribution;
creating one or more stratified lexicons based upon the expanded set of phrases; and
updating a query tagging model using the one or more stratified lexicons as lexicon features.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9
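The final step of claim 1, using stratified lexicons as lexicon features, can be sketched as follows. This is a hedged illustration only: it assumes each stratified lexicon is a named set of phrases and that a feature fires for every query token covered by a matching phrase. The function name `lexicon_features`, the `max_len` parameter, and the lexicon names are hypothetical and do not come from the claims.

```python
def lexicon_features(tokens, stratified_lexicons, max_len=3):
    """Return, for each query token, the set of stratified lexicon names
    whose phrases cover that token.

    tokens:              list of query terms
    stratified_lexicons: dict name -> set of phrases (assumed representation)
    max_len:             longest phrase length, in tokens, to look up
    """
    features = [set() for _ in tokens]
    for start in range(len(tokens)):
        last_end = min(start + max_len, len(tokens))
        for end in range(start + 1, last_end + 1):
            phrase = " ".join(tokens[start:end])
            for name, phrases in stratified_lexicons.items():
                if phrase in phrases:
                    # Mark every token inside the matched span.
                    for i in range(start, end):
                        features[i].add(name)
    return features


# Example usage with made-up lexicons.
query = ["hotels", "in", "new", "york"]
lexicons = {"city_high": {"new york"}, "city_low": {"york"}}
per_token = lexicon_features(query, lexicons)
```

A feature-based tagger could then consume `per_token` alongside its other token features; how the features are weighted is left to the model being updated.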
10. A system for updating a query tagging model using one or more stratified lexicons comprising:
a list data structure constructor configured to create a list data structure comprising a set of lists, wherein a list comprises one or more phrases;
a seed distribution constructor configured to create seed distribution data comprising one or more phrases, wherein a phrase comprises a lexicon probability distribution;
a sub-graph constructor configured to create a sub-graph based upon the list data structure and the seed distribution data;
a graph learning component configured to execute a graph learning technique upon the sub-graph to create an expanded set of phrases, wherein a phrase comprises an updated lexicon probability distribution;
a stratified lexicon constructor configured to create one or more stratified lexicons based upon the expanded set of phrases; and
a tagging model updater configured to update a query tagging model using the one or more stratified lexicons as lexicon features.
Dependent claims: 11, 12, 13, 14, 15, 16, 17, 18, 19
20. A system for updating a query tagging model using one or more stratified lexicons comprising:
a list data structure constructor configured to create a list data structure in a graph format based upon a filtered set of HTML lists;
a seed distribution constructor configured to create seed distribution data based upon labeled training data;
a sub-graph constructor configured to create a sub-graph of the list data structure based upon removing one or more lists and one or more phrases based upon the seed distribution data;
a graph learning component configured to execute a graph learning algorithm upon the sub-graph to create an expanded set of phrases based upon assigning list lexicon probability distributions to respective lists and updated lexicon probability distributions to respective phrases within the sub-graph;
a stratified lexicon constructor configured to group a set of phrases within the sub-graph into a stratified lexicon, wherein the stratified lexicon is associated with a corresponding lexicon based upon a predefined range; and
a tagging model updater configured to update a query tagging model based upon execution of the query tagging model upon one or more queries.
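Two elements of claim 20 lend themselves to a short sketch: pruning the list data structure into a sub-graph, and grouping phrases into stratified lexicons by predefined range. The claim does not state the pruning criterion or the range boundaries, so both are assumptions here: lists are kept only if they contain at least `min_seed_phrases` seed phrases, and each range is treated as the interval (low, high]. The function names and the naming scheme `lexicon_stratum` are hypothetical.

```python
def build_subgraph(lists, seeds, min_seed_phrases=1):
    """Keep only lists that contain enough seed phrases (assumed criterion).
    Phrases that occur only in dropped lists are removed along with them."""
    return {
        list_id: phrases
        for list_id, phrases in lists.items()
        if sum(1 for p in phrases if p in seeds) >= min_seed_phrases
    }


def stratify(phrase_dist, ranges):
    """Group phrases into stratified lexicons by probability range.

    phrase_dist: dict phrase -> dict lexicon -> probability
    ranges:      dict stratum name -> (low, high), read as low < p <= high
    Returns dict "<lexicon>_<stratum>" -> set of phrases.
    """
    strata = {}
    for phrase, dist in phrase_dist.items():
        for lexicon, prob in dist.items():
            for stratum, (low, high) in ranges.items():
                if low < prob <= high:
                    strata.setdefault(f"{lexicon}_{stratum}", set()).add(phrase)
    return strata


# Example usage with made-up probabilities and ranges.
dist = {
    "tacoma": {"city": 0.9, "food": 0.1},
    "turkey": {"city": 0.6, "food": 0.4},
}
ranges = {"high": (0.8, 1.0), "mid": (0.5, 0.8)}
strata = stratify(dist, ranges)
```

Splitting one lexicon into several strata lets a downstream tagger learn a separate weight for high-confidence and lower-confidence members rather than treating every phrase in the lexicon alike.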
Specification