Deep structured semantic model produced using click-through data
First Claim
1. One or more computing devices comprising:
- a processing device; and
a projection module executable on the processing device, the projection module comprising;
a dimensionality-reduction module configured to;
receive an input item that represents linguistic information comprising a plurality of input words from a vocabulary space having a first dimensionality; and
transform the input item into a lower-dimension item that represents individual input words in another space having a second dimensionality that is smaller than the first dimensionality of the vocabulary space; and
a deep structured semantic module configured to, after the input item has been transformed into the lower-dimension item;
receive the lower-dimension item from the dimensionality-reduction module; and
project, using a model, the lower-dimension item into an output item other than the lower-dimension item,the output item being expressed in a semantic space, andthe model being discriminatively trained based on click-through data.
2 Assignments
0 Petitions
Accused Products
Abstract
A deep structured semantic module (DSSM) is described herein which uses a model that is discriminatively trained based on click-through data, e.g., such that a conditional likelihood of clicked documents, given respective queries, is maximized, and a condition likelihood of non-clicked documents, given the queries, is reduced. In operation, after training is complete, the DSSM maps an input item into an output item expressed in a semantic space, using the trained model. To facilitate training and runtime operation, a dimensionality-reduction module (DRM) can reduce the dimensionality of the input item that is fed to the DSSM. A search engine may use the above-summarized functionality to convert a query and a plurality of documents into the common semantic space, and then determine the similarity between the query and documents in the semantic space. The search engine may then rank the documents based, at least in part, on the similarity measures.
92 Citations
20 Claims
-
1. One or more computing devices comprising:
-
a processing device; and a projection module executable on the processing device, the projection module comprising; a dimensionality-reduction module configured to; receive an input item that represents linguistic information comprising a plurality of input words from a vocabulary space having a first dimensionality; and transform the input item into a lower-dimension item that represents individual input words in another space having a second dimensionality that is smaller than the first dimensionality of the vocabulary space; and a deep structured semantic module configured to, after the input item has been transformed into the lower-dimension item; receive the lower-dimension item from the dimensionality-reduction module; and project, using a model, the lower-dimension item into an output item other than the lower-dimension item, the output item being expressed in a semantic space, and the model being discriminatively trained based on click-through data. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
-
10. A method implemented by one or more computing devices the method comprising:
-
receiving a first input item which represents first linguistic information provided in a vocabulary space having a first dimensionality; converting the first input item into a phonetic representation of the first input item; projecting, using a model, the phonetic representation of the first input item into a first output item, the first output item being expressed in a semantic space; receiving a second output item which represents second linguistic information, the second output item also being expressed in the semantic space; and determining a similarity between the first output item and the second output item in the semantic space to obtain a similarity measure between the first linguistic information and the second linguistic information, said projecting using a deep neural network, and the model being discriminatively trained based on click-through data. - View Dependent Claims (11, 12, 13, 14, 15)
-
-
16. A system comprising:
-
a processing device; and a computer readable storage medium storing instructions which, when executed by the processing device, cause the processing device to; receive an input item that represents linguistic information comprising an input word from a vocabulary space having a first dimensionality; represent the input item as a plurality of n-grams; map the input into a lower-dimension item that represents the plurality of n-grams in another space having a second dimensionality that is smaller than the first dimensionality of the vocabulary space of the linguistic information; and use a model to project the lower-dimension item that represents the plurality of n-grams into a semantic space to obtain a semantic output item representing the input item, wherein the model is trained using click-through data. - View Dependent Claims (17, 18, 19, 20)
-
Specification