Convolutional Latent Semantic Models and their Applications
First Claim
1. A method, implemented by one or more computing devices, for processing linguistic items, comprising:
- receiving a word sequence formed by a sequence of words;
forming a plurality of window vectors, each representing a set of z consecutive words in the word sequence;
transforming the window vectors into a plurality of local contextual feature (LCF) vectors, based on a first part of a convolutional latent semantic model;
generating a global feature vector by selecting, for each dimension of the LCF vectors, a maximum value specified by the LCF vectors, with respect to that dimension; and
projecting the global feature vector into a concept vector, based on a second part of the convolutional latent semantic model,the convolutional latent semantic model being trained based on click-through data.
3 Assignments
0 Petitions
Accused Products
Abstract
Functionality is described herein for transforming first and second symbolic linguistic items into respective first and second continuous-valued concept vectors, using a deep learning model, such as a convolutional latent semantic model. The model is designed to capture both the local and global linguistic contexts of the linguistic items. The functionality then compares the first concept vector with the second concept vector to produce a similarity measure. More specifically, the similarity measure expresses the closeness between the first and second linguistic items in a high-level semantic space. In one case, the first linguistic item corresponds to a query, and the second linguistic item may correspond to a phrase, or a document, or a keyword, or an ad, etc. In one implementation, the convolutional latent semantic model is produced in a training phase based on click-through data.
-
Citations
20 Claims
-
1. A method, implemented by one or more computing devices, for processing linguistic items, comprising:
-
receiving a word sequence formed by a sequence of words; forming a plurality of window vectors, each representing a set of z consecutive words in the word sequence; transforming the window vectors into a plurality of local contextual feature (LCF) vectors, based on a first part of a convolutional latent semantic model; generating a global feature vector by selecting, for each dimension of the LCF vectors, a maximum value specified by the LCF vectors, with respect to that dimension; and projecting the global feature vector into a concept vector, based on a second part of the convolutional latent semantic model, the convolutional latent semantic model being trained based on click-through data. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
-
10. A computer readable storage medium for storing computer readable instructions, the computer readable instructions implementing a method when executed by one or more processing devices, the method comprising:
-
receiving a query; transforming the query into a query concept vector in a high-level semantic space, using a deep learning model; comparing the query concept vector with an ad-related concept vector, the ad-related concept vector also being expressed in the semantic space, and being produced by transforming an ad-related linguistic item using the deep learning model, said comparing producing a query-to-item similarity measure indicating a semantic similarity between the query and the ad-related linguistic item. - View Dependent Claims (11, 12, 13, 14, 15, 16, 17, 18, 19)
-
-
20. A system, implemented by one or more computing devices, for processing linguistic items, comprising:
-
a similarity determination system configured to; transform a first linguistic item into a first concept vector expressed in a high-level semantic space, using a convolutional neural network, or receive a first concept vector that has been previously produced; transform a second linguistic item into a second concept vector expressed in the high-level semantic space, using the convolutional neural network, or receive a second concept vector that has been previously produced; and compare the first concept vector with the second concept vector to produce a similarity measure; and a post-processing module configured to perform an action on the second linguistic item based on the similarity measure, the convolutional neural network being trained based on click-through data.
-
Specification