Named entity recognition in query
First Claim
Patent Images
1. A computer-implemented method of recognizing named entities in a query with a recognition module operating on a processor, the method comprising:
- receiving an input query from a user;
detecting a named entity in the input query;
representing the input query as one or more triples (e, t, c), wherein e represents the detected named entity, t represents a context of the detected named entity, and c represents a classification for the detected named entity that is determined based at least in part on a trained probabilistic topic model and a predefined taxonomy;
calculating a joint probability for individual ones of the one or more triples;
identifying a largest joint probability associated with a respective one of the one or more triples;
predicting an assigned classification for the detected named entity based at least in part on the identified largest joint probability; and
outputting the detected named entity and the assigned classification to the user.
2 Assignments
0 Petitions
Accused Products
Abstract
Named Entity Recognition in Query (NERQ) involves detection of a named entity in a given query and classification of the named entity into one or more predefined classes. The predefined classes may be based on a predefined taxonomy. A probabilistic approach may be taken to detecting and classifying named entities in queries, the approach using either query log data or click through data and Weakly Supervised Latent Dirichlet Allocation (WS-LDA) to construct and train a topic model.
23 Citations
20 Claims
-
1. A computer-implemented method of recognizing named entities in a query with a recognition module operating on a processor, the method comprising:
-
receiving an input query from a user; detecting a named entity in the input query; representing the input query as one or more triples (e, t, c), wherein e represents the detected named entity, t represents a context of the detected named entity, and c represents a classification for the detected named entity that is determined based at least in part on a trained probabilistic topic model and a predefined taxonomy; calculating a joint probability for individual ones of the one or more triples; identifying a largest joint probability associated with a respective one of the one or more triples; predicting an assigned classification for the detected named entity based at least in part on the identified largest joint probability; and outputting the detected named entity and the assigned classification to the user. - View Dependent Claims (2, 3, 4)
-
-
5. A method of recognizing named entities in a query with a recognition module operating on a processor, the method comprising:
-
defining a seed set comprising a seed named entity; assigning a classification to the seed named entity based on a predefined taxonomy; training a probabilistic topic model based on the seed named entity and the classification; receiving an input query from a user; detecting, by the processor, another named entity in the input query; representing the input query as one or more triples (e, t, c), wherein e represents the detected other named entity, t represents a context of the detected other named entity, and c represents a particular classification for the detected other named entity that is determined based at least in part on the trained probabilistic topic model and the predefined taxonomy; calculating a joint probability for individual ones of the one or more triples; identifying a largest joint probability associated with a respective one of the one or more triples; predicting, by the recognition module, an assigned classification for the detected other named entity based at least in part on the identified largest joint probability; and outputting the detected other named entity and the assigned classification to the user. - View Dependent Claims (6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16)
-
-
17. A system comprising:
-
a memory; and a processor coupled to the memory, the processor executing components comprising; an offline training component configured to; construct training data comprising a first named entity and a classification for the first named entity, and train a probabilistic topic model based on the first named entity and the classification for the first named entity using a weakly supervised learning method; and an online prediction component configured to; detect another named entity in an input query, represent the input query as one or more triples (e, t, c), wherein e represents the detected other named entity, t represents a context of the detected other named entity, and c represents a particular classification for the detected other named entity that is determined based at least in part on the trained probabilistic topic model and a predefined taxonomy, calculate a joint probability for individual ones of the one or more triples, identify a largest joint probability associated with a respective one of the one or more triples, predict an assigned classification for the detected other named entity based at least in part on the identified largest joint probability, and output the detected other named entity and the assigned classification for the detected other named entity to a user. - View Dependent Claims (18, 19, 20)
-
Specification