System And Method For Identifying Topics For Short Text Communications
First Claim
Patent Images
1. A system for identifying topics for short text communications, comprising:
- an extraction module to extract tokens from a short text communication;
a query module to generate a query using the extracted tokens and to apply the query to a set of documents;
an result identification module to identify those documents in the set that match the query as search results;
a threshold module to identify salient terms associated with each of the search results and to apply a threshold to the identified salient terms; and
a topic module to select the salient terms that satisfy the threshold as topics for the short text communication.
1 Assignment
0 Petitions
Accused Products
Abstract
A system and method for identifying topics for short text communications is provided. Tokens are extracted from a short text communication. A query is generated using the extracted tokens. The query is applied to a set of documents. Those documents in the set that match the query are identified as search results. Salient terms associated with each of the search results are identified. A threshold is applied to the identified salient terms. The salient terms that satisfy the threshold are selected as topics for the short text communication.
-
Citations
20 Claims
-
1. A system for identifying topics for short text communications, comprising:
-
an extraction module to extract tokens from a short text communication; a query module to generate a query using the extracted tokens and to apply the query to a set of documents; an result identification module to identify those documents in the set that match the query as search results; a threshold module to identify salient terms associated with each of the search results and to apply a threshold to the identified salient terms; and a topic module to select the salient terms that satisfy the threshold as topics for the short text communication. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. A method for identifying topics for short text communications, comprising:
-
extracting tokens from a short text communication; generating a query using the extracted tokens and applying the query to a set of documents; identifying those documents in the set that match the query as search results; identifying salient terms associated with each of the search results and applying a threshold to the identified salient terms; and selecting the salient terms that satisfy the threshold as topics for the short text communication. - View Dependent Claims (12, 13, 14, 15, 16, 17, 18, 19, 20)
-
Specification