Extracting quotes from customer reviews
Abstract
Technologies are described herein for extracting quotes for an item from customer reviews. Individual sentences or phrases contained in customer reviews regarding an item or category of items are parsed into a collection of sentences. A list of topics is generated from the collection of sentences, and each sentence or phrase is assigned a topic from the list of topics. The most relevant topics from the list of topics are identified for a particular item, and one or more sentences or phrases from customer reviews for the particular item are selected for each of the most relevant topics that best represent the sentences associated with the item and assigned to that topic, according to content and sentiment.
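The topic-generation and topic-assignment steps can be illustrated with an off-the-shelf latent Dirichlet allocation implementation. The sketch below is not the patented implementation: it assumes scikit-learn is available, and the function name `assign_topics`, the topic count, the seed, and the sample sentences are all hypothetical choices made for illustration.

```python
from sklearn.decomposition import LatentDirichletAllocation
from sklearn.feature_extraction.text import CountVectorizer


def assign_topics(sentences, n_topics=2, seed=0):
    """Fit LDA on the sentences and give each one its most probable topic."""
    # Bag-of-words counts; English stop words are dropped (an assumption).
    counts = CountVectorizer(stop_words="english").fit_transform(sentences)
    lda = LatentDirichletAllocation(n_components=n_topics, random_state=seed)
    doc_topic = lda.fit_transform(counts)  # shape: (n_sentences, n_topics)
    # One (topic index, probability of that topic) pair per sentence.
    return [(int(row.argmax()), float(row.max())) for row in doc_topic]


# Hypothetical sentences parsed from reviews of a category of items.
sentences = [
    "the battery lasts all day",
    "battery life is short",
    "the screen is bright and sharp",
    "the screen scratches easily",
]
assignments = assign_topics(sentences)
```

The probability returned with each topic index could serve as a relevance score when a representative sentence is later chosen, although the source does not say how relevance is computed.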
54 Citations
19 Claims
1. A non-transitory computer-readable storage medium having computer-executable instructions stored thereon that, when executed by a computer, cause the computer to:
receive customer reviews for a category of items from a plurality of customers, wherein the category of items encompasses a particular item and items of a similar type;
aggregate the customer reviews for the category of items;
parse a plurality of sentences from the customer reviews, wherein the parsing comprises breaking compound sentences in the customer reviews based on one or more of commas, coordinating conjunctions, or transition terms;
classify the plurality of sentences by sentiment by utilizing a logistic regression classifier trained using sentiment scores of terms generated based on sentences contained in training data and manually labeled as to sentiment, wherein positive, negative and mixed sentiment scores of the terms are used as positive coefficients in the logistic regression classifier and neutral sentiment scores of the terms are used as negative coefficients in the logistic regression classifier;
remove those sentences having a neutral sentiment from the plurality of sentences;
generate a list of topics from the plurality of sentences utilizing latent Dirichlet allocation;
assign individual sentences in the plurality of sentences to a topic in the list of topics;
select a relevant topic from the list of topics for the particular item, wherein the relevant topic comprises the topic from the list of topics assigned to the most sentences parsed from customer reviews regarding the particular item;
select a representative sentence from the plurality of sentences assigned to the relevant topic and parsed from the customer reviews regarding the particular item, the representative sentence having a highest relevance to the relevant topic and expressing a majority sentiment from among the plurality of sentences assigned to the relevant topic and parsed from the customer reviews regarding the particular item; and
send the selected representative sentence to a customer computing device, wherein the selected representative sentence is presented to another customer on the customer computing device.
Dependent claims: 2, 3
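The parsing limitation of claim 1 (breaking compound sentences on commas, coordinating conjunctions, or transition terms) can be sketched with a regular expression. This is a deliberately naive illustration: the claim names the categories but not their members, so the word lists below are assumptions, and a real parser would need to avoid splitting phrases such as "black and white".

```python
import re

# Hypothetical word lists; the claim does not enumerate them.
COORDINATING_CONJUNCTIONS = ("and", "but", "or", "yet")
TRANSITION_TERMS = ("however", "although", "though", "otherwise")

_WORDS = "|".join(COORDINATING_CONJUNCTIONS + TRANSITION_TERMS)

# Split on a comma (optionally followed by one of the words),
# or on one of the words standing alone between whitespace.
_SPLIT = re.compile(
    rf"\s*,\s*(?:(?:{_WORDS})\b\s*)?|\s+(?:{_WORDS})\s+",
    re.IGNORECASE,
)


def split_compound(sentence):
    """Break a compound sentence into simpler clauses."""
    body = sentence.strip().rstrip(".!?")
    return [part.strip() for part in _SPLIT.split(body) if part and part.strip()]


clauses = split_compound("The battery lasts all day, but the screen scratches easily.")
```

Each resulting clause can then be classified and assigned to a topic on its own, which matters when one review sentence praises one feature and criticizes another.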
4. A system for summarizing customer reviews regarding a particular item, the system comprising:
one or more application servers configured to receive, from a plurality of customers, customer reviews for a category of items comprising the particular item and items of a similar type; and
a quote extraction module executing on the one or more application servers and configured to
aggregate the customer reviews for the category of items,
parse a plurality of sentences from the aggregated customer reviews, wherein the parsing comprises breaking compound sentences in the customer reviews based on one or more of commas, coordinating conjunctions, or transition terms,
classify the plurality of sentences by sentiment by utilizing a logistic regression classifier trained using sentiment scores of terms generated based on sentences contained in training data and manually labeled as to sentiment, wherein positive, negative and mixed sentiment scores of the terms are used as positive coefficients in the logistic regression classifier and neutral sentiment scores of the terms are used as negative coefficients in the logistic regression classifier,
remove those sentences having a neutral sentiment from the plurality of sentences,
generate a list of topics from the plurality of sentences utilizing latent Dirichlet allocation,
assign each sentence in the plurality of sentences to a topic in the list of topics,
select a relevant topic from the list of topics for a particular item, wherein the relevant topic comprises the topic from the list of topics assigned to the most sentences parsed from the customer reviews regarding the particular item,
select a representative sentence from the plurality of sentences assigned to the relevant topic and parsed from customer reviews regarding the particular item, the representative sentence having a highest relevance to the relevant topic and expressing a majority sentiment from among the plurality of sentences assigned to the relevant topic and parsed from customer reviews regarding the particular item, and
cause the selected representative sentence to be sent to a customer computing device, wherein the selected representative sentence is presented to another customer on the customer computing device.
Dependent claims: 5, 6, 7, 8, 9, 10
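One possible reading of the classifier limitation is that each term contributes a coefficient equal to its positive, negative and mixed scores minus its neutral score, so that the logistic function estimates how likely a sentence is to carry any sentiment at all. The sketch below follows that reading only as an assumption. The term scores are hand-set placeholders standing in for scores that would be derived from manually labeled training data, and the names `TERM_SCORES`, `sentiment_probability` and `drop_neutral` are hypothetical.

```python
import math
import re

# Hypothetical per-term sentiment scores (placeholders, not trained values).
TERM_SCORES = {
    "great":   {"positive": 0.90, "negative": 0.00, "mixed": 0.05, "neutral": 0.05},
    "broke":   {"positive": 0.00, "negative": 0.85, "mixed": 0.05, "neutral": 0.10},
    "arrived": {"positive": 0.05, "negative": 0.05, "mixed": 0.00, "neutral": 0.90},
    "tuesday": {"positive": 0.00, "negative": 0.00, "mixed": 0.05, "neutral": 0.95},
}


def term_coefficient(scores):
    """Sentiment-bearing scores count positively, the neutral score negatively."""
    return scores["positive"] + scores["negative"] + scores["mixed"] - scores["neutral"]


def sentiment_probability(sentence, bias=0.0):
    """Logistic estimate that a sentence is non-neutral."""
    terms = re.findall(r"[a-z']+", sentence.lower())
    z = bias + sum(term_coefficient(TERM_SCORES[t]) for t in terms if t in TERM_SCORES)
    return 1.0 / (1.0 + math.exp(-z))


def drop_neutral(sentences, threshold=0.5):
    """Keep only sentences judged to carry sentiment (threshold is an assumption)."""
    return [s for s in sentences if sentiment_probability(s) > threshold]


kept = drop_neutral(["The sound is great", "It arrived on Tuesday", "The hinge broke"])
```

Under these placeholder scores the purely factual sentence about delivery is dropped, which corresponds to the claimed step of removing sentences having a neutral sentiment.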
11. A computer-implemented method of extracting a quote regarding a relevant topic from customer reviews for a particular item, the method comprising executing instructions in a computer system to perform the operations of:
receiving customer reviews for a category of items from a plurality of customers, wherein the category of items encompasses a particular item and items of a similar type;
aggregating, by the computer system, the customer reviews for the category of items;
parsing, by the computer system, a plurality of sentences from the customer reviews, wherein the parsing comprises breaking compound sentences in the customer reviews based on one or more of commas, coordinating conjunctions, or transition terms;
classifying, by the computer system, the plurality of sentences by sentiment by utilizing a logistic regression classifier trained using sentiment scores of terms generated based on sentences contained in training data and manually labeled as to sentiment, wherein positive, negative and mixed sentiment scores of the terms are used as positive coefficients in the logistic regression classifier and neutral sentiment scores of the terms are used as negative coefficients in the logistic regression classifier;
removing, by the computer system, those sentences having a neutral sentiment from the plurality of sentences;
generating, by the computer system, a list of topics from the plurality of sentences utilizing latent Dirichlet allocation;
assigning individual sentences in the plurality of sentences to a topic in the list of topics;
selecting, by the computer system, the relevant topic from the list of topics for the particular item, wherein the relevant topic comprises the topic from the list of topics assigned to the most sentences parsed from the customer reviews regarding the particular item;
selecting, by the computer system, a representative sentence from the plurality of sentences assigned to the relevant topic and parsed from the customer reviews regarding the particular item, the representative sentence having a highest relevance to the relevant topic and expressing a majority sentiment from among the plurality of sentences assigned to the relevant topic and parsed from the customer reviews regarding the particular item; and
sending, by the computer system, the selected representative sentence to a customer computing device, wherein the selected representative sentence is presented to another customer on the customer computing device.
Dependent claims: 12, 13, 14, 15, 16, 17, 18, 19
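The two selecting operations of the method can be sketched as follows, assuming each sentence has already been labeled with an item, a topic, a sentiment and a relevance score. The `Sentence` record, its field names, and the sample data are hypothetical. Reading "highest relevance ... and expressing a majority sentiment" as "most relevant among the sentences that share the majority sentiment" is one interpretation, not a statement of how the claim must be construed.

```python
from collections import Counter
from dataclasses import dataclass


@dataclass
class Sentence:
    text: str
    item_id: str
    topic: int
    sentiment: str    # "positive", "negative", or "mixed"
    relevance: float  # e.g. the sentence's probability under its topic


def select_quote(sentences, item_id):
    """Pick a representative sentence for the item's most-discussed topic."""
    own = [s for s in sentences if s.item_id == item_id]
    if not own:
        return None
    # Relevant topic: the topic assigned to the most sentences for this item.
    relevant_topic, _ = Counter(s.topic for s in own).most_common(1)[0]
    in_topic = [s for s in own if s.topic == relevant_topic]
    # Majority sentiment among this item's sentences on that topic.
    majority, _ = Counter(s.sentiment for s in in_topic).most_common(1)[0]
    candidates = [s for s in in_topic if s.sentiment == majority]
    return max(candidates, key=lambda s: s.relevance)


# Hypothetical, already-labeled sentences for two items in one category.
corpus = [
    Sentence("Battery easily lasts two days", "A", 0, "positive", 0.80),
    Sentence("Battery life is solid", "A", 0, "positive", 0.70),
    Sentence("Battery died within a week", "A", 0, "negative", 0.95),
    Sentence("Screen is too dim", "A", 1, "negative", 0.90),
    Sentence("Screen looks gorgeous", "B", 1, "positive", 0.85),
]
quote = select_quote(corpus, "A")
```

In this sample the negative battery sentence has the highest relevance score, but it is passed over because most of item A's battery sentences are positive.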
Specification