Analyzing distributed group discussions
First Claim
1. A computer-implemented method comprising:
- analyzing, by one or more configured computing systems, contents of a plurality of textual comments to identify topics that are mentioned in the contents, wherein the plurality of textual comments are supplied by human users from multiple geographical locations to multiple information services during a specified prior time period;
using, by the one or more configured computing systems, the contents of the plurality of textual comments to automatically determine a subset of the identified topics that are part of a specified content category for the specified prior time period, the using of the contents including;
generating, by the one or more configured computing systems, a plurality of comment groups based on the identified topics, each of the generated comment groups being associated with one of the identified topics and including one or more of the plurality of textual comments based on the contents of the one or more textual comments mentioning the associated identified topic;
identifying, by the one or more configured computing systems, a subset of the plurality of textual comments that are associated with the specified content category based on the contents of the textual comments of the identified subset, the textual comments of the identified subset being included in multiple of the generated comment groups;
determining, by the one or more configured computing systems, a subset of the multiple generated comment groups that correspond to the specified content category for the specified prior time period, the determining including excluding at least one first comment group whose included textual comments appear in the identified subset less than a determined minimum threshold, the determining further including excluding at least one second comment group whose included textual comments appear in the identified subset more than a determined maximum threshold; and
selecting, by the one or more configured computing systems and for each generated comment group in the subset of generated comment groups, the associated topic for the generated comment group to include in the subset of the identified topics that are part of the specified content category for the specified prior time period, wherein the subset of the identified topics includes multiple topics;
providing, by the one or more configured computing systems, indications of the multiple topics as representing the specified content category for the specified prior time period;
tracking, by the one or more configured computing systems and for at least one comment group in the subset, changes between the specified prior time period and one or more other time periods in at least one of textual comments supplied from the multiple geographical locations for the topic associated with the at least one comment group, or textual comments supplied to the multiple information sources for the topic associated with the at least one comment group; and
providing, by the one or more configured computing systems, information about the tracked changes between the specified prior time period and the one or more other time periods for the topic associated with the at least one comment group.
3 Assignments
0 Petitions
Accused Products
Abstract
Techniques are described for analyzing user-supplied information, including in at least some situations to predict future aspects of additional related information that will be supplied by users. The user-supplied information that is analyzed may, for example, include distributed group discussions that involve numerous users and occur via user comments made to one or more social networking sites and/or other computer-accessible sites. The analysis of user-supplied information may, for example, include determining particular topics that are of interest for a specified category during one or more periods of time, quantifying an amount of user interest in particular topics and the category during the period of time, predicting future amounts of user interest in the particular topics and the category during one or more future period of times, and taking one or more further actions based on the predicted information.
28 Citations
30 Claims
-
1. A computer-implemented method comprising:
-
analyzing, by one or more configured computing systems, contents of a plurality of textual comments to identify topics that are mentioned in the contents, wherein the plurality of textual comments are supplied by human users from multiple geographical locations to multiple information services during a specified prior time period; using, by the one or more configured computing systems, the contents of the plurality of textual comments to automatically determine a subset of the identified topics that are part of a specified content category for the specified prior time period, the using of the contents including; generating, by the one or more configured computing systems, a plurality of comment groups based on the identified topics, each of the generated comment groups being associated with one of the identified topics and including one or more of the plurality of textual comments based on the contents of the one or more textual comments mentioning the associated identified topic; identifying, by the one or more configured computing systems, a subset of the plurality of textual comments that are associated with the specified content category based on the contents of the textual comments of the identified subset, the textual comments of the identified subset being included in multiple of the generated comment groups; determining, by the one or more configured computing systems, a subset of the multiple generated comment groups that correspond to the specified content category for the specified prior time period, the determining including excluding at least one first comment group whose included textual comments appear in the identified subset less than a determined minimum threshold, the determining further including excluding at least one second comment group whose included textual comments appear in the identified subset more than a determined maximum threshold; and selecting, by the one or more configured computing systems and for each generated comment group in the subset of generated comment groups, the associated topic for the generated comment group to include in the subset of the identified topics that are part of the specified content category for the specified prior time period, wherein the subset of the identified topics includes multiple topics; providing, by the one or more configured computing systems, indications of the multiple topics as representing the specified content category for the specified prior time period; tracking, by the one or more configured computing systems and for at least one comment group in the subset, changes between the specified prior time period and one or more other time periods in at least one of textual comments supplied from the multiple geographical locations for the topic associated with the at least one comment group, or textual comments supplied to the multiple information sources for the topic associated with the at least one comment group; and providing, by the one or more configured computing systems, information about the tracked changes between the specified prior time period and the one or more other time periods for the topic associated with the at least one comment group. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14)
-
-
15. A non-transitory computer-readable medium having stored contents that cause one or more computing systems to perform a method, the method comprising:
-
analyzing, by the one or more computing systems, a plurality of user-supplied content items to identify a plurality of attributes that are each associated with one or more of the plurality of user-supplied content items; generating, by the one or more computing systems, a plurality of comment groups based on the identified attributes, each of the generated comment groups being associated with one of the identified attributes and including one or more of the plurality of user-supplied content items that have the associated identified attribute; identifying, by the one or more computing systems, a subset of the plurality of user-supplied content items that are associated with a specified content category based on contents of the user-supplied content items of the identified subset, the user-supplied content items of the identified subset being included in multiple of the generated comment groups; determining, by the one or more computing systems, a subset of the multiple generated comment groups that correspond to the specified content category, the determining including excluding one or more comment groups each having included user-supplied content items that appear in the identified subset less than a determined minimum threshold or more than a determined maximum threshold; and providing, by the one or more computing systems, information identifying one or more determined topics for the specified content category based on the associated identified attributes for one or more of the generated comment groups in the determined subset. - View Dependent Claims (16, 17, 18, 19, 20, 21, 22, 23, 24)
-
-
25. A system, comprising:
-
one or more hardware processors of one or more computing systems; and one or more modules that, when executed by at least one of the one or more hardware processors, cause the at least one hardware processors to determine multiple topics associated with a specified content category based on information supplied by users, the determining of the multiple topics including; analyzing a plurality of user-supplied content items to identify a plurality of attributes that are each associated with one or more of the plurality of user-supplied content items, including identifying topics associated with the plurality of user-supplied content items, and wherein the plurality of user-supplied content items are supplied by a plurality of users during a time period; using the identified attributes of the plurality of user-supplied content items to automatically determine a subset of the identified topics that are part of the specified content category for the time period, the using of the identified attributes including; generating a plurality of comment groups based on the identified attributes, each of the generated comment groups being associated with one of the identified attributes and including one or more of the plurality of user-supplied content items that have the associated identified attribute; identifying a subset of the plurality of user-supplied content items that are associated with the specified content category based on the identified attributes of the user-supplied content items of the identified subset, the user-supplied content items of the identified subset being included in multiple of the generated comment groups; determining a subset of the multiple generated comment groups that correspond to the specified content category, the determining including excluding one or more comment groups each having included user-supplied content items that appear in the identified subset less than a determined minimum threshold or more than a determined maximum threshold; and selecting, for each generated comment group in the subset of generated comment groups, an associated topic for the generated comment group to include in the determined subset of the identified topics that are part of the specified content category for the time period; providing indications of the identified topics of the determined subset as being associated with the specified content category for the time period; tracking changes, between the time period and one or more other time periods in topics, in topics determined as being associated with the specified content category; and providing information about the tracked changes between the time period and the one or more other time periods in the topics determined as being associated with the specified content category. - View Dependent Claims (26, 27, 28, 29, 30)
-
Specification