Methods and systems for determining a risk of an emotional response of an audience
First Claim
Patent Images
1. A method for using a vector data structure to increase a speed of determining a risk of an emotional response of an audience to at least one publication, the method is implemented by a server computing device in communication with a database, the method comprising:
- receiving, by the server computing device, a plurality of documents in an electronic format;
identifying, by the server computing device for each of a plurality of risk factors, a first set of the documents relevant to the risk factor and a second set of the documents not relevant to the risk factor, wherein each risk factor is a predetermined characteristic that influences the emotional response of the audience;
analyzing, by the server computing device for each of the risk factors, the corresponding first set of documents to determine occurrences and distributions of words within the first set of documents, and storing in the database, for each of the risk factors, an associated prototype vector, wherein each prototype vector includes a word set, each of the words in the word set weighted by a normalized frequency of occurrence of the word in the corresponding first set of documents;
comparing, by the server computing device, each prototype vector to occurrences and distributions of the words in the associated word set in the second set of documents not relevant to the associated risk factor to verify the prototype vector;
receiving, by the server computing device, the at least one publication, the at least one publication comprising one of a news item, an article, an opinion, a blog entry, and a social media message, the at least one publication communicated over a network, and converting the at least one publication from a first format to a second format by converting one of images, video, and audio to text that represents the at least one publication;
retrieving, by the server computing device from the database, the prototype vectors, each prototype vector stored in the database as a multi-dimensional data structure having a first column that includes each of the words in the word set, a second column that includes the normalized frequency of occurrence of each of the words in the word set, and a third column that includes the weight assigned to each of the words in the word set;
generating, by the server computing device, a test vector corresponding to each prototype vector, wherein each test vector includes each word in the corresponding prototype vector that also appears in the at least one publication;
determining, by the server computing device, a magnitude of each risk factor by comparing each test vector to the corresponding prototype vector;
retrieving, by the server computing device from the database, a model for weighting and summing the determined magnitudes of the risk factors, the model generated by the server computing device by performing principal component analysis (PCA) on a predetermined number of publications and performing an orthogonal transformation to determine a first principal component of the risk of the emotional response of the audience and a second principal component of the risk of the emotional response of the audience, and extracting eigenvalues from a correlation coefficient matrix to determine which risk factors most influence the audience;
determining, by the server computing device, a risk of an emotional response of the audience by weighting and summing the magnitudes of the risk factors according to the retrieved model, and transmitting the risk of the emotional response of the audience to at least one client computing device; and
generating based on the risk of the emotional response before an emotional response occurs and outputting, by the server computing device to a user interface displayed on a display of the at least one client computing device, an electronic user interactive visual representation of the risk of the emotional response in a first geographic region and a second geographic region that is different from the first geographic region that simultaneously shows a variation of the risk of the emotional response in the first geographic region and the second geographic region that is different from the first geographic region, the electronic user interactive visual representation showing on the display for both the first geographic region and the second geographic region a respective point on a graph of the first principal component of the risk of the emotional response of the audience having a range from negative one representing a low risk to positive one representing a high risk along a first axis and a second principal component of the risk of the emotional response of the audience having a range from negative one representing the low risk to positive one representing the high risk along a second axis.
1 Assignment
0 Petitions
Accused Products
Abstract
A method for determining a risk of an emotional response of an audience to at least one publication is described. Additionally, a computing device for determining a risk of an emotional response of an audience to at least one publication is described. Moreover, a computer-readable storage device having processor-executable instructions embodied thereon, for determining a risk of an emotional response of an audience to at least one publication, is described.
-
Citations
18 Claims
-
1. A method for using a vector data structure to increase a speed of determining a risk of an emotional response of an audience to at least one publication, the method is implemented by a server computing device in communication with a database, the method comprising:
-
receiving, by the server computing device, a plurality of documents in an electronic format; identifying, by the server computing device for each of a plurality of risk factors, a first set of the documents relevant to the risk factor and a second set of the documents not relevant to the risk factor, wherein each risk factor is a predetermined characteristic that influences the emotional response of the audience; analyzing, by the server computing device for each of the risk factors, the corresponding first set of documents to determine occurrences and distributions of words within the first set of documents, and storing in the database, for each of the risk factors, an associated prototype vector, wherein each prototype vector includes a word set, each of the words in the word set weighted by a normalized frequency of occurrence of the word in the corresponding first set of documents; comparing, by the server computing device, each prototype vector to occurrences and distributions of the words in the associated word set in the second set of documents not relevant to the associated risk factor to verify the prototype vector; receiving, by the server computing device, the at least one publication, the at least one publication comprising one of a news item, an article, an opinion, a blog entry, and a social media message, the at least one publication communicated over a network, and converting the at least one publication from a first format to a second format by converting one of images, video, and audio to text that represents the at least one publication; retrieving, by the server computing device from the database, the prototype vectors, each prototype vector stored in the database as a multi-dimensional data structure having a first column that includes each of the words in the word set, a second column that includes the normalized frequency of occurrence of each of the words in the word set, and a third column that includes the weight assigned to each of the words in the word set; generating, by the server computing device, a test vector corresponding to each prototype vector, wherein each test vector includes each word in the corresponding prototype vector that also appears in the at least one publication; determining, by the server computing device, a magnitude of each risk factor by comparing each test vector to the corresponding prototype vector; retrieving, by the server computing device from the database, a model for weighting and summing the determined magnitudes of the risk factors, the model generated by the server computing device by performing principal component analysis (PCA) on a predetermined number of publications and performing an orthogonal transformation to determine a first principal component of the risk of the emotional response of the audience and a second principal component of the risk of the emotional response of the audience, and extracting eigenvalues from a correlation coefficient matrix to determine which risk factors most influence the audience; determining, by the server computing device, a risk of an emotional response of the audience by weighting and summing the magnitudes of the risk factors according to the retrieved model, and transmitting the risk of the emotional response of the audience to at least one client computing device; and generating based on the risk of the emotional response before an emotional response occurs and outputting, by the server computing device to a user interface displayed on a display of the at least one client computing device, an electronic user interactive visual representation of the risk of the emotional response in a first geographic region and a second geographic region that is different from the first geographic region that simultaneously shows a variation of the risk of the emotional response in the first geographic region and the second geographic region that is different from the first geographic region, the electronic user interactive visual representation showing on the display for both the first geographic region and the second geographic region a respective point on a graph of the first principal component of the risk of the emotional response of the audience having a range from negative one representing a low risk to positive one representing a high risk along a first axis and a second principal component of the risk of the emotional response of the audience having a range from negative one representing the low risk to positive one representing the high risk along a second axis. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. A server computing device for using a vector data structure to increase a speed of determining a risk of an emotional response of an audience to at least one publication, the server computing device is communicatively coupled to a database, the server computing device comprising:
-
a processor; and a memory device in communication with the processor, the processor to; receive a plurality of documents in an electronic format; identify, for each of a plurality of risk factors, a first set of the documents relevant to the risk factor and a second set of the documents not relevant to the risk factor, wherein each risk factor is a predetermined characteristic that influences the emotional response of the audience; analyze, for each of the risk factors, the corresponding first set of documents to determine occurrences and distributions of words within the first set of documents, and store in the database, for each of the risk factors, an associated prototype vector, wherein each prototype vector includes a word set, each of the words in the word set weighted by a normalized frequency of occurrence of the word in the corresponding first set of documents; compare each prototype vector to occurrences and distributions of the words in the associated word set in the second set of documents not relevant to the associated risk factor to verify the prototype vector; receive the at least one publication, the at least one publication comprising one of a news item, an article, an opinion, a blog entry, and a social media message, the at least one publication communicated over a network, and convert the at least one publication from a first format to a second format by converting one of images, video, and audio to text that represents the at least one publication; retrieve, from the database, the prototype vectors, each prototype vector stored in the database as a multi-dimensional data structure having a first column that includes each of the words in the word set, a second column that includes the normalized frequency of occurrence of each of the words in the word set, and a third column that includes the weight assigned to each of the words in the word set; generate a test vector corresponding to each prototype vector, wherein each test vector includes each word in the corresponding prototype vector that also appears in the at least one publication; determine a magnitude of each risk factor by comparing each test vector to the corresponding prototype vector; retrieve, from the database, a model for weighting and summing the determined magnitudes of the risk factors, the model generated by the server computing device by performing principal component analysis (PCA) on a predetermined number of publications and performing an orthogonal transformation to determine a first principal component of the risk of the emotional response of the audience and a second principal component of the risk of the emotional response of the audience, and extracting eigenvalues from a correlation coefficient matrix to determine which risk factors most influence the audience; determine a risk of an emotional response of the audience by weighting and summing the magnitudes of the risk factors according to the retrieved model, and transmitting the risk of the emotional response of the audience to at least one client computing device; and generate based on the risk of the emotional response before an emotional response occurs and output to a user interface displayed on a display of the at least one client computing device an electronic user interactive graphical representation of the risk of the emotional response in a first geographic region and a second geographic region that is different from the first geographic region that simultaneously shows a variation of the risk of the emotional response in the first geographic region and the second geographic region that is different from the first geographic region, the electronic user interactive visual representation showing on the display for both the first geographic region and the second geographic region a respective point on a graph of the first principal component of the risk of the emotional response of the audience having a range from negative one representing a low risk to positive one representing a high risk along a first axis and a second principal component of the risk of the emotional response of the audience having a range from negative one representing the low risk to positive one representing the high risk along a second axis. - View Dependent Claims (12, 13, 14, 15, 16)
-
-
17. A non-transitory computer-readable storage device having processor-executable instructions embodied thereon for using a vector data structure to increase a speed of determining a risk of an emotional response of an audience to at least one publication, wherein when executed by a server computing device communicatively coupled to a database, the processor-executable instructions cause the server computing device to:
-
receive a plurality of documents in an electronic format; identify, for each of a plurality of risk factors, a first set of the documents relevant to the risk factor and a second set of the documents not relevant to the risk factor, wherein each risk factor is a predetermined characteristic that influences the emotional response of the audience; analyze, for each of the risk factors, the corresponding first set of documents to determine occurrences and distributions of words within the first set of documents, and store in the database, for each of the risk factors, an associated prototype vector, wherein each prototype vector includes a word set, each of the words in the word set weighted by a normalized frequency of occurrence of the word in the corresponding first set of documents; compare each prototype vector to occurrences and distributions of the words in the associated word set in the second set of documents not relevant to the associated risk factor to verify the prototype vector; receive the at least one publication, the at least one publication comprising one of a news item, an article, an opinion, a blog entry, and a social media message, the at least one publication communicated over a network, and convert the at least one publication from a first format to a second format by converting one of images, video, and audio to text that represents the at least one publication; retrieve, from the database, the prototype vectors, each prototype vector stored in the database as a multi-dimensional data structure having a first column that includes each of the words in the word set, a second column that includes the normalized frequency of occurrence of each of the words in the word set, and a third column that includes the weight assigned to each of the words in the word set; generate a test vector corresponding to each prototype vector, wherein each test vector includes each word in the corresponding prototype vector that also appears in the at least one publication; determine a magnitude of each risk factor by comparing each test vector to the corresponding prototype vector; retrieve, from the database, a model for weighting and summing the determined magnitudes of the risk factors, the model generated by the server computing device by performing principal component analysis (PCA) on a predetermined number of publications and performing an orthogonal transformation to determine a first principal component of the risk of the emotional response of the audience and a second principal component of the risk of the emotional response of the audience, and extracting eigenvalues from a correlation coefficient matrix to determine which risk factors most influence the audience; determine a risk of an emotional response of the audience by weighting and summing the magnitudes of the risk factors according to the retrieved model, and transmit the risk of the emotional response of the audience to at least one client computing device; and generate based on the risk of the emotional response before an emotional response occurs and output to a user interface displayed on a display of the at least one client computing device, an electronic user interactive graphical representation of the risk of the emotional response in a first geographic region and a second geographic region that is different from the first geographic region that simultaneously shows a variation of the risk of the emotional response in the first geographic region and the second geographic region that is different from the first geographic region, the electronic user interactive visual representation showing on the display for both the first geographic region and the second geographic region a respective point on a graph of the first principal component of the risk of the emotional response of the audience having a range from negative one representing a low risk to positive one representing a high risk along a first axis and a second principal component of the risk of the emotional response of the audience having a range from negative one representing the low risk to positive one representing the high risk along a second axis. - View Dependent Claims (18)
-
Specification