Automated media analysis and document management system
First Claim
Patent Images
1. A web-based media analysis computer system for analyzing at least one media content that includes text, comprising:
- i. a memory comprising at least one database being accessible to the computer system;
ii. the computer system implementing an uploader that uploads said at least one media content from at least one content provider over a communication network;
iii. the computer system implementing a parser that converts each of said at least one media content into serialized data, filters out unessential data from said serialized data according to a predefined list of unessential data, rationalizes nouns, pronouns and names in said each of said at least one media content, and extracts attributes and categories of said each of said at least one media content, one or more quotes, and attributes of said one or more quotes from said serialized and filtered data of said each of said at least one media content using regular expressions and predefined parsing rules, relationally stores and cross-references said serialized and filtered data, said extracted attributes and said categories of said each of said at least one media content, said one or more quotes, and said attributes of said one or more quotes into a plurality of tables in said at least one database, wherein said attributes of said one or more quotes comprise the name of the quoted person or organization; and
iv. the computer system implementing an analysis module that retrieves data from said plurality of said tables in said at least one database and wherein said analysis module further comprises a toning engine that determines a tone level of said each of said at least one media, said toning engine is provided with tone level probabilities of meaningful minimum sections, tone level probabilities for attributes, and tone level probabilities for categories in said at least one database, said tone engine;
a. parses each of said at least one media content and splitting into serialized meaningful minimum sections;
b. retrieves, from said at least one database, said tone level probabilities for each of said serialized meaningful minimum sections that cause said each of said at least one media content to be toned at, and determines most probable tone levels of said each of said at least one media content based on said tone level probabilities of said serialized meaningful minimum sections;
c. retrieves, from said at least one database, said extracted attributes of said each of said at least one media content and said tone level probabilities for each of said attributes of said each of said at least one media content, and determines most probable tone levels of said each of said at least one media content based on said tone level probabilities of said attributes;
d. retrieves, from said at least one database, said extracted categories of said each of said at least one media content and said tone level probabilities for each of said categories of said each of said at least one media content, and determines most probable tone levels of said each of said at least one media content based on said tone level probabilities of said categories; and
establishes said most probable tone level of said each of said at least one media content by weighing and ranking said most probable tone levels of said each of said at least one media content based on said tone level probabilities of said each of said serialized meaningful minimum sections, said most probable tone levels of said each of said at least one media content based on said tone level probabilities of said attributes, and said most probable tone levels of said each of said at least one media content based on said tone level probabilities of said categories; and
e. generates a report in response to a request from a client browser.
4 Assignments
0 Petitions
Accused Products
Abstract
A web-based media analysis system, consisting of automated media analysis and document management tools, which processes news articles by parsing the news contents or documents and assigning, relating, and extracting information from the news contents for media analysis and relationally storing them in at least one database. The system further comprises a toning engine for toning articles accurately, based on words, attributes and categories of the article, and optionally based on the author of the article, if applicable.
36 Citations
25 Claims
-
1. A web-based media analysis computer system for analyzing at least one media content that includes text, comprising:
-
i. a memory comprising at least one database being accessible to the computer system; ii. the computer system implementing an uploader that uploads said at least one media content from at least one content provider over a communication network; iii. the computer system implementing a parser that converts each of said at least one media content into serialized data, filters out unessential data from said serialized data according to a predefined list of unessential data, rationalizes nouns, pronouns and names in said each of said at least one media content, and extracts attributes and categories of said each of said at least one media content, one or more quotes, and attributes of said one or more quotes from said serialized and filtered data of said each of said at least one media content using regular expressions and predefined parsing rules, relationally stores and cross-references said serialized and filtered data, said extracted attributes and said categories of said each of said at least one media content, said one or more quotes, and said attributes of said one or more quotes into a plurality of tables in said at least one database, wherein said attributes of said one or more quotes comprise the name of the quoted person or organization; and iv. the computer system implementing an analysis module that retrieves data from said plurality of said tables in said at least one database and wherein said analysis module further comprises a toning engine that determines a tone level of said each of said at least one media, said toning engine is provided with tone level probabilities of meaningful minimum sections, tone level probabilities for attributes, and tone level probabilities for categories in said at least one database, said tone engine; a. parses each of said at least one media content and splitting into serialized meaningful minimum sections; b. retrieves, from said at least one database, said tone level probabilities for each of said serialized meaningful minimum sections that cause said each of said at least one media content to be toned at, and determines most probable tone levels of said each of said at least one media content based on said tone level probabilities of said serialized meaningful minimum sections; c. retrieves, from said at least one database, said extracted attributes of said each of said at least one media content and said tone level probabilities for each of said attributes of said each of said at least one media content, and determines most probable tone levels of said each of said at least one media content based on said tone level probabilities of said attributes; d. retrieves, from said at least one database, said extracted categories of said each of said at least one media content and said tone level probabilities for each of said categories of said each of said at least one media content, and determines most probable tone levels of said each of said at least one media content based on said tone level probabilities of said categories; and
establishes said most probable tone level of said each of said at least one media content by weighing and ranking said most probable tone levels of said each of said at least one media content based on said tone level probabilities of said each of said serialized meaningful minimum sections, said most probable tone levels of said each of said at least one media content based on said tone level probabilities of said attributes, and said most probable tone levels of said each of said at least one media content based on said tone level probabilities of said categories; ande. generates a report in response to a request from a client browser. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A computer-implemented method for managing and analyzing at least one media content comprising the steps of:
-
i. uploading said at least one media content from at least one content provider; ii. converting each of said at least one media content into serialized data, filtering out unessential data from said serialized data according to a predefined list of unessential data, rationalizing nouns, pronouns and names therein, extracting attributes and categories of said each of said at least one media content, one or more quotes, and attributes of said one or more quotes in said each of said at least one media content using regular expressions, wherein said attributes of said one or more quotes comprise the name of the quoted person or organization; iii. relationally storing and cross-referencing said serialized and filtered data and said extracted attributes and said categories of said each of said at least one media content, said one or more quotes, and said attributes of said one or more quotes into a plurality of tables in at least one database; and iv. retrieving data from said plurality of tables in said at least one database in response to a request from a client browser and generating a report and toning said at least one media content prior to said generating said report by carrying out the process comprising the steps of; a. parsing each of said at least one media content and splitting into serialized meaningful minimum sections; b. retrieving, from said at least one database, tone level probabilities for each of said serialized meaningful minimum sections that cause said each of said at least one media content to be toned, and determining most probable tone levels of said each of said at least one media content based on said tone level probabilities of said serialized meaningful minimum sections; c. retrieving, from said at least one database, tone level probabilities for each of said attributes of said each of said at least one media content, and determining most probable tone levels of said each of said at least one media content based on said tone level probabilities of said attributes; d. retrieving, from said at least one database, tone level probabilities for each of said categories of said each of said at least one media content, and determining most probable tone levels of said each of said at least one media content based on said tone level probabilities of said categories; and e. establishing said most probable tone level of said each of at least one media content by weighing and ranking said most probable tone levels of said each of said at least one media content based on said tone level probabilities of said each of said serialized meaningful minimum sections, said most probable tone levels of said each of said at least one media content based on said tone level probabilities of said attributes, and said most probable tone levels of said each of said at least one media content based on said tone level probabilities of said categories. - View Dependent Claims (9, 10, 11, 12, 13)
-
-
14. A tangible computer readable medium storing executable computer program instructions which, when executed at a server, cause the server to perform a process for analyzing at least one media content, the process comprising the steps of:
-
i. unloading said at least one media content from at least one content provider over a communication network; ii. converting each of said at least one media content into serialized data, filtering out unessential data from said serialized data according to a predefined list of unessential data, extracting attributes and categories of said each of said at least one media content, and extracting, one or more quotes and attributes of said one or more quotes in said each of said at least one media content using regular expressions, wherein said attributes of said one or more quotes comprise the name of the quoted person or organization; iii. relationally storing and cross-referencing said serialized and filtered data, said extracted attributes, said extracted categories and said one or more quotes into a plurality of tables in at least one database; iv. retrieving data from said plurality of said tables in said at least one database and generating a report in response to request from a user; and v. toning said at least one media content prior to said generating said report by carrying out the process steps of; a. parsing each of said at least one media content and splitting into serialized meaningful minimum sections; b. retrieving, from said at least one database, tone level probabilities for each of said serialized meaningful minimum sections that cause said each of said at least one media content to be toned at, and determining most probable tone levels of said each of said at least one media content based on said tone level probabilities of said serialized meaningful minimum sections; c. retrieving, from said at least one database, tone level probabilities for each of said attributes of said media content, and determining most probable tone levels of said each of said at least one media content based on said tone level probabilities of said attributes; d. retrieving, from said at least one database, tone level probabilities for each of said categories of said each of said at least one media content, and determining most probable tone levels of said each of said at least one media content based on said tone level probabilities of said categories; and e. establishing said most probable tone level of said each of said at least one media content by weighing and ranking said most probable tone levels of said each of said at least one media content based on said tone level probabilities of said each of said serialized meaningful minimum sections, said most probable tone levels of said each of said at least one media content based on said tone level probabilities of said attributes, and said most probable tone levels of said each of said at least one media content based on said tone level probabilities of said categories. - View Dependent Claims (15, 16, 17, 18, 19)
-
-
20. A computer-implemented method of toning at least one media contents comprising the steps of:
-
i. parsing each of said at least one media content, splitting and serializing into meaningful minimum sections, and extracting attributes and categories of said each of said at least one media content, one or more quotes, and attributes of said one or more quotes in said each of said at least one media content; ii. retrieving, from at least one database, tone level probabilities for each of said serialized meaningful minimum sections that cause said each of said at least one media content to be toned at, and determining most probable tone levels of said each of said at least one media content based on said tone level probabilities of said serialized meaningful minimum sections; iii. retrieving, from said at least one database, tone level probabilities for each of said attributes of said each of said at least one media content, and determining most probable tone levels of said each of said at least one media content based on said tone level probabilities of said attributes; iv. retrieving, from said at least one database, tone level probabilities for each of said categories of said each of said at least one media content, and determining most probable tone levels of said each of said at least one media content based on said tone level probabilities of said categories; and v. establishing said most probable tone level of said each of said at least one media content by weighing and ranking said most probable tone levels of said each of said at least one media content based on said tone level probabilities of said each of said serialized meaningful minimum sections, said most probable tone levels of said each of said at least one media content based on said tone level probabilities of said attributes, and said most probable tone levels of said each of said at least one media content based on said tone level probabilities of said categories. - View Dependent Claims (21, 22, 23, 24, 25)
-
Specification