Method, system, apparatus, program code and means for determining a redundancy of information
First Claim
1. A processor-implemented method for determining that first information in an input data packet is not redundant with second information previously stored in a database system, the method comprising:
- receiving an input data packet comprising first information, wherein the first information comprises one or more portions;
processing the received input data packet in accordance with coarse and fine political risk data filters to identify one or more political risk relevant portions;
determining via a processor portion scores, each portion score associated with each of the political risk relevant portions;
applying tags to at least one of the political risk relevant portions;
determining a first score comprising a total of the portion scores associated with the political risk relevant tagged portions;
generating a database query based on the political risk relevant tagged portions in said input data packet and on the first score;
receiving second information based on said database query;
comparing said second information with said first information to identify at least a first portion of said first information that is different than said second information based on the first score and a redundancy limit; and
facilitating storage of said at least a first portion of said first information in said database system based on the comparison of the first score and the redundancy limit.
11 Assignments
0 Petitions
Accused Products
Abstract
Some embodiments include a system, method, apparatus and means for determining that first information in an input data packet is not redundant with second information previously stored in a database system, includes receiving the input data packet, generating a database query based on one or more tagged portions in the input data packet, comparing second information retrieved by the database query with the first information to identify at least a first portion of the first information that is different than the second information, and causing storage of the at least a first portion of the first information in the database system.
75 Citations
14 Claims
-
1. A processor-implemented method for determining that first information in an input data packet is not redundant with second information previously stored in a database system, the method comprising:
-
receiving an input data packet comprising first information, wherein the first information comprises one or more portions; processing the received input data packet in accordance with coarse and fine political risk data filters to identify one or more political risk relevant portions; determining via a processor portion scores, each portion score associated with each of the political risk relevant portions; applying tags to at least one of the political risk relevant portions; determining a first score comprising a total of the portion scores associated with the political risk relevant tagged portions; generating a database query based on the political risk relevant tagged portions in said input data packet and on the first score; receiving second information based on said database query; comparing said second information with said first information to identify at least a first portion of said first information that is different than said second information based on the first score and a redundancy limit; and facilitating storage of said at least a first portion of said first information in said database system based on the comparison of the first score and the redundancy limit. - View Dependent Claims (2, 3, 4, 7, 8, 9, 10, 11)
-
-
5. A system, comprising:
-
a processor; and a storage device in communication with said processor and storing instructions adapted to be executed by said processor to; receive an input data packet comprising first information, wherein the first information comprises one or more portions; process the received input data packet in accordance with coarse and fine political risk data filters to identify one or more political risk relevant portions; determine portion scores, each portion score associated with each of the political risk relevant portions; apply tags to at least one of the political risk relevant portions; determine a first score comprising a total of the portion scores associated with the political risk relevant tagged portions; generate a database query based on the political risk relevant tagged portions in said input data packet and on the first score; receive second information based on said database query; compare said second information with said first information to identify at least a first portion of said first information that is different than said second information based on the first score and a redundancy limit; and facilitate storage of said at least a first portion of said first information in said database system based on the comparison of the first score and the redundancy limit. - View Dependent Claims (12, 13, 14)
-
-
6. A processor-readable medium comprising a storage element, the storage element storing processor-issuable instructions to:
-
receive an input data packet comprising first information, wherein the first information comprises one or more portions; process the received input data packet in accordance with coarse and fine political risk data filters to identify one or more political risk relevant portions; determine portion scores, each portion score associated with each of the political risk relevant portions; apply a tag to at least one of the political risk relevant portions; determine a first score comprising a total of the of the portion scores associated with the political risk relevant tagged portions; generate a database query based on the political risk relevant tagged portions in said input data packet and on the first score; receive second information based on said database query; compare said second information with said first information to identify at least a first portion of said first information that is different than said second information based on the first score and a redundancy limit; and store said at least a first portion of said first information in said database system based on the comparison of the first score and the redundancy limit.
-
Specification