SELECTION OF RELIABLE KEY WORDS FROM UNRELIABLE SOURCES IN A SYSTEM AND METHOD FOR CONDUCTING A SEARCH
Abstract
The invention provides for a system to select data including a reception component that receives at least one data entry from at least one data source, a processor component to determine the entropy of a word extracted from the at least one data entry, a filtering component to select reliable words, wherein reliable words are words with low entropy values, the filtering component further excluding words with high entropy values, and a transmission component to output a set of reliable words, wherein the set of reliable words is associated with the at least one data entry from which the reliable words were extracted.
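The four components described above can be illustrated with a minimal sketch. This excerpt does not state how the entropy of a word is computed, so the sketch assumes Shannon entropy (in bits) over the distribution of category labels attached to the entries in which the word appears. The label on each entry, the function names `word_entropies` and `select_reliable_words`, and the `max_entropy` threshold are hypothetical and are not taken from the patent.

```python
import math
from collections import Counter, defaultdict


def word_entropies(entries):
    """Return {word: entropy in bits} for (text, label) pairs.

    Assumption: entropy is measured over the labels of the entries
    that contain the word. The patent excerpt does not define this.
    """
    label_counts = defaultdict(Counter)  # word -> Counter of labels
    for text, label in entries:
        # Count each word once per entry.
        for word in set(text.lower().split()):
            label_counts[word][label] += 1

    entropies = {}
    for word, by_label in label_counts.items():
        total = sum(by_label.values())
        probs = [n / total for n in by_label.values()]
        entropies[word] = sum(p * math.log2(1 / p) for p in probs)
    return entropies


def select_reliable_words(entries, max_entropy=0.5):
    """Pair each entry with its low-entropy ("reliable") words.

    max_entropy is a hypothetical cutoff: words above it are excluded.
    """
    entropies = word_entropies(entries)
    selected = []
    for text, _label in entries:
        words = sorted(
            {w for w in text.lower().split() if entropies[w] <= max_entropy}
        )
        selected.append((text, words))
    return selected


if __name__ == "__main__":
    # Made-up example entries, for illustration only.
    sample = [
        ("pizza delivery downtown", "restaurants"),
        ("best pizza in town", "restaurants"),
        ("best plumber in town", "home services"),
        ("emergency plumber", "home services"),
    ]
    for text, words in select_reliable_words(sample):
        print(text, "->", words)
```

Under this assumption, a word such as "pizza" that appears under only one label has zero entropy and is kept, while a word such as "best" that is spread evenly across labels has higher entropy and is excluded. Each output set stays paired with the entry it was extracted from, mirroring the association required of the transmission component.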
6 Claims
1. A system to select data, comprising:
a reception component that receives at least one data entry from at least one data source;
a processor component to determine the entropy of a word extracted from the at least one data entry;
a filtering component to select reliable words, wherein reliable words are words with low entropy values, the filtering component further excludes words with high entropy values; and
a transmission component to output a set of reliable words, wherein the set of reliable words is associated with the at least one data entry from which the reliable words were extracted.

3. A method for selecting data, comprising:
receiving at least one data entry from at least one data source;
determining the entropy of a word extracted from the at least one data entry;
selecting reliable words, wherein reliable words are words with low entropy values, and excluding words with high entropy values; and
outputting a set of reliable words, wherein the set of reliable words is associated with the at least one data entry from which the reliable words were extracted.

5. A computer-readable medium, having stored thereon a set of instructions which, when executed by at least one processor of at least one computer, executes a method for selecting data comprising:
receiving at least one data entry from at least one data source;
determining the entropy of a word extracted from the at least one data entry;
selecting reliable words, wherein reliable words are words with low entropy values, and excluding words with high entropy values; and
outputting a set of reliable words, wherein the set of reliable words is associated with the at least one data entry from which the reliable words were extracted.
Specification