APPARATUS AND METHOD FOR FILTERING DUPLICATE DATA IN RESTRICTED RESOURCE ENVIRONMENT
First Claim
1. An apparatus to filter duplicate data in a resource-restricted environment, the apparatus comprising:
- a cell array unit configured to comprise one or more cells;
a duplication check unit configured to check whether input data is duplicative, and set a value of a cell of the one or more cells that matches the input data; and
a duplication probability calculation unit configured to, in response to the input data being determined as duplicate data by the duplication check unit, calculate a probability of duplication of the input data using the set value of the cell.
1 Assignment
0 Petitions
Accused Products
Abstract
An apparatus and method for stably filtering duplicate data in various resource-restricted environments such as a mobile device and medical equipment are provided. The apparatus includes a cell array unit configured to comprise one or more cells; a duplication check unit configured to check whether input data is duplicate and set a value of a cell that matches the input data; and a duplication probability calculation unit configured to, in response to the input data being determined as duplicate data by the duplication check unit, calculate a probability of duplication of the input data using the set value of the cell. Data which may be duplicate data among a large amount of input data is not arbitrarily deleted, but is provided to an application along with a probability of duplication of the data. Accordingly, a false positive error that occurs in Bloom filter is prevented, and thereby system stability can be improved.
-
Citations
20 Claims
-
1. An apparatus to filter duplicate data in a resource-restricted environment, the apparatus comprising:
-
a cell array unit configured to comprise one or more cells; a duplication check unit configured to check whether input data is duplicative, and set a value of a cell of the one or more cells that matches the input data; and a duplication probability calculation unit configured to, in response to the input data being determined as duplicate data by the duplication check unit, calculate a probability of duplication of the input data using the set value of the cell. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
-
10. A method of filtering duplicate data in a resource-restricted environment, the method comprising:
-
checking whether input data is duplicative; setting a value of a cell that matches the input data; and in response to the input data being determined as duplicate data, calculating a probability of duplication of the input data using the set value of the cell. - View Dependent Claims (11, 12, 13, 14, 15, 16, 17, 18)
-
-
19. An apparatus comprising:
a processor configured to increment at least one count value based on input data, determine whether the input data is probable duplicate data, and determine a probability of duplication of the input data based on the at least one count value in response to the input data being determined to be probable duplicate data. - View Dependent Claims (20)
Specification