Method for retrieving items represented by particles from an information database
First Claim
1. A method for converting a set of words to a corresponding set of particles, comprising the steps of:
acquiring the set of particles, each particle uniquely corresponds to a word in the set of words;
defining a cost as a function of a size of the set of particles, a frequency of occurrence of the particle in the set of particles, and a length of the particle, such that the cost decreases if the size of the set of particles is decreased, the cost decreases if the frequency of occurrence of the particle in the set of particles is increased, and the cost decreases if a number of phonemes in the particle is decreased;
partitioning each particle in the set of particles into a prefix particle and a suffix particle, if the partitioning minimizes the cost; and
repeating the partitioning until a desired number of unique particles in the set is achieved, wherein the steps of the method are performed by a processor.
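The steps of claim 1 can be illustrated with a minimal Python sketch. It is not the patented implementation: the cost formula (number of unique particles plus, for each unique particle, its length divided by its frequency) is only one assumed function with the three monotonic properties the claim recites, characters stand in for phonemes, and the names `cost`, `split_everywhere`, and `particlize` are hypothetical.

```python
from collections import Counter


def cost(segmentations):
    """Assumed toy cost: |P| + sum over unique particles of len(p) / freq(p).

    It falls when the particle set shrinks, when a particle occurs more
    often, and when a particle gets shorter.
    """
    freq = Counter(p for seq in segmentations.values() for p in seq)
    return len(freq) + sum(len(p) / n for p, n in freq.items())


def split_everywhere(segmentations, particle, i):
    """Replace every occurrence of `particle` by its prefix and suffix at index i."""
    prefix, suffix = particle[:i], particle[i:]
    return {
        word: [q for p in seq for q in ((prefix, suffix) if p == particle else (p,))]
        for word, seq in segmentations.items()
    }


def particlize(words, target):
    """Greedily split particles until at most `target` unique particles remain."""
    segs = {w: [w] for w in words}  # start with one particle per word
    while len({p for seq in segs.values() for p in seq}) > target:
        current = cost(segs)
        best = None
        # sorted() only makes tie-breaking deterministic
        for particle in sorted({p for seq in segs.values() for p in seq}):
            for i in range(1, len(particle)):
                candidate = split_everywhere(segs, particle, i)
                c = cost(candidate)
                if c < current and (best is None or c < best[0]):
                    best = (c, candidate)
        if best is None:
            break  # no prefix/suffix split lowers the cost
        segs = best[1]
    return segs
```

With the words `walk`, `walked`, `talk`, `talked` and a target of 3, this sketch splits `talked` and `walked` into `talk` + `ed` and `walk` + `ed`, leaving the three unique particles `walk`, `talk`, and `ed`. Under this particular cost a split only pays off when at least one of the resulting pieces is shared, so the loop also stops early if no split lowers the cost.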
Abstract
A set of words is converted to a corresponding set of particles, wherein the words and the particles are unique within each set. For each word, all possible partitionings of the word into particles are determined, and a cost is determined for each possible partitioning. The particles of the possible partitioning associated with a minimal cost are added to the set of particles.
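The exhaustive variant described in the abstract can be sketched as follows. This is an illustration under stated assumptions, not the method of the specification: characters stand in for phonemes, `toy_cost` is a hypothetical cost that charges 1 for reusing a known particle and 1 plus its length for introducing a new one, and the function names are invented for the example.

```python
from itertools import combinations


def partitionings(word):
    """Yield every way to cut `word` into contiguous, non-empty pieces."""
    n = len(word)
    for k in range(n):  # k = number of cut points
        for cuts in combinations(range(1, n), k):
            bounds = (0, *cuts, n)
            yield [word[a:b] for a, b in zip(bounds, bounds[1:])]


def toy_cost(pieces, particles):
    """Assumed cost: reusing a known particle is cheap, a new one pays for its length."""
    return sum(1 if p in particles else 1 + len(p) for p in pieces)


def add_word(word, particles):
    """Pick the minimal-cost partitioning and add its pieces to the particle set."""
    best = min(partitionings(word), key=lambda pieces: toy_cost(pieces, particles))
    particles.update(best)
    return best
```

A word of n units has 2^(n-1) partitionings, so this brute-force enumeration is only practical for short words. Starting from the particle set `{"un", "lock"}`, `add_word("unlocked", particles)` selects `["un", "lock", "ed"]` and adds `ed` to the set.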
12 Claims
Claim 1 is the sole independent claim; its text is given under First Claim above. Claims 2 through 12 depend on it.
Specification