Method and device for determining and outputting the similarity between two data strings
First Claim
1. A method comprising:
- receiving a first data string in an electronic component,receiving a second data string in said electronic component,determining pairs of consecutively following data entities in said first data string in a processing unit,determining the relative positions of said pairs of consecutively following data entities in said first data string in said processing unit,allocating a position label to each of said data entities in the first data string in said processing unit,numbering same data entities according to their relative position in accordance with the position label in said processing unit,determining similar data entities with the same order in said second data string in said processing unit,determining the relative positions of said determined data entities in said second data string in said processing unit,determining a matching measure by determining how far the relative positions of data entities in said second data string match with the relative positions of consecutively following data entities in said first data string in said processing unit, anddetermining a similarity measure which corresponds to the matching measure of at least one comparison result in said processing unit,repeating said determination of said similarity measure with a number of received second data strings in said processing unit, andoutputting by an interface said determined similarity measures for said data strings according to the amount of similarity to said first data string,wherein said first data string of entities and said second data string of entities are data strings relating to one of associative text string, genome analysis, speech recognition, and musical melody.
6 Assignments
0 Petitions
Accused Products
Abstract
The present invention discloses a method and device for determining and outputting a similarity measure between two data strings each data string comprising data entities, comprising: receiving a first data string, receiving a second data string, which is characterized by determining consecutively following data entities in the first data string, determining the relative positions of the consecutively following data entities in the first data string, determining similar data entities with the same order in the second data string, determining the relative positions of the determined data entities in the second data string, determining a matching measure by determining how far the relative positions of data entities in the second data string match with the relative positions of consecutively following data entities in the first data string, and outputting a similarity measure which corresponds to the matching measure of at least one comparison result.
13 Citations
16 Claims
-
1. A method comprising:
-
receiving a first data string in an electronic component, receiving a second data string in said electronic component, determining pairs of consecutively following data entities in said first data string in a processing unit, determining the relative positions of said pairs of consecutively following data entities in said first data string in said processing unit, allocating a position label to each of said data entities in the first data string in said processing unit, numbering same data entities according to their relative position in accordance with the position label in said processing unit, determining similar data entities with the same order in said second data string in said processing unit, determining the relative positions of said determined data entities in said second data string in said processing unit, determining a matching measure by determining how far the relative positions of data entities in said second data string match with the relative positions of consecutively following data entities in said first data string in said processing unit, and determining a similarity measure which corresponds to the matching measure of at least one comparison result in said processing unit, repeating said determination of said similarity measure with a number of received second data strings in said processing unit, and outputting by an interface said determined similarity measures for said data strings according to the amount of similarity to said first data string, wherein said first data string of entities and said second data string of entities are data strings relating to one of associative text string, genome analysis, speech recognition, and musical melody. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13)
-
-
14. An electronic device comprising:
-
a component configured to receive a first data string of entities and a second data string of entities, said first data string of entities and said second data string of entities being data strings relating to one of associative text string, genome analysis, speech recognition, and musical melody, a processing unit configured to determine pairs of consecutively following data entities in said first data string, determine the relative positions of said pairs of consecutively following data entities in said first data string, allocate a position label to each of said data entities in the first data string, number same data entities according to their relative position in accordance with the position label; determine similar data entities with the same order in said second data string, determine the relative positions of said determined data entities in said second data string, and determine a matching measure by determining how far the relative positions of data entities in said second data string match with the relative positions of consecutively following data entities in said first data string, and repeat said determination of said similarity measure with a number of received second data strings, and an interface configured to output a similarity measure for said second data string and said number of second data strings according to the amount of similarity to said first data string. - View Dependent Claims (15, 16)
-
Specification