Method for transforming words to unique numerical representation
First Claim
Patent Images
1. A computer-implemented method for transforming words to unique numerical representations, comprising:
- receiving a text including multiple words; and
transforming each of the received words into a unique numeral representation by using an A to Z helix transformation function such that the transformed unique numerical representation does not result in multiple similar numerical representations.
1 Assignment
0 Petitions
Accused Products
Abstract
Multiple words in a text are transformed to unique numerical representations for text mining applications. A web server receives the text, including multiple words in a natural language. A key-word extractor extracts one or more key-words from the received words. A morphologizer morphologizes the extracted key-words based on similarities of fundamental characteristics in the extracted key-words. An analyzer transforms each of the morphologized words to a unique numerical representation such that the transformed unique numerical representation does not result in multiple similar numerical representations.
10 Citations
13 Claims
-
1. A computer-implemented method for transforming words to unique numerical representations, comprising:
-
receiving a text including multiple words; and
transforming each of the received words into a unique numeral representation by using an A to Z helix transformation function such that the transformed unique numerical representation does not result in multiple similar numerical representations. - View Dependent Claims (2, 3, 4, 5, 12)
-
-
6. A computer-implemented method for transforming words expressed in letters of an alphabet based language to unique numerical representations, comprising:
-
receiving a text including multiple words; and
transforming each of the received words into a unique numeral representation such that the transformed unique numerical representation does not result in multiple similar numerical representations, wherein each of the received words is transformed into the unique numerical representation using an A to Z helix transformation function, wherein the A to Z helix transformation function comprises;
wherein W is a unique number obtained for a word having a length of l+1 letters, wherein the letters in the word W can be represented as β
l β
(l−
1) β
(l−
2) . . . β
0, and also wherein β
i represents the letter in the ith location of the alphabet in a particular language having n distinct letters in the alphabet of the language.
-
-
7. A computer-implemented system for transforming words in a text to unique numerical representations, comprising:
-
a web server to receive the text including multiple words in a natural language;
a key-word extractor to extract one or more key-words from the received words;
a morphologizer to morphologize the extracted key-words based on similarities in fundamental characteristics of the extracted key-words; and
an analyzer to transform each of the morphologized words to a unique numerical representation by using an A to Z helix transformation function such that the transformed unique numerical representation does not result in multiple similar numerical representations. - View Dependent Claims (8, 9, 10, 13)
-
-
11. A computer-implemented system for transforming words in a text expressed in letters of an alphabet based language to unique numerical representations, comprising:
-
a web server to receive the text including multiple words in a natural language;
a key-word extractor to extract one or more key-words from the received words;
a morphologizer to morphologize the extracted key-words based on similarities in fundamental characteristics of the extracted key-words; and
an analyzer to transform each of the morphologized words to a unique numerical representation such that the transformed unique numerical representation does not result in multiple similar numerical representations, wherein the analyzer transforms each of the morphologized words to a unique numerical representation using an A to Z helix transformation function, wherein the A to Z helix transformation function comprises;
wherein W is a unique number obtained for a word having a length of l+1 letters, wherein the letters in the word W can be represented as β
lβ
(l−
1) β
(l−
2) . . . β
0, and also wherein β
i represents the letter in the ith location of the alphabet in a particular language having n distinct letters in the alphabet of the language.
-
Specification