Labeling product identifiers and navigating products
First Claim
1. A method comprising:
- extracting description information of multiple products;
clustering the description information of the multiple products belonging to a particular model into a first text;
processing the first text by segmenting the first text to one ofremove from the first text one or more terms whose term frequencies are higher than a first set threshold, andremove from the first text one or more terms whose term frequencies are lower than a second set threshold;
clustering, after processing the first text, first texts of products belonging to different models into a second text;
applying a subject analysis to the second text to obtain one or more subjects;
defining one or more names for the one or more subjects respectively;
assigning a respective name of a respective subject correlated to description information of a respective product as an identifier of the respective product; and
labeling the respective product by using the identifier,wherein the applying the subject analysis to the second text to obtain one or more subjects comprises;
setting a number of subjects in one or more subject models;
applying the subject analysis to the second text by using a text analysis method based on the one or more subject models;
obtaining a number of subsets corresponding to the number of subjects from a set of terms in the second text, the number of subsets being equal to the number of subjects, a respective subset corresponding to a respective subject; and
according to the respective subset that one or more terms in the description information of the products locate, correlating the description information of the products to the respective subject corresponding to the respective subset.
1 Assignment
0 Petitions
Accused Products
Abstract
The present disclosure provides example methods and apparatuses of labeling product identifiers and methods of navigating products. Description information of one or more products is extracted. The description information of the products is clustered into a text. A subject analysis is applied to the text by using a text analysis method based on subject models to obtain one or more subjects and definition names for the subjects. A subject that is correlated to the description information of the product is used as an identifier of the product to label the product. The present techniques label the products with identifiers that have one or more user dimension attributes so that users may easily and intuitively find their desired products.
-
Citations
14 Claims
-
1. A method comprising:
-
extracting description information of multiple products; clustering the description information of the multiple products belonging to a particular model into a first text; processing the first text by segmenting the first text to one of remove from the first text one or more terms whose term frequencies are higher than a first set threshold, and remove from the first text one or more terms whose term frequencies are lower than a second set threshold; clustering, after processing the first text, first texts of products belonging to different models into a second text; applying a subject analysis to the second text to obtain one or more subjects; defining one or more names for the one or more subjects respectively; assigning a respective name of a respective subject correlated to description information of a respective product as an identifier of the respective product; and labeling the respective product by using the identifier, wherein the applying the subject analysis to the second text to obtain one or more subjects comprises; setting a number of subjects in one or more subject models; applying the subject analysis to the second text by using a text analysis method based on the one or more subject models; obtaining a number of subsets corresponding to the number of subjects from a set of terms in the second text, the number of subsets being equal to the number of subjects, a respective subset corresponding to a respective subject; and according to the respective subset that one or more terms in the description information of the products locate, correlating the description information of the products to the respective subject corresponding to the respective subset. - View Dependent Claims (2, 3, 4, 5, 6)
-
-
7. A method comprising:
-
extracting description information of multiple products; clustering the description information of the multiple products belonging to a particular model into a first text; processing the first text by segmenting the first text to one of; remove from the first text one or more terms whose term frequencies are higher than a first set threshold, and remove from the first text one or more terms whose term frequencies are lower than a second set threshold; clustering, after processing the first text, first texts of products belonging to different models into a second text; applying a subject analysis to the second text to obtain one or more subjects; correlating the multiple products to the one or more subjects; and navigating the multiple products according to a respective subject correlated to a respective product, wherein the applying the subject analysis to the second text to obtain one or more subjects comprises; setting a number of subjects in one or more subject models; applying the subject analysis to the second text by using a text analysis method based on the one or more subject models; obtaining a number of subsets corresponding to the number of subjects from a set of terms included in the second text, the number of subsets being equal to the number of subjects, a respective subset corresponding to a respective subject; and according to the respective subset that one or more terms in the description information of the products locate, correlating the description information of the products to the respective subject corresponding to the respective subset. - View Dependent Claims (8, 9, 10, 14)
-
-
11. An apparatus comprising a memory and a processor that executes computer executable instructions stored in the memory to cause the processor to:
-
extract description information of multiple products; cluster the description information of the multiple products belonging to a particular model into a first text; process the first text by segmenting the first text to one of remove from the first text one or more terms whose term frequencies are higher than a first set threshold, and remove from the first text one or more terms whose term frequencies are lower than a second set threshold; cluster, after the first text is processed, first texts of products belonging to different models into a second text; apply a subject analysis to the second text to obtain one or more subjects and defines one or more names for the one or more subjects, respectively; and assign a respective name of a respective subject correlated to description information of a respective product as an identifier of the respective product and label the respective product by using the identifier, wherein the computer executable instructions stored in the memory cause the processor to; set a number of subjects in one or more subject models; apply the subject analysis to the second text by using a text analysis method based on the one or more subject models; obtain a number of subsets corresponding to the number of subjects from a set of terms included in the second text, the number of subsets being equal to the number of subjects, a respective subset corresponding to a respective subject; and according to the respective subset that one or more terms in the description information of the products locate, correlate the description information of the products to the respective subject corresponding to the respective subset. - View Dependent Claims (12, 13)
-
Specification