Dictionary for hierarchical attributes from catalog items
First Claim
1. A system comprising:
- a dictionary manager embodied via executable instructions stored on a computer-readable storage medium, the dictionary manager including;
an item input engine configured to obtain a plurality of item character strings, each item character string representing an item in a catalog, each item associated with a category;
a brand input engine configured to obtain a plurality of brand character strings associated with the category;
a brand matching engine configured to determine, for each item character string included in the plurality of item character strings, whether a matched substring of the each item character string represents a match with one of the obtained brand character strings;
a matching indicator engine configured to generate a matching indicator indicating that the each item character string includes a match with the one of the obtained brand character strings, based on a match result of the matching determination by the brand matching engine;
a correlated segment engine configured to obtain, for each one of the obtained brand character strings, and for each one of the item character strings that includes a match with the each one of the obtained brand character strings, one or more correlated segments, other than the matched substring, of the each one of the item character strings that includes a match with the each one of the obtained brand character strings, based on determining that the obtained correlated segments are correlated, greater than a predetermined correlation threshold, with the each one of the obtained brand character strings;
a hierarchy generator configured to generate a dictionary hierarchy based on the obtained correlated segments;
a catalog input engine configured to obtain the catalog, the catalog including a list of items represented by item character strings;
a brand expansion engine configured to request an expanded list of brand values based on an initial list of brand values associated with the category; and
a category determination engine configured to request a determination of the category associated with the plurality of item character strings,the item input engine configured to obtain the plurality of item character strings, each item character string representing an item in the catalog, each item associated with the category, based on the determination of the category determined by the category determination engine.
2 Assignments
0 Petitions
Accused Products
Abstract
A plurality of items included in a catalog may be obtained, each item associated with an item category. Brand indicators may be obtained, each brand indicator associated with the item category. Brand indicators associated with each of the items may be determined, and the each item may be assigned to a partition group associated with the brand indicator that is associated with the each item. Correlated string tokens that are correlated, greater than a predetermined correlation threshold value, with the brand indicator associated with the partition group that is associated with the each one of the items, the correlated string tokens associated with the each one of the plurality of items, may be determined. A dictionary hierarchy may be generated based on the one or more correlated string tokens.
21 Citations
20 Claims
-
1. A system comprising:
-
a dictionary manager embodied via executable instructions stored on a computer-readable storage medium, the dictionary manager including; an item input engine configured to obtain a plurality of item character strings, each item character string representing an item in a catalog, each item associated with a category; a brand input engine configured to obtain a plurality of brand character strings associated with the category; a brand matching engine configured to determine, for each item character string included in the plurality of item character strings, whether a matched substring of the each item character string represents a match with one of the obtained brand character strings; a matching indicator engine configured to generate a matching indicator indicating that the each item character string includes a match with the one of the obtained brand character strings, based on a match result of the matching determination by the brand matching engine; a correlated segment engine configured to obtain, for each one of the obtained brand character strings, and for each one of the item character strings that includes a match with the each one of the obtained brand character strings, one or more correlated segments, other than the matched substring, of the each one of the item character strings that includes a match with the each one of the obtained brand character strings, based on determining that the obtained correlated segments are correlated, greater than a predetermined correlation threshold, with the each one of the obtained brand character strings; a hierarchy generator configured to generate a dictionary hierarchy based on the obtained correlated segments; a catalog input engine configured to obtain the catalog, the catalog including a list of items represented by item character strings; a brand expansion engine configured to request an expanded list of brand values based on an initial list of brand values associated with the category; and a category determination engine configured to request a determination of the category associated with the plurality of item character strings, the item input engine configured to obtain the plurality of item character strings, each item character string representing an item in the catalog, each item associated with the category, based on the determination of the category determined by the category determination engine. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. A method comprising:
-
obtaining a plurality of item character strings, each item character string representing an item in a catalog, each item associated with a category; obtaining a plurality of brand character strings associated with the category; determining via a device processor, for each item character string included in the plurality of item character strings, whether a matched substring of the each item character string represents a match with one of the obtained brand character strings; generating a matching indicator indicating that the each item character string includes a match with the one of the obtained brand character strings, based on a match result of the matching determination; obtaining, for each one of the obtained brand character strings, and for each one of the item character strings that includes a match with the each one of the obtained brand character strings, one or more correlated segments, other than the matched substring, of the each one of the item character strings that includes a match with the each one of the obtained brand character strings, based on determining that the obtained correlated segments are correlated, greater than a predetermined correlation threshold, with the each one of the obtained brand character strings; generating a dictionary hierarchy based on the obtained correlated segments; obtaining the catalog, the catalog including a list of items represented by item character strings; requesting an expanded list of brand values based on an initial list of brand values associated with the category; and requesting a determination of the category associated with the plurality of item character strings, the plurality of item character strings obtained, based on the determination of the determined category. - View Dependent Claims (12, 13, 14)
-
-
15. A computer program product tangibly embodied on a machine readable storage device and including executable code configured to cause one or more processors to:
-
obtain a plurality of item character strings, each item character string representing an item in a catalog, each item associated with a category; obtain a plurality of brand character strings associated with the category; determine, for each item character string included in the plurality of item character strings, whether a matched substring of the each item character string represents a match with one of the obtained brand character strings; generate a matching indicator indicating that the each item character string includes a match with the one of the obtained brand character strings, based on a match result of the matching determination; obtain, for each one of the obtained brand character strings, and for each one of the item character strings that includes a match with the each one of the obtained brand character strings, one or more correlated segments, other than the matched substring, of the each one of the item character strings that includes a match with the each one of the obtained brand character strings, based on determining that the obtained correlated segments are correlated, greater than a predetermined correlation threshold, with the each one of the obtained brand character strings; generate a dictionary hierarchy based on the obtained correlated segments; obtain the catalog, the catalog including a list of items represented by item character strings; request an expanded list of brand values based on an initial list of brand values associated with the category; and request a determination of the category associated with the plurality of item character strings, the plurality of item character strings obtained, based on the determination of the determined category. - View Dependent Claims (16, 17, 18, 19, 20)
-
Specification