DICTIONARY FOR HIERARCHICAL ATTRIBUTES FROM CATALOG ITEMS
Abstract
A plurality of items included in a catalog may be obtained, each item associated with an item category. Brand indicators may be obtained, each brand indicator associated with the item category. Brand indicators associated with each of the items may be determined, and the each item may be assigned to a partition group associated with the brand indicator that is associated with the each item. Correlated string tokens that are correlated, greater than a predetermined correlation threshold value, with the brand indicator associated with the partition group that is associated with the each one of the items, the correlated string tokens associated with the each one of the plurality of items, may be determined. A dictionary hierarchy may be generated based on the one or more correlated string tokens.
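The flow described in the abstract can be sketched roughly as follows. This is a minimal illustration, not the patented implementation: the function name `build_dictionary_hierarchy`, the whitespace tokenization, the `min_support` filter, and the default threshold of 0.8 are all assumptions. In particular, the source does not define the correlation measure; the sketch assumes it is the fraction of a token's occurrences (among brand-matched items) that fall inside one brand's partition group.

```python
from collections import Counter, defaultdict
from typing import Dict, Iterable, List


def build_dictionary_hierarchy(
    items: Iterable[str],
    brands: Iterable[str],
    threshold: float = 0.8,   # hypothetical "predetermined correlation threshold"
    min_support: int = 2,     # hypothetical filter against one-off tokens
) -> Dict[str, List[str]]:
    """Map each brand to tokens that co-occur with it above `threshold`."""
    brands = list(brands)

    # Assign each item to the partition group of the first brand it mentions.
    partitions: Dict[str, List[str]] = defaultdict(list)
    for item in items:
        lowered = item.lower()
        for brand in brands:
            if brand.lower() in lowered:
                partitions[brand].append(item)
                break

    # Count how many items contain each token, overall and per partition group.
    total: Counter = Counter()
    per_brand: Dict[str, Counter] = defaultdict(Counter)
    for brand, group in partitions.items():
        brand_tokens = set(brand.lower().split())
        for item in group:
            tokens = set(item.lower().split()) - brand_tokens
            total.update(tokens)
            per_brand[brand].update(tokens)

    # Assumed correlation: share of a token's occurrences inside the brand group.
    hierarchy: Dict[str, List[str]] = {}
    for brand, counts in per_brand.items():
        hierarchy[brand] = sorted(
            tok
            for tok, n in counts.items()
            if n >= min_support and n / total[tok] > threshold
        )
    return hierarchy


catalog = [
    "Canon PowerShot A590 camera",
    "Canon PowerShot SX10 camera",
    "Nikon Coolpix S210 camera",
    "Nikon Coolpix P80 camera",
]
hierarchy = build_dictionary_hierarchy(catalog, ["Canon", "Nikon"])
```

In this toy catalog, "camera" appears equally under both brands and so falls below the threshold, while tokens that appear only under one brand are kept as second-level entries beneath that brand.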
20 Claims
1. A system comprising:
a dictionary manager embodied via executable instructions stored on a computer-readable storage medium, the dictionary manager including:
an item input engine configured to obtain a plurality of item character strings, each item character string representing an item in a catalog, each item associated with a category;
a brand input engine configured to obtain a plurality of brand character strings associated with the category;
a brand matching engine configured to determine, for each item character string included in the plurality of item character strings, whether a matched substring of the each item character string represents a match with one of the obtained brand character strings;
a matching indicator engine configured to generate a matching indicator indicating that the each item character string includes a match with the one of the obtained brand character strings, based on a match result of the matching determination by the brand matching engine;
a correlated segment engine configured to obtain, for each one of the obtained brand character strings, and for each one of the item character strings that includes a match with the each one of the obtained brand character strings, one or more correlated segments, other than the matched substring, of the each one of the item character strings that includes a match with the each one of the obtained brand character strings, based on determining that the obtained correlated segments are correlated, greater than a predetermined correlation threshold, with the each one of the obtained brand character strings; and
a hierarchy generator configured to generate a dictionary hierarchy based on the obtained correlated segments.
(Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10, 11)
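As an illustration of the brand matching engine and matching indicator engine recited in claim 1, the following sketch finds a brand substring in an item string and returns an indicator together with the segments other than the matched substring. The names `MatchResult` and `match_brand`, the case-insensitive whole-word matching, and the longest-brand-first ordering are assumptions made for the example and are not taken from the claim.

```python
import re
from typing import Iterable, List, NamedTuple, Optional


class MatchResult(NamedTuple):
    matched: bool                  # the "matching indicator"
    brand: Optional[str]           # which brand string matched, if any
    remaining_segments: List[str]  # item segments other than the matched substring


def match_brand(item: str, brands: Iterable[str]) -> MatchResult:
    """Return the first brand found in `item`, trying longer brand names first."""
    for brand in sorted(brands, key=len, reverse=True):
        # Whole-word, case-insensitive match (an assumed matching rule).
        pattern = r"(?<!\w)" + re.escape(brand) + r"(?!\w)"
        m = re.search(pattern, item, flags=re.IGNORECASE)
        if m:
            rest = (item[: m.start()] + " " + item[m.end():]).split()
            return MatchResult(True, brand, rest)
    return MatchResult(False, None, item.split())


hit = match_brand("Canon PowerShot A590", ["Canon", "Nikon"])
miss = match_brand("Generic tripod", ["Canon", "Nikon"])
```

The remaining segments are the candidates that a correlated segment engine would then test against the correlation threshold.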
12. A method comprising:
obtaining a plurality of items included in a catalog, each item associated with an item category;
obtaining a plurality of brand indicators, each brand indicator associated with the item category;
for each one of the plurality of items, determining one of the obtained brand indicators that is associated with the each one of the plurality of items, and assigning the each one of the plurality of items to a partition group of items that are associated with the determined brand indicator;
determining, via a device processor, one or more correlated string tokens that are correlated, greater than a predetermined correlation threshold value, with the brand indicator associated with the partition group that is associated with the each one of the plurality of items, the correlated string tokens associated with the each one of the plurality of items; and
generating a dictionary hierarchy based on the one or more correlated string tokens.
(Dependent claims: 13, 14, 15, 16, 17)
18. A computer program product tangibly embodied on a computer-readable storage medium and including executable code that, when executed, causes at least one data processing apparatus to:
obtain a plurality of items included in a catalog, each item associated with an item category;
obtain a plurality of hierarchical attribute types associated with the plurality of items;
obtain a plurality of first attribute indicators, each first attribute indicator associated with the item category;
for each one of the plurality of items, determine one of the obtained first attribute indicators that is associated with the each one of the plurality of items, and assign the each one of the plurality of items to a partition group of items that are associated with the determined first attribute indicator;
determine one or more correlated string tokens that are correlated, greater than a predetermined correlation threshold value, with the first attribute indicator associated with the partition group that is associated with the each one of the plurality of items, the correlated string tokens associated with the each one of the plurality of items; and
generate a dictionary hierarchy based on the one or more correlated string tokens.
(Dependent claims: 19, 20)
Specification