×

Method and system for determining sets of variant items

  • US 9,418,138 B2
  • Filed: 09/10/2015
  • Issued: 08/16/2016
  • Est. Priority Date: 12/22/2008
  • Status: Active Grant
First Claim
Patent Images

1. A computer-implemented method, comprising:

  • performing, by one or more computers having at least one processor and memory;

    accessing data that includes, for each individual item of individual ones of a plurality of items, a corresponding one or more text strings that describe, and are distinct from, the individual item;

    for each particular item of at least some of the individual ones of the plurality of items;

    for each of one or more other items of the individual ones of the plurality of items that are distinct from the particular item, comparing the one or more text strings describing the other item with the one or more text strings describing the particular item;

    based at least in part on said comparing, identifying at least one of the one or more other items that are each distinct from, but a potential variant of, the particular item;

    subsequent to said identifying, and for each identified other item of at least some of the one or more identified other items, generating an aligned pair, wherein one member of the aligned pair comprises the one or more text strings describing the identified other item, and the other member of the aligned pair comprises the one or more text strings describing the particular item, and wherein said generating comprises aligning the text in the one member with respect to the text in the other member; and

    subsequent to said generating, and for each aligned pair of at least some of the one or more aligned pairs;

    determining one or more misalignments between the text in one member of the aligned pair and the text in the other member of the aligned pair; and

    assigning a similarity score to the aligned pair, wherein the similarity score depends at least in part on the determined one or more misalignments, and indicates a degree of confidence that the particular item that corresponds to the one or more text strings of one member of the aligned pair, and the other item that corresponds to the one or more text strings of the other member of the aligned pair, are distinct variants of each other;

    based at least in part on a plurality of the generated aligned pairs, and on the similarity scores assigned to each of those aligned pairs, determining one or more variant sets of items from the plurality of items, wherein each variant set comprises multiple items of the plurality of items such that each item of the variant set is indicated to be a variant of a same item;

    generating a network-based page based at least in part on the determined one or more variant sets of items; and

    transmitting the generated network-based page over a communication network to a client computing device.

View all claims
  • 0 Assignments
Timeline View
Assignment View
    ×
    ×