×

Enhancing sitelinks with creative content

  • US 10,650,066 B2
  • Filed: 03/15/2013
  • Issued: 05/12/2020
  • Est. Priority Date: 01/31/2013
  • Status: Active Grant
First Claim
Patent Images

1. A computer-implemented method of matching sitelinks with content items, comprising:

  • identifying, by a data processing system having one or more processors and memory, a primary result associated with a content provider, responsive to a search query from a client device;

    identifying, by the data processing system, for presentation with the primary result, a sitelink associated with the content provider for the primary result and including a sitelink uniform resource locator (URL) indexed in a sitelink database, the sitelink URL including a sitelink parameter;

    generating, by the data processing system, a canonicalized sitelink URL for the sitelink URL by removing the sitelink parameter;

    identifying, by the data processing system, for presentation with the primary result, from a content item database for the content provider, a plurality of content items associated with;

    a first URL including a first parameter, a second URL including a second parameter, and a third URL including a third parameter;

    generating, by the data processing system, a first canonicalized URL for the first URL by removing the first content item parameter;

    generating, by the data processing system, a second canonicalized URL for the second URL by removing the second content item parameter;

    generating, by the data processing system, a third canonicalized URL for the third URL by removing the third content item;

    determining, by the data processing system, that the first canonicalized URL is the same as the second canonicalized URL but differs from the third canonicalized URL;

    identifying, by the data processing system, both the first canonicalized URL and the second canonicalized URL as associated with a first content item from the plurality of content items based on determining that the first canonicalized URL is the same as the second canonicalized URL;

    identifying, by the data processing system, the third canonicalized URL as associated with a second content item different from the first content item from the plurality of content items based on determining that the first canonicalized URL differs from the third canonicalized URL;

    generating, by the data processing system, a first content item URL group for the first content item including the first canonicalized URL and the second canonicalized URL and a second content item URL group for the second content item including the third canonicalized URL, responsive to identifying both the first canonicalized content item URL and the second canonicalized URL as associated with the first content item and identifying the third canonicalized URL as associated with the second content item;

    determining, by the data processing system, a first score indicating a similarity between text of the first content item for the first content item URL group and text of the sitelink and a second score indicating a similarity between text of the second content item for the second content item URL group and the text of the sitelink;

    matching, by the data processing system, the sitelink with the first content item based on a comparison between the first score and the second score; and

    generating, by the data processing system, for presentation on the client device responsive to the search query, a results interface page associated with the content provider including the primary result and one or more secondary results including the sitelink, and the text of the first content item.

View all claims
  • 2 Assignments
Timeline View
Assignment View
    ×
    ×