Identifying related concepts of URLs and domain names

  • US 8,417,695 B2
  • Filed: 10/30/2009
  • Issued: 04/09/2013
  • Est. Priority Date: 10/30/2008
  • Status: Active Grant
First Claim

1. A computer implemented method comprising:

    using structural parsing to extract information from user input comprising a URL or domain name, the information comprising one or more of a protocol, a location, and a subdirectory;

    using semantic parsing of the information to identify a first one or more concepts represented by one or more tokens within the extracted information;

    determining whether the domain name can be mapped to one or more concepts in the concept association map by switching term positions or changing numbers;

    when the domain name can be mapped and if the mapped concepts have high score, identifying the concepts as seed concepts for further querying the concept association map;

    when the mapped concepts do not have a high enough score, or the domain name cannot be mapped, then determining whether the input domain name can be mapped to a concept in the concept association map by typographical error correction, the correction comprising one or more of insertion, deletion, and switching or replacement of 1 or 2 characters; and

    when the input domain name cannot be mapped by typographical error correction, or if concepts mapped as a result of typographical error correction do not have a high score, determining how to break the domain name into URL tokens by inserting separators at correction positions and correcting the tokens;

    querying a concept association map to retrieve a second one or more concepts related to the first one or more concepts, each of the concepts representing a unit of thought, expressed by a term, letter, or symbol, the concept association map comprising a representation of concepts, concept metadata, and relationships between the concepts;

    ranking the first one or more concepts and the second one or more concepts to create ranked concepts; and

    storing the ranked concepts for displaying to one or more users of the computer platform.
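
The flow recited in the claim can be illustrated with a minimal Python sketch. It is not the patented implementation. It covers only structural parsing, direct mapping, a single-character typographical correction fallback, querying for related concepts, and ranking. The term-switching, number-changing, two-character correction, and separator-insertion steps are omitted. The toy concept map, the names `CONCEPT_MAP`, `COMPACT_INDEX`, `structural_parse`, `domain_label`, `single_edits`, `map_to_concept`, and `related_concepts`, the 0.8 penalty for a corrected match, and the 0.5 "high score" threshold are all assumptions made for illustration.

```python
import string
from urllib.parse import urlsplit

# Hypothetical toy concept association map: concept -> {related concept: weight}.
CONCEPT_MAP = {
    "cheap flights": {"airline tickets": 0.9, "travel deals": 0.7},
    "airline tickets": {"cheap flights": 0.9},
    "travel deals": {"cheap flights": 0.7},
}
# Index each concept by its space-free spelling, as it would appear in a domain name.
COMPACT_INDEX = {concept.replace(" ", ""): concept for concept in CONCEPT_MAP}


def structural_parse(user_input):
    """Split a URL or bare domain name into protocol, location, and subdirectory."""
    if "://" not in user_input:
        user_input = "//" + user_input  # lets urlsplit treat the host as the netloc
    parts = urlsplit(user_input)
    return {
        "protocol": parts.scheme,
        "location": parts.netloc,
        "subdirectory": parts.path.strip("/"),
    }


def domain_label(location):
    """Return the first label of the host, ignoring a leading 'www'."""
    labels = location.lower().split(".")
    if labels and labels[0] == "www":
        labels = labels[1:]
    return labels[0] if labels else ""


def single_edits(word):
    """All strings one deletion, adjacent swap, replacement, or insertion away."""
    letters = string.ascii_lowercase
    splits = [(word[:i], word[i:]) for i in range(len(word) + 1)]
    deletes = [a + b[1:] for a, b in splits if b]
    swaps = [a + b[1] + b[0] + b[2:] for a, b in splits if len(b) > 1]
    replaces = [a + c + b[1:] for a, b in splits if b for c in letters]
    inserts = [a + c + b for a, b in splits for c in letters]
    return set(deletes + swaps + replaces + inserts)


def map_to_concept(label):
    """Return (concept, score): exact match first, then one-character correction."""
    if label in COMPACT_INDEX:
        return COMPACT_INDEX[label], 1.0
    for candidate in sorted(single_edits(label)):
        if candidate in COMPACT_INDEX:
            return COMPACT_INDEX[candidate], 0.8  # assumed penalty for a corrected match
    return None, 0.0


def related_concepts(user_input, high_score=0.5):
    """Seed with the mapped concept, query the map, and rank seed plus neighbours."""
    label = domain_label(structural_parse(user_input)["location"])
    seed, score = map_to_concept(label)
    if seed is None or score < high_score:
        return []
    scored = {seed: score}
    for neighbour, weight in CONCEPT_MAP[seed].items():
        scored[neighbour] = score * weight
    return sorted(scored.items(), key=lambda item: item[1], reverse=True)
```

Under these assumptions, calling `related_concepts("http://www.cheapflihgts.com/deals")` corrects the transposed letters, seeds the query with "cheap flights", and returns that concept followed by its neighbours in descending score order. An input whose label has no exact or one-edit match in the toy map yields an empty list.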

  • 6 Assignments