×

Content data indexing and result ranking

  • US 7,987,189 B2
  • Filed: 08/20/2007
  • Issued: 07/26/2011
  • Est. Priority Date: 07/01/2002
  • Status: Active Grant
First Claim
Patent Images

1. In a computing system having access to multiple content entities, each content entity including searchable content, a method for building a searchable content index for searching and retrieving content entities in an efficient manner that returns results of content entities expected to be found, the method comprising:

  • identifying searchable data within each of a plurality of content entities;

    dividing text portions of the searchable data within each of the plurality of content entities into words and tokens, and storing each of the words and tokens in a first table;

    removing from the first table each duplicate word and token;

    applying an alternative word set to the first table after each duplicate word and token has been removed, wherein applying the alternative word set to the table includes adding to the first table alternative words associated with one or more of the words or tokens in the first table;

    identifying all possible combinations of the words from the plurality of content entities; and

    creating a second table, wherein the second table is a double word table that includes only all possible unique two word combinations of words from the plurality of content entities.

View all claims
  • 3 Assignments
Timeline View
Assignment View
    ×
    ×