×

System and methods for automatic clustering of ranked and categorized search objects

  • US 20100131563A1
  • Filed: 11/25/2008
  • Published: 05/27/2010
  • Est. Priority Date: 11/25/2008
  • Status: Abandoned Application
First Claim
Patent Images

1. A computer implemented method of presenting a search report identifying documents relevant to an input query text, said method comprising the steps of:

  • a) first determining a primary top-n set of documents corresponding to a query text, wherein said query text is provided through a user interface, wherein said first determining step is operative to match said query text against a plurality of terms stored in a database, wherein said plurality of terms correspond to anchor texts occurring within documents of an analyzed document collection, wherein said plurality of terms are associated with sets of document addresses identifying the documents of anchor text occurrence, and wherein said primary top-n set of documents correspond to those top ranked based on frequency of occurrence of the matched subset of said plurality of terms;

    b) second determining a set of keywords occurring within said primary top-n set of documents, wherein said database stores a pre-established keyword ontology with keyword associated ranking values determined with respect to said analyzed document collection, and wherein said pre-established keyword ontology includes said set of keywords;

    c) clustering said set of keywords into an ordered plurality of keyword lists dependent on a ranked relatedness determined by reference to said pre-established keyword ontology, said step of clustering including the iterative steps ofi) computing a unified keyword ranking for each of said set of keywords with respect to said primary top-n set of documents and said pre-established keyword ontology keyword associated ranking values;

    ii) selecting a top-n subset of said set of keywords based on said unified keyword ranking as a keyword cluster; and

    iii) removing said top-n subset from said set of keywords and repeating said step of clustering until a predetermined number of clusters are found or exhausting said set of keywords;

    d) presenting, through said user interface, said ordered plurality of keyword lists as categorized keyword lists.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×