×

Content propagation for enhanced document retrieval

  • US 7,305,389 B2
  • Filed: 04/15/2004
  • Issued: 12/04/2007
  • Est. Priority Date: 04/15/2004
  • Status: Expired due to Fees
First Claim
Patent Images

1. A method providing computer-implemented content propagation for enhanced document retrieval, the method comprising:

  • identifying reference information directed to one or more documents, wherein the reference information identified from one or more sources of data, is independent from a data source comprising the one or more documents;

    extracting metadata that is proximally located to the reference information, which is surrounding the reference information and is semantically or contextually related to the reference information;

    calculating relevance between respective features of the metadata to content of associated ones of the one or more documents;

    indexing associated portions of the metadata with the relevance of features from the respective portions along with relevance scores, into original content of the document, for each document of the one or more documents,wherein the indexing generates one or more enhanced documents;

    analyzing the one or more enhanced documents to locate relevance information based on a search query;

    ranking the one or more enhanced documents based on relevance scores;

    communicating ranked results and snippet descriptions for the one or more enhanced documents, based on the search query;

    wherein the one or more sources of data comprise a search query log, and wherein calculating relevance further comprises;

    identifying search queries from the search query log, wherein the search queries have a relatively high frequency of occurrence (FOO) to search the data source;

    determining article(s) selected by an end-user from search query results, the article(s) being from the data source; and

    determining missing end-user selection(s), where a missing end-user selection is an article in the search query results that was not selected;

    wherein determining missing end-user selection(s) further comprises clustering heterogeneous objects using inter-layer links to determine importance measurements for features of the heterogeneous objects, the heterogeneous object comprising a first cluster of similar queries and a second cluster of related documents, the similar queries having been identified in the search query log, the similar queries being associated search result(s) comprising the one or more documents, the related documents being identified in the search result(s) independent of whether individual ones of the related documents were selected by an end-user from the search results.

View all claims
  • 2 Assignments
Timeline View
Assignment View
    ×
    ×