Auto generation of suggested links in a search system

US 8,005,816 B2
Filed: 02/28/2007
Issued: 08/23/2011
Est. Priority Date: 03/01/2006
Status: Active Grant

Abstract

A flexible and extensible architecture allows for secure searching across an enterprise. Such an architecture can provide a simple Internet-like search experience to users searching secure content inside (and outside) the enterprise. The architecture allows for the crawling and searching of a variety of sources across an enterprise, regardless of whether any of these sources conform to a conventional user role model. The architecture further allows for security attributes to be submitted at query time, for example, in order to provide real-time secure access to enterprise resources. The user query also can be transformed to provide for dynamic querying that provides for a more current result list than can be obtained for static queries.

221 Citations

20 Claims

1. A method of automatically generating suggested links in a search system, the method comprising:
- initiating a first crawl across an enterprise corpus owned by an enterprise;
  
  discovering during the first crawl a link pointing to a data source, the data source being mis-characterized during the first crawl as outside a boundary of the enterprise corpus owned by the enterprise;
  
  automatically storing the link as a first suggested link with other suggested links in a memory;
  
  initiating a second crawl across the enterprise corpus after the automatically storing, the second crawl having a different seed uniform resource locator (URL) or different boundary rules than the first crawl;
  
  encountering during the second crawl the data source actually within the same boundary of the enterprise corpus;
  
  removing, using a processor operatively coupled to the memory, the first suggested link from the other suggested links based on encountering the data source, previously characterized as outside the boundary of the enterprise corpus, within the same boundary of the enterprise corpus during the second crawl; and
  
  determining relevancy scoring for the other suggested links.
- View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
- - 2. A method according to claim 1, further comprising:
    - displaying a subset of the suggested links to a user in response to a query, the subset being determined by the relevancy scoring for the suggested links.
  - 3. A method according to claim 2, further comprising:
    - displaying the suggested links separate from results for the query.
  - 4. A method according to claim 1, further comprising:
    - configuring a crawler to conduct one of the first and second crawls with boundary rules.
  - 5. A method according to claim 1, further comprising:
    - selecting how many suggested links to display to the user.
  - 6. A method according to claim 1, further comprising:
    - auto-generating keywords for the suggested links.
  - 7. A method according to claim 6, wherein:
    - auto-generating keywords includes capturing anchor text associated with the suggested link.
  - 8. A method according to claim 6, wherein:
    - auto-generating keywords includes capturing text next to the suggested link.
  - 9. A method according to claim 6, wherein:
    - auto-generating keywords includes traversing the suggested link and capturing text from the traversed link.
  - 10. A method according to claim 9, wherein:
    - the captured text from the traversed link includes words from a title of a document associated with the link.
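
The steps of claim 1 can be illustrated with a minimal sketch. This is not the patented implementation: the in-memory `PAGES` graph, the `crawl` function, the host-based boundary rule, and the use of a discovery count as the relevancy score are all assumptions made for illustration, since the claims do not specify how boundaries are tested or how relevancy is scored.

```python
from collections import deque
from urllib.parse import urlparse

# Hypothetical link graph standing in for fetched pages: URL -> outgoing links.
PAGES = {
    "http://intranet.example.com/": [
        "http://wiki.example.com/start",
        "http://partner.example.org/portal",
    ],
    "http://wiki.example.com/start": ["http://intranet.example.com/"],
    "http://partner.example.org/portal": [],
}


def crawl(seed, allowed_hosts, suggested):
    """Breadth-first crawl from seed; updates suggested as {url: times discovered}."""
    visited, queue = set(), deque([seed])
    while queue:
        url = queue.popleft()
        if url in visited:
            continue
        if urlparse(url).hostname not in allowed_hosts:
            # Outside this crawl's boundary: store it as a suggested link.
            suggested[url] = suggested.get(url, 0) + 1
            continue
        # Inside the boundary: the source belongs to the corpus, so any
        # earlier suggested-link entry for it is removed.
        visited.add(url)
        suggested.pop(url, None)
        queue.extend(PAGES.get(url, []))
    return visited


suggested = {}

# First crawl: the wiki host is (mis-)characterized as outside the boundary.
crawl("http://intranet.example.com/", {"intranet.example.com"}, suggested)
after_first = dict(suggested)

# Second crawl: different seed URL and different boundary rules.
crawl(
    "http://wiki.example.com/start",
    {"intranet.example.com", "wiki.example.com"},
    suggested,
)

# Assumed relevancy score: how often each remaining link was discovered.
ranked = sorted(suggested, key=suggested.get, reverse=True)
max_links = 1  # how many suggested links to show (compare claim 5)
shown = ranked[:max_links]
```

In this sketch the wiki URL is stored as a suggested link after the first crawl and dropped after the second, leaving only the partner portal to be scored and shown.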

11. A computer program product embedded in a computer readable storage medium for automatically generating suggested links in a search system, comprising:
- program code for initiating a first crawl across an enterprise corpus owned by an enterprise;
  
  program code for discovering during the first crawl a link pointing to a data source, the data source being mis-characterized during the first crawl as outside a boundary of the enterprise corpus owned by the enterprise;
  
  program code for automatically storing the link as a first suggested link with other suggested links in a memory;
  
  program code for initiating a second crawl across the enterprise corpus after the automatically storing, the second crawl having a different seed uniform resource locator (URL) or different boundary rules than the first crawl;
  
  program code for encountering during the second crawl the data source within the same boundary of the enterprise corpus;
  
  program code for removing the first suggested link from the other suggested links based on the encountering the data source, previously characterized as outside the boundary of the enterprise corpus, within the same boundary of the enterprise corpus during the second crawl; and
  
  program code for determining relevancy scoring for the other suggested links.
- View Dependent Claims (12, 13, 14, 15, 16, 17, 18, 19)
- - 12. A computer program product according to claim 11, further comprising:
    - program code for displaying a subset of the suggested links to a user in response to a query, the subset being determined by the relevancy scoring for the suggested links.
  - 13. A computer program product according to claim 12, further comprising:
    - program code for displaying the suggested links separate from results for the query.
  - 14. A computer program product according to claim 11, further comprising:
    - program code for configuring a crawler to conduct one of the first and second crawls with boundary rules.
  - 15. A computer program product according to claim 11, further comprising:
    - program code for selecting how many suggested links to display to the user.
  - 16. A computer program product according to claim 11, further comprising:
    - program code for auto-generating keywords for the suggested links.
  - 17. A computer program product according to claim 16, wherein:
    - program code for auto-generating keywords includes program code for capturing anchor text associated with the suggested link.
  - 18. A computer program product according to claim 16, wherein:
    - program code for auto-generating keywords includes program code for capturing text next to the suggested link.
  - 19. A computer program product according to claim 16, wherein:
    - program code for auto-generating keywords includes program code for traversing the suggested link and capturing text from the traversed link.
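
Keyword auto-generation from anchor text and from the title of the traversed document (claims 7, 9, 10 and 17, 19) might look roughly like the following. The class and function names, the sample HTML, and the lowercase whitespace tokenization are hypothetical; the claims do not prescribe a parser or a tokenization scheme. Only Python's standard `html.parser` module is used.

```python
from html.parser import HTMLParser


class AnchorTextParser(HTMLParser):
    """Collects the words inside each <a href="..."> element."""

    def __init__(self):
        super().__init__()
        self.anchors = {}   # href -> list of anchor-text words
        self._href = None   # href of the <a> element currently open

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            self._href = dict(attrs).get("href")

    def handle_endtag(self, tag):
        if tag == "a":
            self._href = None

    def handle_data(self, data):
        if self._href:
            self.anchors.setdefault(self._href, []).extend(data.lower().split())


class TitleParser(HTMLParser):
    """Collects the words of the document's <title>."""

    def __init__(self):
        super().__init__()
        self.title_words = []
        self._in_title = False

    def handle_starttag(self, tag, attrs):
        if tag == "title":
            self._in_title = True

    def handle_endtag(self, tag):
        if tag == "title":
            self._in_title = False

    def handle_data(self, data):
        if self._in_title:
            self.title_words.extend(data.lower().split())


def keywords_for(link, referring_html, target_html):
    """Union of anchor-text words (referring page) and title words (target page)."""
    anchor_parser = AnchorTextParser()
    anchor_parser.feed(referring_html)
    title_parser = TitleParser()
    title_parser.feed(target_html)
    return sorted(set(anchor_parser.anchors.get(link, [])) | set(title_parser.title_words))


# Hypothetical sample pages.
referring = '<p>See the <a href="http://partner.example.org/portal">Partner Portal</a> for orders.</p>'
target = "<html><head><title>Order Tracking Portal</title></head><body></body></html>"
keywords = keywords_for("http://partner.example.org/portal", referring, target)
```

Capturing text next to the link (claims 8 and 18) is omitted here; it would need an additional rule for how much surrounding text counts as adjacent.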

20. A computer system for automatically generating suggested links in a search system, comprising:
- at least one or more processors; and
  
  a memory operatively coupled with the one or more processors, the at least one or more processors executing instructions set forth in a computer program for:
  
  initiating a first crawl across an enterprise corpus owned by an enterprise;
  
  discovering during the first crawl a link pointing to a data source, the data source being mis-characterized during the first crawl as outside a boundary of the enterprise corpus owned by the enterprise;
  
  automatically storing the link as a first suggested link with other suggested links in a memory;
  
  initiating a second crawl across the enterprise corpus after the automatically storing, the second crawl having a different seed uniform resource locator (URL) or different boundary rules than the first crawl;
  
  encountering during the second crawl the data source actually within the same boundary of the enterprise corpus;
  
  removing the first suggested link from the other suggested links based on encountering the data source, previously characterized as outside the boundary of the enterprise corpus, within the same boundary of the enterprise corpus during the second crawl; and
  
  determining relevancy scoring for the other suggested links.

Current Assignee: Oracle International Corporation (Oracle Corporation)
Original Assignee: Oracle International Corporation (Oracle Corporation)
Inventors: Krishnaprasad, Muralidhar, Chang, Thomas
Primary Examiner(s): Ali; Mohammad
Assistant Examiner(s): Willis; Amanda
Application Number: US11/680,510
Publication Number: US 20070208713A1
Time in Patent Office: 1,637 Days
Field of Search: 707/709, 707/999.003, 707/E17.008, 707/737, 707/710, 709/203, 709/219
US Class Current: 707/709
CPC Class Codes:
G06F 16/20 of structured data, e.g. re...
G06F 16/951 Indexing; Web crawling tech...
