Scalable information search and retrieval including use of special purpose searching resources
First Claim
1. A method for searching for data with public search engines and special purpose search resources, comprising:
- identifying a general purpose public search engine;
identifying a special purpose search resource;
submitting a search query to the public search engine, and receiving a first result in response thereto;
submitting the search query to the special purpose search resource, and receiving a second response thereto;
merging said first and said second result into a combined search result;
determining user restrictions for characteristics of search results, said characteristics including selected ones of written language type, result origin, and language complexity;
identifying characteristics for a selected one of said combined search result;
comparing said identified characteristics to said user restrictions to identify restricted characteristics; and
if restricted characteristics are identified, then excluding said selected one from said combined result.
9 Assignments
0 Petitions
Accused Products
Abstract
The system and method for augmenting searching for data with public search engines, by submitting searches to special purpose search resources, such as search capabilities provided by private companies, universities, government web pages, and the like is described. Toward this end, a general purpose search engine is identified, and a special purpose search resource identified. A search query is submitted to the public search engine, and a first result is received in response thereto. The search query is also submitted to the special purpose search resource, and a second response is received thereto. These first and said second results are merged into a combined search result. Techniques for identifying, determining and recording use, and verify continued availability of special purpose search resources are also disclosed.
298 Citations
29 Claims
-
1. A method for searching for data with public search engines and special purpose search resources, comprising:
-
identifying a general purpose public search engine;
identifying a special purpose search resource;
submitting a search query to the public search engine, and receiving a first result in response thereto;
submitting the search query to the special purpose search resource, and receiving a second response thereto;
merging said first and said second result into a combined search result;
determining user restrictions for characteristics of search results, said characteristics including selected ones of written language type, result origin, and language complexity;
identifying characteristics for a selected one of said combined search result;
comparing said identified characteristics to said user restrictions to identify restricted characteristics; and
if restricted characteristics are identified, then excluding said selected one from said combined result.
-
-
2. A method for searching for data with public search engines and special purpose search resources, comprising:
-
identifying a general purpose public search engine;
identifying a collection of known special purpose search resources;
determining a responsiveness factor for each special purpose search resource in said collection;
comparing said responsiveness factor to a threshold value;
excluding special purpose search resources according to said comparing;
submitting a search query to the public search engine, and receiving a first result in response thereto;
submitting the search query to at least one of the non-excluded serial purpose search resources, and receiving a second response thereto; and
merging said first and said second result into a combined search result. - View Dependent Claims (3, 4, 5, 6, 7, 8, 9)
counting a number of duplicates within said combined search result so as to determine a relative reference ranking for duplicated search results; and
making the combined search result according to said determined relative reference ranking for duplicated search results.
-
-
4. The method of claim 2, in which the public search engine is provided by a search engine business, and wherein the special purpose search resource only searches within a restricted domain of sources directly affiliated with the special purpose resource.
-
5. The method of claim 2, wherein the public search engine searches an unrestricted domain of sources including sources unrelated to the public search engine, and
wherein the special purpose search resource only searches within a restricted domain of sources directly affiliated with the special purpose resource. -
6. The method of claim 2, wherein the public search engine searches within an arbitrary number of Internet domains, and the special purpose search resource searches within a restricted number of Internet domains.
-
7. The method of claim 2, further comprising:
-
associating a user-selectable control with a selected one of said combined search result; and
configuring the user-selectable control such that selection thereof causes a real-time distillation of said selected one into a condensed representation thereof.
-
-
8. The method of claim 7, further comprising:
-
providing a user interface; and
displaying said combined search result in the user interface.
-
-
9. The method of claim 8, wherein the user interface is a browser.
-
10. A method for searching for data with public search engines and special purpose search resources, comprising:
-
identifying a general purpose public search engine;
identifying a first special purpose search resource;
identifying a second special purpose search resource;
submitting a search query to the public search engine, and receiving a first result in response thereto;
submitting the search query to the first and the second special purpose search resource, and receiving a second response thereto;
merging said first and said second result into a combined search result;
determining that said first special purpose search resource and said second special purpose search resource return at least one duplicative search result;
computing responsiveness factors for said first and second special purpose search resource, and comparing said responsiveness factors so as to determine a slower special purpose resource; and
excluding the slower special purpose resource from subsequent searching.
-
-
11. A method for searching for data with public search engines and special purpose search resources, comprising:
-
identifying a confirmed special purpose search resource, wherein a process for determining at least one confirmed special purpose comprises the following steps;
performing a search with a search engine, said search utilizing terms commonly associated with searching capabilities provided by a network site;
receiving a first search result responsive to the search;
inspecting a data source for a selected one of said search result so as to determine an apparent search capabilities for said selected one;
submitting a test query to said apparent search capability; and
determining if a second search result is received in response to the test query, and if so, confirming said selected one is a special purpose search response; and
identifying a general purpose public search engine;
submitting a search query to the public search engine, and receiving a first result in response thereto;
submitting the search query to the confirmed special purpose search resource, and receiving a second response thereto;
merging said first and said second result into a combined search result. - View Dependent Claims (12, 13, 14, 15)
adding the confirmed special purpose search resource to a collection of confirmed special purpose search resource;
verifying whether said confirmed special purpose search resource provides search responses within a timeout period, and if not, incrementing an error counter; and
verifying whether the error counter exceeds a maximum error threshold, and if so, removing said confirmed special purpose search resource from said collection.
-
-
13. The method of claim 12, further comprising:
periodically testing said confirmed special purpose search resource for providing search responses within the timeout period, and if so, resetting the error counter to a zero-bound lesser value.
-
14. The method of claim 12, wherein each operation of the method for determining a confirmed special purpose search resource is performed automatically by a search server without manual intervention.
-
15. The method of claim 11, further comprising:
-
associating a characteristic profile with said confirmed special purpose search resource;
determining a search query network address to which subsequent searches for said confirmed special purpose search resource can be submitted; and
storing the search query network address within the characteristic profile;
wherein said submitting the search query to the special purpose search resource comprises retrieving said stored query network address and submitting the search query thereto.
-
-
16. A method for identifying a search resource, comprising:
-
performing a search with a general purpose search engine, said search utilizing terms commonly associated with searching capabilities provided by a network site;
receiving a first search result responsive to the search;
inspecting a source encoding for a selected one of said search result so as to determine an apparent search capability for said selected one;
submitting a test query to said apparent search capability, and determining if a second search result is received in response thereto;
wherein receipt of the second search result confirms a search capability for said selected one. - View Dependent Claims (17, 18, 19, 20)
determining a search query URI, for directly submitting subsequent searches to said confirmed search capability of said selected one.
-
-
19. The method of claim 16, wherein inspecting the source encoding comprises:
-
providing a pattern matching pattern for isolating a portion of the source encoding; and
applying said pattern to the source encoding so as to isolate the portion.
-
-
20. The method of claim 19, wherein said portion is a selected one of a title and a hypertext link.
-
21. A readable medium having encoded thereon instructions for searching for data with public search engines and special purpose search resources, said instructions, which when executed by a processor, are capable of directing the processor to:
-
identify a general purpose public search engine;
identify a special purpose search resource;
submit a search query to the public search engine, and receive a first result in response thereto;
submit the search query to the special purpose search resource, and receive a second response thereto;
merge said first and said second result into a combined search result;
determine user restrictions for characteristics of search results, said characteristics including selected ones of written language type, result origin, and language complexity;
identify characteristics for a selected one of said combined search result;
compare said identified characteristics to said user restrictions to identify restricted characteristics; and
determine if restricted characteristics are identified, and exclude said selected one from said combined result. - View Dependent Claims (22, 23, 24)
wherein the special purpose search resource only searches within a restricted domain of sources directly affiliated with the special purpose resource. -
24. The medium of claim 21, said programming instructions including further instructions to direct the processor to:
-
associate a user-selectable control with a selected one of said combined search result; and
configure the user-selectable control such that selection thereof causes a real-time distillation of said selected one into a condensed representation thereof.
-
-
-
25. A readable medium having encoded thereon instructions for searching for data with public search engines and special purpose search resources, said instructions, which when executed by a processor, are capable of directing the processor to:
-
identify a general purpose public search engine;
identify a collection of known special purpose search resources;
determine a responsiveness factor for each special purpose search resource in said collection;
compare said responsiveness factor to a threshold value;
exclude special purpose search resources according to said comparing;
submit a search query to the public search engine, and receive a first result in response thereto;
submit the search query to at least one of the non-excluded special purpose search resources, and receive a second response thereto; and
merge said first and said second result into a combined search result.
-
-
26. A readable medium having encoded thereon instructions for searching for data with public search engines and special purpose search resources, said instructions, which when executed by a processor, are capable of directing the processor to:
-
identify a general purpose public search engine;
identify a first special purpose search resource;
identify a second special purpose search resource;
submit a search query to the public search engine, and receive a first result in response thereto;
submit the search query to the first and the second special purpose search resource, and receive a second response thereto;
merge said first and said second result into a combined search result;
determine that said first special purpose search resource and said second special purpose search resource return at least one duplicative search result;
compute responsiveness factors for said first and second special purpose search resource, and compare said responsiveness factors so as to determine a slower special purpose resource; and
exclude the slower special purpose resource from subsequent searching.
-
-
27. A readable medium having encoded thereon instructions for searching for identifying a special purpose search resource, said instructions, which when executed by a processor, are capable of directing the processor to:
-
perform a search with a general purpose search engine, said search utilizing terms commonly associated with searching capabilities provided by a network site;
receive a first search result responsive to the search;
inspect a data source for a selected one of said search result so as to determine an apparent search capability for said selected one;
submit a test query to said approach search capability; and
determine if a second search result is received in response to the test query, said determining confirming a search capability for said selected one. - View Dependent Claims (28, 29)
determine a search query URL for directly entering subsequent searches to said confirmed search capability.
-
Specification