Filtering in search engines
First Claim
1. A method executed by one or more computers, the method comprising:
- receiving a search query, the search query including a query label;
identifying, from a data store, one or more uniform resource locator (URL) patterns, each of the one or more URL patterns including a component of a URL and at least one of a wildcard or a regular expression, and each of the one or more URL patterns being associated with a label that matches the query label;
constructing a filter including;
determining a filter size based on a length of the one or more URL patterns and a count of a number of URL patterns having each respective length; and
constructing the filter having the filter size; and
filtering one or more results of the search query using the filter.
3 Assignments
0 Petitions
Accused Products
Abstract
In an embodiment, the size of a filter in a search engine is dynamically adjusted in order to reduce filter error. The filter, for example, may be a domain filter that contains domain(s) of preferred search results. A search engine processes the query and the domain filter to create a set of search results that are relevant to the query terms and for which a selected portion of the search results satisfy the domain filter, in that such portion of the documents comes from domains in the filter. The search results may then be provided to the user, as a set of search results that are relevant to the query terms and which include some portion of results that match the query labels. The construction of the domain filter may be done with a Bloom filter approach, using domain prefixes from annotations.
-
Citations
42 Claims
-
1. A method executed by one or more computers, the method comprising:
-
receiving a search query, the search query including a query label; identifying, from a data store, one or more uniform resource locator (URL) patterns, each of the one or more URL patterns including a component of a URL and at least one of a wildcard or a regular expression, and each of the one or more URL patterns being associated with a label that matches the query label; constructing a filter including; determining a filter size based on a length of the one or more URL patterns and a count of a number of URL patterns having each respective length; and constructing the filter having the filter size; and filtering one or more results of the search query using the filter. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14)
-
-
15. A system comprising:
one or more computers configured to perform operations comprising; receiving a search query, the search query including a query label; identifying, from a data store, one or more uniform resource locator (URL) patterns, each of the one or more URL patterns including a component of a URL and at least one of a wildcard or a regular expression, and each of the one or more URL patterns being associated with a label that matches the query label; constructing a filter including; determining a filter size based on a length of the one or more URL patterns and a count of a number of URL patterns having each respective length; and constructing the filter having the filter size; and filtering one or more results of the search query using the filter. - View Dependent Claims (16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 30, 31, 32)
-
29. Computer instructions stored on a computer-readable storage medium, the computer instructions configured to cause one or more computers to perform operations comprising:
-
receiving a search query, the search query including a query label; identifying, from a data store, one or more uniform resource locator (URL) patterns, each of the one or more URL patterns including a component of a URL and at least one of a wildcard or a regular expression, and each of the one or more URL patterns being associated with a label that matches the query label; constructing a filter including; determining a filter size based on a length of the one or more URL patterns and a count of a number of URL patterns having each respective length; and constructing the filter having the filter size; and filtering one or more results of the search query using the filter. - View Dependent Claims (33, 34, 35, 36, 37, 38, 39, 40, 41, 42)
-
Specification