Search result sub-topic identification system and method
First Claim
1. A method of operating a computer system environment for the processing of search results which match a search query to provide for sub-topic identification of the search results, the method comprising:
- receiving a search result;
extracting snippets from said search result that contain said query;
truncating snippets on an instance of a boundary token;
identifying phrases within said snippets that include the query;
comparing all said phrases to determine optimal phrases; and
presenting said optimal phrases;
wherein said comparing all said phrases comprises comparisons between a first phrase and a second phrase, wherein said comparisons between combinations of two phrases comprises;
skipping comparisons where said first phrase starts with the query term and said second phrase ends with the query term;
eliminating a first phrase that is a superstring of said second phrase if said first phrase has a lower frequency of occurrence than said second phrase; and
eliminating said first phrase that is a substring of said second phrase if said first phrase has the same frequency of occurrence as said second phrase.
3 Assignments
0 Petitions
Accused Products
Abstract
A method and apparatus for sub-topic identification from a search result that matches a query, said method including the steps of receiving a search result, extracting snippets from said search result that contain said query, truncating snippets on an instance of a boundary token, identifying phrases within said snippets that include the query, comparing all said phrases to determine optimal phrases, and presenting said optimal phrases. The apparatus for sub-topic identification from a search result that matches a query may include a dedicated server or a proxy for processing the search and sub-topic query.
-
Citations
22 Claims
-
1. A method of operating a computer system environment for the processing of search results which match a search query to provide for sub-topic identification of the search results, the method comprising:
-
receiving a search result; extracting snippets from said search result that contain said query; truncating snippets on an instance of a boundary token; identifying phrases within said snippets that include the query; comparing all said phrases to determine optimal phrases; and presenting said optimal phrases; wherein said comparing all said phrases comprises comparisons between a first phrase and a second phrase, wherein said comparisons between combinations of two phrases comprises; skipping comparisons where said first phrase starts with the query term and said second phrase ends with the query term; eliminating a first phrase that is a superstring of said second phrase if said first phrase has a lower frequency of occurrence than said second phrase; and eliminating said first phrase that is a substring of said second phrase if said first phrase has the same frequency of occurrence as said second phrase. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22)
-
Specification