System and method for improved searching on the internet or similar networks and especially improved MetaNews and/or improved automatically generated newspapers
1 Assignment
0 Petitions
Accused Products
Abstract
Google has recently made available at http://news.google.com an automated “newspaper”, which searches continuously about 4,500 news sources, and lets users view automatically generated headlines in a few general areas or lets users search for news by keywords. The automatic determination of which news items or news stories are most important is done by 3 main criteria: In how many sources the news item appeared, how important are the news sources in which it appeared, and how close it is to the top in each of these news sources. However, many problems still remain, such as for example: a. The choice of a single main news source and a single image for each item seems arbitrary to the user and limits the user. b. If the user clicks on the “related items” link for that item the user always gets a linear list of typically hundreds or even more than a thousand links to related news items, sorted either by relevance or by time, however, the new list is now without any images and without any clustering, so that many times news stories that are about the same event or even identical, may appear at different positions in the list of related links, and various other news items may appear between them and are typically also dispersed in various places. This makes it vary hard for the user to take advantage efficiently of the list of related items. The present invention solves the above problem by creating recursive clustering, so that preferably at any level in the tree the user can preferably either choose a specific news item from the cluster or from the shown sub-clusters or continue in the tree. Another improvement is that searching the Meta News by keywords can generate an automatic newspaper in a way similar to the original automatically generated newspaper. Many additional improvements to the concept of automated newspapers and/or news MetaSearch are also shown. Other improvements are suggested for improved shareware MetaSearch, improved Web pages search, and other types of searches.
-
Citations
59 Claims
-
1-20. -20. (canceled)
-
21. In an online search system, a method of improved News Meta-Search over a large number of Online news sources on the Internet or similar networks, comprising at least one of the following steps:
-
a. Switching between news items from the same cluster or sub-cluster displayed in a given position in an automatically generated newspaper page, wherein said switching is done automatically or with user intervention;
b. Switching between news images from the same cluster or sub-cluster displayed in a given position in an automatically generated newspaper page, wherein said switching is done automatically or with user intervention, and wherein said images are at least one of still images and streaming data;
c. Creating recursively sub-clusters of the displayed clusters or dub-clusters of news items that are related to a certain event, so that at least one of;
1. For each sub-cluster shown the user can either click on a chosen item from that cub-cluster or click on a link for seeing a list of additional items that belong to the sub-cluster. 2. When the user requests to see the list of additional items of the chosen sub-cluster, the new list can be again clustered similarly. 3. When the user requests to see the list of additional items of the cluster, the new list can be again clustered similarly. - View Dependent Claims (22, 25, 27, 35, 36, 42, 50)
-
-
23. (canceled)
-
24. (canceled)
-
26. (canceled)
-
28. In an online search system, an improved Online metasearch method comprising at least one of the following:
-
a. An improved Shareware Meta Search method wherein shareware programs appear in higher places in the search results according to how many of the included shareware sites list them, and at least one of the following;
In which position they are listed for the given searched keywords;
How important the shareware site is;
How many times they were already downloaded;
The shareware site'"'"'s rating for the shareware.b. An improved Online MIDI files Meta Search method wherein at least one of the following features exists;
After the system chooses a set of results that are sufficiently close to the search string, the system automatically sorts the song names by the most popular in descending order, and After choosing the desired file name, the system sorts available versions of that sons in descending order by the number of links available for each file size, so the user can reach immediately the desired MIDI file that has the best chance of being the best version of the desired song - View Dependent Claims (29)
-
-
30. (canceled)
-
31. An improved Online web pages search method comprising at least one of the following steps:
-
a. Taking into account the link relations between web pages for scoring the page but does not reduce the value of a link according to the number of other outgoing links in the linking pages, or reduces the value of a link according to the number of other outgoing links in the linking pages only slightly;
b. Improving slightly the rank for a page that has many outgoing links;
c. Taking into account usage statistics but uses it only for modifying the value of the link in the linking page but not for modifying directly the ranking of a page;
d. Taking into account usage statistics but uses it with one or more thresholds, so that usage lower than a certain factor does not continue to lower the score, and/or usage higher than a certain factor does not continue to increase the score;
e. Using also the anchor text of inbound links to determine the relevance of the linked page to the searched keywords and includes at least some semantic analysis of the anchor href text and/or also at least the surrounding or preceding nearby text, in order to be able to identify at least part of the meaning and/or avoid certain pitfalls that are relevant to the interpretation of the real meaning of the link;
f. Using also the anchor text of inbound links to determine the relevance of the linked page to the searched keywords and at least takes into account some basic language structures such as negation words or modifying words;
g. Allowing the user to define various parameters for scoring the results, wherein said parameters are at least one of;
The relative weight of usage statistics, the amount of reduction of the importance of a link as a result of the total number of links on the linking page, and, the amount of taking into consideration the newness of a web page so that less links to it are required;
h. Automatically identifying if a page is an alphabetic directory and gives higher weight to a link that is closer to the top of the page unless that page is an alphabetic directory;
i. Checking also if incoming links reside on the same IP address (even if the domain name is different) and their domain is owned by the same person or organization, in order to determine the value of the incoming links;
j. Taking into account the number of incoming links for each page and also the time factor of how long the page has existed is taken into account for determining the weight given to the number of links;
k. Taking automatically into account also the synonyms of the requested keywords, by at least one of;
1. Automatically including in the search results also pages that contain synonyms or close synonyms of the requested keywords. 2. Asking the user if he would like to include in the search results automatically also pages that contain close synonyms of the requested search keywords and remembers that as default for that user for following searches, and 3. Checking at least close synonyms of the user'"'"'s search keywords, and if there are more and/or better results with the synonyms then the system asks the user if he wants to switch over to the results of the search that was based on the synonyms, and/or asks the user if he wants to integrate the current results with the results of the search that was based on the synonyms;
l. Using semantic qualifiers when using keyword search for letting the search engine know that certain words are not part of the search string itself but are intended to act as the semantic qualifier;
m. Allowing the user to define words in the search string that are preferred but not necessary. - View Dependent Claims (33, 45, 47, 48, 52, 53, 54, 55, 57)
-
-
32. (canceled)
-
34. (canceled)
-
37-41. -41. (canceled)
-
43. (canceled)
-
44. (canceled)
-
46. (caneled).
-
49. (canceled)
-
51. (canceled)
-
56. (canceled)
-
58. (canceled)
-
59. (canceled)
Specification