System and method for improved searching on the internet or similar networks and especially improved MetaNews and/or improved automatically generated newspapers

  • US 8,589,373 B2
  • Filed: 09/14/2004
  • Issued: 11/19/2013
  • Est. Priority Date: 09/14/2003
  • Status: Active Grant
First Claim

1. A method for an improved News Meta-Search over a large number of Online news sources on the Internet or similar networks, comprising providing a meta-search system which includes at least one server, and displaying news items to a user through a browser on a computer, wherein the server performs, under software instruction from the meta-search system, at least one of the steps of:

    i. Switching between news items from the same cluster or sub-cluster which are displayed in a given position in an automatically generated newspaper page, wherein said switching is done automatically or with user intervention; and

    ii. Switching between news images from the same cluster or sub-cluster which are displayed in a given position in an automatically generated newspaper page, wherein said switching is done automatically or with user intervention, and wherein said images are at least one of still images and streaming data;

    wherein at least one of the following features exists;

    a. Recursive sub-clustering is performed and the recursive sub-clustering continues until there are sufficiently few items in the final sub-category or until the items are too different to group further;

    b. If the user searches for keywords in the News Meta Search, the results are displayed recursively in clusters and sub-clusters in a way similar to the automatically generated newspaper page;

    c. If the user searches for keywords in the News Meta Search, the results can have all the features that exist in the automatically generated newspaper page;

    d. The system enables the user to switch between a mode that displays also images and a mode without images;

    e. The same news item or same sub-cluster can belong to more than one cluster or sub-cluster, and thus it is shown and/or can be reached from all the sufficiently relevant clusters or sub-clusters to which it is related;

    f. The system enables the user to request to sort a list of related items by relevance and/or by time and date to create order between and/or within the sub-clusters, so that the system performs the sorting without interfering with the cluster structure itself;

    g. The system enables the user to request to sort the items by at least one of;

        1. The country of the source, so that the system orders or clusters the news items in addition or instead also according to the country of the news source,

        2. The level of reliability of the source, so that the system orders or clusters the news items in addition or instead also according to the reliability of the news source;

    h. The system enables the user to view a graphical or textual hierarchical representation which shows simultaneously the multi-level structure of clusters and sub-clusters, showing more than two levels of the hierarchy at the same time, or showing the structure down to the end-nodes;

    i. The Meta News system automatically chooses only images that are within a certain reasonable range of sizes;

    j. As additional new related news items come in, the headlines and/or the images can be automatically updated even if the user does not click on any refresh button;

    k. The user gets a different indication when the items or images themselves have changed or new items or images are brought in (compared to the normal swapping between items), and said indication is at least one of sound indication and visual indication of the item that has changed or the new item that has been inserted;

    l. The html protocol and/or the html command set is expanded to allow an image to be requested with a given size limit, so that if the original image is bigger it is either truncated automatically to fit in the allowed window, or is automatically downscaled in order to fit completely into the allowed space;

    m. The html protocol and/or the html command set is expanded to allow an image to be requested with a given size limit, so that if the original image is bigger it is truncated automatically to fit in the allowed window and for said truncation the improved html protocol allows the web programmer to specify for each image the x-y coordinates of its central point of interest, and/or various heuristics are used by the browser or by the server in order to find the central point of interest automatically;

    n. When switching images contain also streaming data, at least one of the following is done;

        1. Automatic switching of images is disabled so that the user has to click on something in order to view related streaming data from a different source or other still images, and

        2. Each streaming source remains in the position for a longer time than still images until switching to the next streaming source or to the next still image;

    o. The system determines which item to use as the main item of the general cluster by at least one of;

        1. First picking the sub-cluster that has the largest number of items and/or the most recent cluster that is big enough relative to other sub-clusters,

        2. Picking the item within the chosen first sub-cluster which has the highest average similarity to other items in that sub-cluster and/or belongs to the largest sub-cluster of that sub-cluster and/or is most relevant within the cluster or within the sub-cluster and/or is most recent within the cluster or within the sub-cluster;

    p. When requesting News alerts, instead of being able to request only by specific keywords, the system enables the user to also at least one of;

        1. Mark a cluster or a specific sub-cluster, so that he/she is notified automatically on any new items that belong to that cluster or after sufficient changes have accumulated in the cluster,

        2. Use semantic qualifiers,

        3. Mark words in a way that indicates that synonyms should also be checked for these words, so that he/she will be notified also about items that contain synonyms of these marked words;

    and wherein at least one of the following features exists;

    q. In order to improve the clustering ability, the time the items were published is taken into account, with the assumption that the closer the time of publication between them, the higher the chance that two items are dealing with the same event;

    r. Temporal words or phrases used in the news item are used to decide when the event occurred, and this time is used to separate between news items that occurred before this time and items that occurred after this time and/or to help decide the similarity between items that might be referring to the same event;

    s. Temporal words or phrases used in the news item are used to decide when the event occurred, and in order to analyze the temporal phrases used in the item, the system is able to perform also at least some minimal type of semantic analysis and/or has at least knowledge of the relevant temporal nouns and relevant verbs; and

    t. When sorting automatically generated news clusters the number of items in each cluster is normalized by the time factor, since clusters that have existed for a longer time would normally have more items than a newer cluster even if the new cluster is more important.
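
For illustration only, features (a) and (t) of the claim can be sketched in a few lines of Python. This is a minimal sketch under stated assumptions, not the patented implementation: the claim does not name a similarity measure or any thresholds, so the Jaccard word-overlap measure, the greedy grouping pass, and the names `Item`, `Cluster`, `subcluster`, `rank_clusters`, `threshold`, `max_leaf`, `step` and `age_hours` are all hypothetical choices made for this example.

```python
from dataclasses import dataclass, field
from typing import List, Set


@dataclass
class Item:
    title: str
    words: Set[str]  # hypothetical: keywords already extracted from the item


@dataclass
class Cluster:
    items: List[Item]
    children: List["Cluster"] = field(default_factory=list)
    age_hours: float = 1.0  # hypothetical: how long the cluster has existed


def jaccard(a: Set[str], b: Set[str]) -> float:
    """Word-overlap similarity (an assumption; the claim names no measure)."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def group(items: List[Item], threshold: float) -> List[List[Item]]:
    """Greedy pass: an item joins the first group whose seed is similar enough."""
    groups: List[List[Item]] = []
    for item in items:
        for g in groups:
            if jaccard(item.words, g[0].words) >= threshold:
                g.append(item)
                break
        else:
            groups.append([item])
    return groups


def subcluster(items: List[Item], threshold: float = 0.2,
               max_leaf: int = 2, step: float = 0.2) -> Cluster:
    """Feature (a): recurse until few items remain or they no longer group."""
    node = Cluster(items=list(items))
    if len(items) <= max_leaf or threshold > 1.0:
        return node  # sufficiently few items in the final sub-category
    groups = group(items, threshold)
    if len(groups) == len(items):
        return node  # every item stands alone: too different to group further
    if len(groups) == 1:
        # Everything is similar at this level; retry with a stricter threshold.
        return subcluster(items, threshold + step, max_leaf, step)
    node.children = [subcluster(g, threshold + step, max_leaf, step)
                     for g in groups]
    return node


def rank_clusters(clusters: List[Cluster]) -> List[Cluster]:
    """Feature (t): sort by item count normalized by cluster age."""
    return sorted(clusters,
                  key=lambda c: len(c.items) / max(c.age_hours, 1.0),
                  reverse=True)
```

In this sketch a two-hour-old cluster with 4 items (2 items per hour) ranks above a day-old cluster with 12 items (0.5 items per hour), which is the effect feature (t) describes. Tightening the threshold at each level is only one possible way to decide when items are "too different to group further".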
