×

Scatter-gather: a cluster-based method and apparatus for browsing large document collections

  • US 5,442,778 A
  • Filed: 11/12/1991
  • Issued: 08/15/1995
  • Est. Priority Date: 11/12/1991
  • Status: Expired due to Term
First Claim
Patent Images

1. A document browsing method in a digital computer for a corpus of documents, comprising the steps of:

  • preparing an initial ordering of the corpus into a first plurality of clusters by using a first method that automatically performs the initial ordering without external inputs based on contents of the documents using the digital computer;

    determining a summary for each cluster of the first plurality of clusters prepared by said initial ordering of the corpus;

    selecting by a user at least one cluster of the first plurality of clusters based on the summary of each cluster; and

    automatically providing a further ordering of the user selected at least one cluster into a second plurality of clusters by automatically analyzing contents of documents of the selected at least one cluster using a second method comprising the steps of;

    grouping together all of the documents from the selected at least one cluster based on the content of each document, and thenassigning each of the documents to one cluster of the second plurality of clusters.

View all claims
  • 5 Assignments
Timeline View
Assignment View
    ×
    ×