Methods and systems for organizing electronic documents
First Claim
Patent Images
1. A method for organizing electronic documents, said method comprising:
- generating a list of weighted keywords for one or more documents;
clustering related documents together based on a comparison of said weighted keywords; and
linking together portions of documents within a cluster based on a comparison of said weighted keywords.
2 Assignments
0 Petitions
Accused Products
Abstract
A method for organizing electronic documents may include generating a list of weighted keywords for each document, clustering related documents together based on a comparison of the weighted keywords, and linking together portions of documents within a cluster based on a comparison of the weighted keywords.
-
Citations
61 Claims
-
1. A method for organizing electronic documents, said method comprising:
-
generating a list of weighted keywords for one or more documents;
clustering related documents together based on a comparison of said weighted keywords; and
linking together portions of documents within a cluster based on a comparison of said weighted keywords. - View Dependent Claims (2, 3, 4)
-
-
5. A method for generating keywords for a document, said method comprising:
-
identifying a plurality of words in the document;
identifying a role of each word;
computing a word weight for each word based on the role and position of the word in said document; and
selecting a number of keywords based on computed word weights. - View Dependent Claims (6, 7, 8, 9, 10, 11, 12, 13, 14, 15)
-
-
16. A method of generating a summary for documents using weighted keywords from a document keyword list, each keyword having a word weight, said method comprising:
-
counting a number of keyword occurrences in each sentence;
computing a sentence weight for each sentence based on said number of keyword occurences; and
generating a summary for a document containing one or more of sentences from said document that are selected based on said sentence weights. - View Dependent Claims (17, 18, 19, 20)
-
-
21. A method for clustering a plurality of documents, each document having an associated keyword list containing keywords, each keyword having an associated word weight, said method comprising:
-
locating at least one keyword shared by at least two documents of said plurality of documents;
calculating a shared word weight; and
clustering documents with a shared word weight above a specified threshold.
-
-
22. A method for associating at least two text units, each text unit containing one or more weighted keywords, said method comprising:
-
defining a plurality of text units to compose a corpus of text units;
calculating a text unit relevancy metric for each text unit based on a comparison of said weighted keywords; and
selectively linking text units based on said text unit relevancy metrics. - View Dependent Claims (23, 24, 25, 26)
-
-
27. A program stored on a medium for storing computer-readable instructions, said program, when executed, causing a host device to:
-
analyze one or more documents;
generate a list of weighted keywords for each document;
cluster related documents together based on said weighted keywords; and
link together portions of clustered documents based on occurrences of said weighted keywords. - View Dependent Claims (28, 29, 30, 31, 32, 33, 34, 35, 36, 37)
-
-
38. A program stored on a medium for storing computer-readable instructions, said program, when executed, causing a host device to:
-
count a number of keyword occurrences in each sentence of a document;
compute a sentence weight for each of sentence; and
generate a summary for the document containing one or more sentences from said document based on said sentence weights. - View Dependent Claims (39, 40, 41, 42, 43, 44, 45, 46)
-
-
47. A system for organizing electronic documents, said system comprising:
-
means for generating a list of weighted keywords for each document;
means for clustering related documents together based on said weighted keywords; and
means for linking together corresponding portions of said documents within a cluster based on said weighted keywords. - View Dependent Claims (48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61)
-
Specification