Method and system for two-dimensional visualization of an information taxonomy and of text documents based on topical content of the documents
First Claim
1. A method for visually representing the semantic relatedness between a predetermined plurality of classes and a plurality of documents, each document stored in a computer document retrieval system, said plurality of documents collectively comprising a plurality of terms in computer-readable format, each document having a tag representing the topical relatedness of said document to each said class, said method comprising the steps of:
- generating a semantic space map in response to said terms and said tag of each document of said plurality of documents, said semantic space map representing the position in a plurality of dimensions of each class relative to every other said class, the spatial distance between said position of said each class and said every other class corresponding to the semantic relatedness of said each class to said every other class;
populating said semantic space map in response to said plurality of documents, said populated semantic space map representing the position in a plurality of dimensions of each class relative to every other class and of each document relative to each class, the spatial distance between said position of said each document relative to said each class corresponding to the semantic relatedness of said each document to said each class; and
displaying a visual representation of at least a portion of said populated semantic space map.
0 Assignments
0 Petitions
Accused Products
Abstract
A method and system for aiding users in visualizing the relatedness of retrieved text documents and the topics to which they relate comprises training a classifier by semantically analyzing an initial group of manually-classified documents, positioning the classes and documents in two-dimensional space in response to semantic associations between the classes, and displaying the classes and documents. The displayed documents may be retrieved by an information storage and retrieval subsystem in any suitable manner.
-
Citations
22 Claims
-
1. A method for visually representing the semantic relatedness between a predetermined plurality of classes and a plurality of documents, each document stored in a computer document retrieval system, said plurality of documents collectively comprising a plurality of terms in computer-readable format, each document having a tag representing the topical relatedness of said document to each said class, said method comprising the steps of:
-
generating a semantic space map in response to said terms and said tag of each document of said plurality of documents, said semantic space map representing the position in a plurality of dimensions of each class relative to every other said class, the spatial distance between said position of said each class and said every other class corresponding to the semantic relatedness of said each class to said every other class; populating said semantic space map in response to said plurality of documents, said populated semantic space map representing the position in a plurality of dimensions of each class relative to every other class and of each document relative to each class, the spatial distance between said position of said each document relative to said each class corresponding to the semantic relatedness of said each document to said each class; and displaying a visual representation of at least a portion of said populated semantic space map. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14)
-
-
15. A system for displaying a visual representation of documents and classes to which said documents relate, comprising:
-
a document retrieval subsystem having a user input device for receiving a user query, a user output device for displaying graphical representations of documents and a predetermined plurality of classes, and memory for storing said documents, each said document having a topical association with one of said classes, said document retrieval system providing retrieved documents in response to said user query; a classification subsystem for computing a semantic relatedness between each class and every other one of said classes and for producing a set of class scores for each stored document, each class score in said set representing a semantic relatedness between said stored document and one of said classes; and a visualization subsystem for producing a semantic space map in response to said semantic relatedness between each class and every other one of said classes, for populating said semantic space map with said stored documents in response to said sets of class scores and for displaying a populated semantic space map on said user output device.
-
-
16. A machine-readable computer data storage medium having stored therein a program, comprising:
-
a term frequency statistics generator for generating term frequency statistics in response to a plurality of pre-classified documents, each having a plurality of terms and each topically associated with a predetermined one of a predetermined plurality of classes; a semantic space map generator for generating a semantic association between each class of said plurality of classes and every other class of said plurality of classes in response to said term frequency statistics and topical association between each document and predetermined one of said plurality of classes; a multidimensional scaler for positioning said plurality of classes in a plurality of dimensions in a semantic space map in response to said semantic association between each said class and every other said class; a statistical classifier for producing a set of class scores for a document in response to frequencies of terms in said document and for positioning said document in said semantic space map in response to said set of class scores, said set of class scores representing the semantic association between said document and each said class; and a semantic space map populator for positioning said document corresponding to said set of class scores in said semantic space map. - View Dependent Claims (17, 18, 19, 20, 21, 22)
-
Specification