Three-dimensional display of document set
Abstract
A method for spatializing text content for enhanced visual browsing and analysis. The invention is applied to large text document corpora such as digital libraries, regulations and procedures, archived reports, and the like. The text content from these sources may be transformed to a spatial representation that preserves informational characteristics from the documents. The three-dimensional representation may then be visually browsed and analyzed in ways that avoid language processing and that reduce the analysts' effort.
282 Citations
7 Claims
1. A method for visually representing on a display information from a plurality of semantic objects, comprising the steps of:
- Defining at least one topic or topic attribute;
- Forming clusters, using a computer processor, from the plurality of semantic objects, wherein a cluster defines a group of semantic objects arranged by their respective relative significance to the at least one topic or topic attribute;
- Quantitatively characterizing respective clusters;
- Defining, for a plurality of semantic objects, a quantitative difference of the respective semantic object from the quantitatively characterized respective cluster; and
- Outputting for the display representations of at least one topic or topic attribute and the plurality of semantic objects into a representational space, wherein the representational space has a lower dimensionality associated with each of the semantic objects, an arrangement of representations is influenced by a semantic relation of a semantic object and the at least one topic or topic attribute;
- creating a plurality of topic or topic attribute layers, each of the layers corresponding to a topic or topic attribute applied to each respective cluster, and identifying a coordinate location within the representational space for each semantic object associated with each layer; and
- defining an amplitude surface within the representational space associated with each layer by applying a smoothing function to the coordinate location representations of the semantic objects, and superimposing upon one another all of the amplitude surfaces of the respective layers.
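The last two steps of claim 1 (per-layer coordinate locations, a smoothed amplitude surface per layer, and superposition of the surfaces) can be illustrated with a minimal sketch. The claim does not name a smoothing function or say how the surfaces are combined, so the Gaussian kernel, the cell-by-cell sum, the grid size, the bandwidth, and the names `amplitude_surface`, `superimpose`, and `layers` are all assumptions made for illustration.

```python
import math

def amplitude_surface(points, grid_size=8, extent=1.0, bandwidth=0.15):
    """Smooth 2-D point locations into a grid of amplitudes.

    Assumption: a Gaussian kernel stands in for the claim's unspecified
    "smoothing function".
    """
    step = extent / (grid_size - 1)
    surface = [[0.0] * grid_size for _ in range(grid_size)]
    for row in range(grid_size):
        for col in range(grid_size):
            gx, gy = col * step, row * step
            for px, py in points:
                d2 = (gx - px) ** 2 + (gy - py) ** 2
                surface[row][col] += math.exp(-d2 / (2 * bandwidth ** 2))
    return surface

def superimpose(surfaces):
    """Combine per-layer surfaces; summation is an assumed reading of 'superimposing'."""
    rows, cols = len(surfaces[0]), len(surfaces[0][0])
    return [[sum(s[r][c] for s in surfaces) for c in range(cols)]
            for r in range(rows)]

# Hypothetical layers: each topic maps to the (x, y) locations of its objects.
layers = {
    "topic_a": [(0.2, 0.2), (0.25, 0.3)],
    "topic_b": [(0.8, 0.7)],
}
per_layer = {name: amplitude_surface(pts) for name, pts in layers.items()}
combined = superimpose(list(per_layer.values()))
```

Each entry of `per_layer` is one topic layer's surface, and `combined` is the superimposed result; a renderer could draw either as a height field.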
2. A method for visually representing on a display information from a plurality of semantic objects, comprising the steps of:
- Defining at least one topic or topic attribute;
- Forming clusters, using a computer processor, from the plurality of semantic objects, wherein a cluster defines a group of semantic objects arranged by their respective relative significance to the at least one topic or topic attribute;
- Quantitatively characterizing respective clusters;
- Defining, for a plurality of semantic objects, a quantitative difference of the respective semantic object from the quantitatively characterized respective cluster; and
- Outputting for the display representations of at least one topic or topic attribute and the plurality of semantic objects into a representational space, wherein the representational space has a lower dimensionality associated with each of the semantic objects, an arrangement of representations is influenced by a semantic relation of a semantic object and the at least one topic or topic attribute;
- wherein said outputting step seeks to preserve a distance between pairs of semantic objects in said representational space based on the defined quantitative difference of the respective semantic object from the quantitatively characterized respective cluster.
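One common way to quantify how well a low-dimensional layout preserves pairwise distances, as the wherein clause of claim 2 requires, is a normalized stress value. The sketch below is illustrative only: the claim does not specify a measure, plain Euclidean distance between feature vectors is used here as a stand-in for the claimed quantitative difference, and the names `normalized_stress`, `high`, `faithful`, and `squashed` are hypothetical.

```python
import math
from itertools import combinations

def euclidean(a, b):
    """Euclidean distance between two equal-length coordinate tuples."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def normalized_stress(high, low):
    """Return 0.0 when every pairwise distance is preserved, larger values otherwise."""
    num = den = 0.0
    for i, j in combinations(range(len(high)), 2):
        d_high = euclidean(high[i], high[j])
        d_low = euclidean(low[i], low[j])
        num += (d_high - d_low) ** 2
        den += d_high ** 2
    return math.sqrt(num / den)

# Hypothetical objects in a 3-D feature space and two candidate 2-D layouts.
high = [(0, 0, 0), (3, 0, 0), (0, 4, 0)]
faithful = [(0, 0), (3, 0), (0, 4)]   # keeps all pairwise distances
squashed = [(0, 0), (1, 0), (0, 1)]   # distorts them

stress_faithful = normalized_stress(high, faithful)
stress_squashed = normalized_stress(high, squashed)
```

A layout procedure that "seeks to preserve" distances would prefer `faithful` over `squashed` because its stress is lower.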
3. A method for visually representing on a display information from a plurality of semantic objects, comprising the steps of:
- Defining at least one topic or topic attribute;
- Forming clusters, using a computer processor, from the plurality of semantic objects, wherein a cluster defines a group of semantic objects arranged by their respective relative significance to the at least one topic or topic attribute;
- Quantitatively characterizing respective clusters;
- Defining, for a plurality of semantic objects, a quantitative difference of the respective semantic object from the quantitatively characterized respective cluster; and
- Outputting for the display representations of at least one topic or topic attribute and the plurality of semantic objects into a representational space, wherein the representational space has a lower dimensionality associated with each of the semantic objects, an arrangement of representations is influenced by a semantic relation of a semantic object and the at least one topic or topic attribute;
- wherein the clustering step comprises producing a partition set on the plurality of semantic objects, by applying a clustering algorithm with primary emphasis on k-means and complete linkage hierarchical clustering to define a cluster centroid, the cluster centroid representing the quantitative characterization of a respective cluster.
(Dependent claim: 4)
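The clustering step of claim 3 can be sketched with a small k-means loop that yields a partition and one centroid per cluster, followed by each object's distance to its centroid as one possible "quantitative difference". This is a minimal sketch, not the patented procedure: the deterministic seeding from the first `k` vectors, the fixed iteration count, the toy vectors, and the names `kmeans`, `docs`, and `differences` are assumptions, and complete linkage hierarchical clustering is not shown.

```python
import math

def kmeans(vectors, k, iterations=20):
    """Plain Lloyd-style k-means; returns (assignment, centroids)."""
    # Assumption: seed centroids with the first k vectors for repeatability.
    centroids = [list(v) for v in vectors[:k]]
    assignment = [0] * len(vectors)
    for _ in range(iterations):
        # Assign each vector to its nearest centroid.
        assignment = [
            min(range(k), key=lambda c: math.dist(v, centroids[c]))
            for v in vectors
        ]
        # Move each centroid to the mean of its members.
        for c in range(k):
            members = [v for v, a in zip(vectors, assignment) if a == c]
            if members:
                centroids[c] = [sum(col) / len(members) for col in zip(*members)]
    return assignment, centroids

# Hypothetical term-weight vectors for four documents over two topics.
docs = [[1.0, 0.0], [0.9, 0.1], [0.0, 1.0], [0.1, 0.9]]
assignment, centroids = kmeans(docs, k=2)

# Distance of each document from its own cluster centroid.
differences = [math.dist(v, centroids[a]) for v, a in zip(docs, assignment)]
```

The centroids play the role of the quantitative characterization of each cluster, and `differences` gives one number per object that later layout steps could consume.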
5. A method for visually representing on a display information from a plurality of semantic objects, comprising the steps of:
- Defining at least one topic or topic attribute;
- Forming clusters, using a computer processor, from the plurality of semantic objects, wherein a cluster defines a group of semantic objects arranged by their respective relative significance to the at least one topic or topic attribute;
- Quantitatively characterizing respective clusters;
- Defining, for a plurality of semantic objects, a quantitative difference of the respective semantic object from the quantitatively characterized respective cluster; and
- Outputting for the display representations of at least one topic or topic attribute and the plurality of semantic objects into a representational space, wherein the representational space has a lower dimensionality associated with each of the semantic objects, an arrangement of representations is influenced by a semantic relation of a semantic object and the at least one topic or topic attribute;
- wherein each of the quantitative characterization of a respective cluster represents a cluster centroid, the outputting step comprising the steps of applying a Multi-dimensional Scaling Algorithm to cluster centroid coordinates in hyperspace, producing a vector for each semantic object with distance measures from the semantic object to each cluster centroid, and constructing an operator matrix and multiplying the operator matrix by the vector to produce coordinates for each semantic object.
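The "operator matrix times distance vector" step of claim 5 resembles the triangulation used in landmark-style multidimensional scaling, and the sketch below follows that reading. It is an assumption, not the patent's stated construction: the centroid coordinates are taken as already projected to 2-D (the MDS step itself is not shown), the operator is the pseudo-inverse of the centered centroid coordinates, and the names `operator_matrix` and `place` are hypothetical.

```python
def operator_matrix(centroids_2d):
    """Return (mean, centered coords Y, op) with op = (Y^T Y)^-1 Y^T, a 2 x k matrix."""
    k = len(centroids_2d)
    mx = sum(p[0] for p in centroids_2d) / k
    my = sum(p[1] for p in centroids_2d) / k
    y = [(p[0] - mx, p[1] - my) for p in centroids_2d]
    # Entries of the symmetric 2 x 2 matrix Y^T Y and its inverse.
    a = sum(u * u for u, _ in y)
    b = sum(u * v for u, v in y)
    d = sum(v * v for _, v in y)
    det = a * d - b * b
    inv = ((d / det, -b / det), (-b / det, a / det))
    op = [[inv[r][0] * u + inv[r][1] * v for u, v in y] for r in range(2)]
    return (mx, my), y, op

def place(sq_dists, mean, y, op):
    """Map a vector of squared object-to-centroid distances to 2-D coordinates."""
    k = len(y)
    # Mean squared distance from each centroid to all centroids.
    mu = [sum((yi[0] - yj[0]) ** 2 + (yi[1] - yj[1]) ** 2 for yj in y) / k
          for yi in y]
    diff = [s - m for s, m in zip(sq_dists, mu)]
    x = [-0.5 * sum(op[r][i] * diff[i] for i in range(k)) for r in range(2)]
    return (x[0] + mean[0], x[1] + mean[1])

# Hypothetical 2-D centroid layout and one object whose true spot is (0.25, 0.5).
centroids_2d = [(0.0, 0.0), (1.0, 0.0), (0.0, 1.0)]
target = (0.25, 0.5)
sq_dists = [(target[0] - cx) ** 2 + (target[1] - cy) ** 2 for cx, cy in centroids_2d]

mean, y, op = operator_matrix(centroids_2d)
coords = place(sq_dists, mean, y, op)
```

The operator depends only on the centroids, so it is built once and reused for every object. In this toy case the distances are measured in the plane, so the position is recovered exactly; with distances measured in the original high-dimensional space the result would only be an approximation.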
6. A method for visually representing on a display information from a plurality of semantic objects, comprising the steps of:
- Defining at least one topic or topic attribute;
- Forming clusters, using a computer processor, from the plurality of semantic objects, wherein a cluster defines a group of semantic objects arranged by their respective relative significance to the at least one topic or topic attribute;
- Quantitatively characterizing respective clusters;
- Defining, for a plurality of semantic objects, a quantitative difference of the respective semantic object from the quantitatively characterized respective cluster; and
- Outputting for the display representations of at least one topic or topic attribute and the plurality of semantic objects into a representational space, wherein the representational space has a lower dimensionality associated with each of the semantic objects, an arrangement of representations is influenced by a semantic relation of a semantic object and the at least one topic or topic attribute;
- wherein each of the quantitative characterization of a respective cluster represents a cluster centroid, the outputting step comprising the steps of applying an Anchored Least Stress Algorithm to cluster centroid coordinates in hyperspace, producing a vector for each semantic object with distance measures from the semantic object to each cluster centroid, and constructing an operator matrix and multiplying the operator matrix by the vector to produce coordinates for each semantic object.
7. A method for visually representing on a display information from a plurality of semantic objects, comprising the steps of:
- Defining at least one topic or topic attribute;
- Forming, using a computer processor, clusters from the plurality of semantic objects, wherein a cluster defines a group of semantic objects arranged by their respective relative significance to the at least one topic or topic attribute;
- Quantitatively characterizing respective clusters;
- Defining, for a plurality of semantic objects, a quantitative difference of the respective semantic object from the quantitatively characterized respective cluster; and
- Outputting for the display representations of at least one topic or topic attribute and the plurality of semantic objects into a representational space, wherein the representational space has a lower dimensionality associated with each of the semantic objects, an arrangement of representations is influenced by a semantic relation of a semantic object and the at least one topic or topic attribute;
- wherein the semantic objects have a native high dimensional space, and a distance between semantic objects in the representational space is non-linearly related to a distance between semantic objects in the native high dimensional space.
Specification