Determining trends using text mining
Abstract
A method for visualizing variations in a corpus of information. The corpus includes a plurality of information entries, which are divided into a plurality of sub-groups according to a differentiating parameter of the entries. For each of the entries, characteristics of information contained therein are extracted and pairs of different characteristics that appear together in at least one of the entries are found. An occurrence value is determined for each of the pairs of characteristics in each sub-group in which both of the characteristics appear. The occurrence values of at least some of the pairs of characteristics for at least two of the sub-groups are compared, and an indication of the comparative occurrence values of the pairs is provided.
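The extraction, pairing, and counting steps summarized above can be illustrated with a minimal Python sketch. It is not the patented implementation: the keyword-matching extractor, the `vocabulary` set, the sample entries, and the choice of "number of entries containing both terms" as the occurrence value are all assumptions made for illustration.

```python
from collections import Counter, defaultdict
from itertools import combinations


def extract_characteristics(text, vocabulary):
    """Toy extractor (assumed): the vocabulary terms that occur in one entry."""
    words = {w.strip(".,;:").lower() for w in text.split()}
    return words & vocabulary


def pair_occurrences(entries, vocabulary):
    """Map each sub-group key to a Counter of unordered term pairs.

    The occurrence value used here (an assumption) is the number of
    entries in the sub-group in which both terms of the pair appear.
    """
    counts = defaultdict(Counter)
    for group, text in entries:
        terms = sorted(extract_characteristics(text, vocabulary))
        for pair in combinations(terms, 2):
            counts[group][pair] += 1
    return counts


# Hypothetical corpus: (differentiating parameter, entry text).
entries = [
    (1998, "Acme announces merger with Globex"),
    (1998, "Acme merger talks stall"),
    (1999, "Globex and Acme complete merger"),
    (1999, "Globex opens new plant"),
]
vocabulary = {"acme", "globex", "merger"}
counts = pair_occurrences(entries, vocabulary)
```

Sorting the terms before pairing gives each unordered pair a single canonical key, so the same pair is counted consistently across sub-groups.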
24 Claims
1. A method for visualizing variations in a corpus of information, including a plurality of information entries which are divided into a plurality of sub-groups according to a differentiating parameter of the entries, comprising:
for each of the entries, extracting characteristics of information contained therein;
finding pairs of different characteristics that appear together in at least one of the entries;
determining an occurrence value for each of the pairs of characteristics in each sub-group in which both of the characteristics appear;
comparing the occurrence values of at least some of the pairs of characteristics for at least two of the sub-groups; and
providing an indication of the comparative occurrence values of the pairs;
wherein the differentiating parameter defines an order, and wherein comparing the occurrence values comprises comparing the occurrence values in a first sub-group with the occurrence values in one or more previous sub-groups in the order.
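One way to read the ordered comparison in claim 1 is sketched below in Python, assuming the sub-group keys sort into the intended order and that each sub-group is compared only with its immediate predecessor. The function name `compare_with_previous`, the use of a simple difference as the comparison, and the sample values are illustrative assumptions; the claim also covers comparison against more than one previous sub-group.

```python
def compare_with_previous(counts_by_group):
    """Change in each pair's occurrence value relative to the preceding sub-group.

    Returns {sub-group: {pair: later_value - earlier_value}} for every
    sub-group except the first, walking the keys in sorted order.
    """
    ordered = sorted(counts_by_group)
    changes = {}
    for prev, curr in zip(ordered, ordered[1:]):
        earlier, later = counts_by_group[prev], counts_by_group[curr]
        pairs = set(earlier) | set(later)
        # A pair missing from a sub-group is treated as having value 0 there.
        changes[curr] = {p: later.get(p, 0) - earlier.get(p, 0) for p in pairs}
    return changes


# Hypothetical occurrence values for two yearly sub-groups.
counts_by_group = {
    1998: {("acme", "merger"): 2, ("acme", "globex"): 1},
    1999: {("acme", "merger"): 1, ("acme", "globex"): 1, ("globex", "plant"): 3},
}
changes = compare_with_previous(counts_by_group)
```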
5. A method for visualizing variations in a corpus of information, including a plurality of information entries which are divided into a plurality of sub-groups according to a differentiating parameter of the entries, comprising:
for each of the entries, extracting characteristics of information contained therein;
finding pairs of different characteristics that appear together in at least one of the entries;
determining an occurrence value for each of the pairs of characteristics in each sub-group in which both of the characteristics appear;
comparing the occurrence values of at least some of the pairs of characteristics for at least two of the sub-groups; and
providing an indication of the comparative occurrence values of the pairs;
wherein providing the indication comprises displaying a graph, wherein displaying the graph comprises displaying a graph in which each term is represented by a node, the pairs of characteristics that are found are represented by edges, and substantially each edge is associated with the indication of the comparative appearance of the respective pair.
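The graph structure described in claim 5 can be sketched with plain Python containers, leaving the actual drawing to whatever display layer is used. The function `build_trend_graph`, the "rising"/"falling"/"steady" labels, and the input values are assumptions for illustration; the claim does not prescribe how the indication on each edge is encoded.

```python
def build_trend_graph(changes):
    """Build (nodes, edges) from {pair: change in occurrence value}.

    Each term becomes a node, each found pair becomes an edge, and each
    edge carries an indication of how the pair's occurrence value moved.
    """
    nodes = set()
    edges = {}
    for (a, b), delta in changes.items():
        nodes.update((a, b))
        if delta > 0:
            trend = "rising"
        elif delta < 0:
            trend = "falling"
        else:
            trend = "steady"
        edges[(a, b)] = {"delta": delta, "trend": trend}
    return nodes, edges


# Hypothetical comparison result between two sub-groups.
nodes, edges = build_trend_graph({
    ("acme", "merger"): -1,
    ("acme", "globex"): 0,
    ("globex", "plant"): 3,
})
```

A renderer could then map the `trend` attribute to an edge color or line style.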
8. A method for visualizing variations in a corpus of information, including a plurality of information entries which are divided into a plurality of sub-groups according to a differentiating parameter of the entries, comprising:
for each of the entries, extracting characteristics of information contained therein;
finding pairs of different characteristics that appear together in at least one of the entries;
determining an occurrence value for each of the pairs of characteristics in each sub-group in which both of the characteristics appear;
comparing the occurrence values of at least some of the pairs of characteristics for at least two of the sub-groups; and
providing an indication of the comparative occurrence values of the pairs;
wherein providing the indication comprises displaying a graph, wherein displaying the graph comprises displaying for each two sub-groups a graph which compares the occurrence values in the two sub-groups, wherein the graphs of each two sub-groups are displayed as an animation sequence.
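The animation sequence of claim 8 can be approximated as a stream of frames, one per pair of sub-groups. The Python sketch below assumes that "each two sub-groups" means consecutive sub-groups in sorted order, which is only one possible reading; the generator name `animation_frames` and the sample data are hypothetical.

```python
def animation_frames(counts_by_group):
    """Yield one frame per consecutive pair of sub-groups, in sorted order.

    Each frame is (earlier, later, {pair: (earlier_value, later_value)}),
    which a display layer could draw as a single comparison graph.
    """
    ordered = sorted(counts_by_group)
    for earlier, later in zip(ordered, ordered[1:]):
        a, b = counts_by_group[earlier], counts_by_group[later]
        values = {p: (a.get(p, 0), b.get(p, 0)) for p in sorted(set(a) | set(b))}
        yield earlier, later, values


# Hypothetical occurrence values for three yearly sub-groups.
frames = list(animation_frames({
    1998: {("acme", "merger"): 2},
    1999: {("acme", "merger"): 1, ("globex", "plant"): 3},
    2000: {("globex", "plant"): 4},
}))
```

Playing the frames back in order shows how each edge changes from one sub-group to the next.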
9. A method for visualizing variations in a corpus of information, including a plurality of information entries which are divided into a plurality of sub-groups according to a differentiating parameter of the entries, comprising:
for each of the entries, extracting characteristics of information contained therein;
finding pairs of different characteristics that appear together in at least one of the entries;
determining an occurrence value for each of the pairs of characteristics in each sub-group in which both of the characteristics appear;
comparing the occurrence values of at least some of the pairs of characteristics for at least two of the sub-groups; and
providing an indication of the comparative occurrence values of the pairs;
wherein providing the indication comprises displaying a graph, wherein displaying the graph comprises displaying a plurality of superimposed graphs, each of which represents the appearance of the pairs in a different sub-group.
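For the superimposed graphs of claim 9, a simple data model is one edge map in which every pair carries its value for each sub-group, so that a renderer can draw one layer per sub-group over a shared node layout. The Python sketch below assumes that model; `superimpose` and the sample values are illustrative, not taken from the specification.

```python
def superimpose(counts_by_group):
    """Merge per-sub-group graphs into one map: pair -> {sub-group: value}.

    A pair that does not appear in a sub-group simply has no entry for
    that sub-group, so each layer contains only its own edges.
    """
    layered = {}
    for group in sorted(counts_by_group):
        for pair, value in counts_by_group[group].items():
            layered.setdefault(pair, {})[group] = value
    return layered


# Hypothetical occurrence values for two yearly sub-groups.
layered = superimpose({
    1998: {("acme", "merger"): 2, ("acme", "globex"): 1},
    1999: {("acme", "merger"): 1, ("globex", "plant"): 3},
})
```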
11. Apparatus for visualizing variations in a corpus of information including a plurality of information entries which are divided into a plurality of sub-groups according to a differentiating parameter of the entries, comprising:
a processor which finds pairs of characteristics which appear together in at least one of the documents, determines an occurrence value for each of the pairs of characteristics in each sub-group in which both of the characteristics appear, and compares the occurrence values of at least some of the pairs of characteristics for at least two of the sub-groups; and
a display which displays an indication of the comparative occurrence values of the pairs;
wherein the differentiating parameter defines an order, and wherein the processor compares the occurrence values in a first sub-group with the occurrence values in one or more previous sub-groups in the order.
15. Apparatus for visualizing variations in a corpus of information including a plurality of information entries which are divided into a plurality of sub-groups according to a differentiating parameter of the entries, comprising:
a processor which finds pairs of characteristics which appear together in at least one of the documents, determines an occurrence value for each of the pairs of characteristics in each sub-group in which both of the characteristics appear, and compares the occurrence values of at least some of the pairs of characteristics for at least two of the sub-groups; and
a display which displays an indication of the comparative occurrence values of the pairs;
wherein the display displays a graph, wherein each node in the graph represents a term and each edge represents a found pair of characteristics, and substantially each edge is associated with the indication of the comparative appearance of the respective pair.
18. Apparatus for visualizing variations in a corpus of information including a plurality of information entries which are divided into a plurality of sub-groups according to a differentiating parameter of the entries, comprising:
a processor which finds pairs of characteristics which appear together in at least one of the documents, determines an occurrence value for each of the pairs of characteristics in each sub-group in which both of the characteristics appear, and compares the occurrence values of at least some of the pairs of characteristics for at least two of the sub-groups; and
a display which displays an indication of the comparative occurrence values of the pairs;
wherein the display displays a graph;
said graph comprises a plurality of graphs each of which compares the occurrence values of the pairs in two sub-groups, wherein the plurality of graphs are displayed as an animation sequence.
19. Apparatus for visualizing variations in a corpus of information including a plurality of information entries which are divided into a plurality of sub-groups according to a differentiating parameter of the entries, comprising:
a processor which finds pairs of characteristics which appear together in at least one of the documents, determines an occurrence value for each of the pairs of characteristics in each sub-group in which both of the characteristics appear, and compares the occurrence values of at least some of the pairs of characteristics for at least two of the sub-groups; and
a display which displays an indication of the comparative occurrence values of the pairs;
wherein the display displays a graph, the graph comprises a plurality of superimposed graphs each of which represents the occurrence values of the pairs in a different sub-group.
21. A method for selecting a range of values of a variable, comprising:
providing a graphic user interface on a display, including a slide-piece that has an initial dimension and is translatable along an axis representing the variable such that each position of the slide-piece along the axis corresponds to a given value of the variable;
positioning the slide-piece at a first position on the axis, so as to indicate a first value of the variable; and
changing the dimension of the slide-piece so as to indicate a second value of the variable, whereby the first and second values of the variable define the selected range.
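The range-selection steps of claim 21 can be modeled without any GUI toolkit, as in the Python sketch below. It assumes the slide-piece is described by the axis value at its leading edge plus its length in axis units, and that values are clamped to the axis; the class `SlidePiece`, its method names, and the sample axis are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class SlidePiece:
    axis_min: float
    axis_max: float
    position: float      # axis value at the leading edge of the slide-piece
    length: float = 0.0  # current dimension of the piece, in axis units

    def move_to(self, value):
        """Translate the piece so its leading edge indicates the first value."""
        upper = self.axis_max - self.length
        self.position = min(max(value, self.axis_min), upper)

    def resize_to(self, value):
        """Change the dimension so the trailing edge indicates the second value."""
        value = min(max(value, self.position), self.axis_max)
        self.length = value - self.position

    def selected_range(self):
        """The range defined by the first and second values."""
        return (self.position, self.position + self.length)


# Hypothetical axis of years, with an initial dimension of one year.
piece = SlidePiece(axis_min=1990, axis_max=2000, position=1990, length=1)
piece.move_to(1993)     # first value
piece.resize_to(1997)   # second value
selected = piece.selected_range()
```

Keeping position and length as the only state means a single control expresses both ends of the range, which appears to be the point of the claimed slide-piece.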
24. A computer program product for selecting a range of values of a variable, the program having computer-readable program instructions embodied therein, which instructions, when read by a computer, cause the computer to:
provide a graphic user interface on a display, including a slide-piece that has an initial dimension and is translatable along an axis representing the variable such that each position of the slide-piece along the axis corresponds to a given value of the variable;
position the slide-piece at a first position on the axis, so as to indicate a first value of the variable; and
change the dimension of the slide-piece so as to indicate a second value of the variable, whereby the first and second values of the variable define the selected range.
Specification