SYSTEMS AND METHODS FOR ANALYZING ELECTRONIC TEXT
First Claim
1. A computer-implemented method for systematically analyzing an electronic text, comprising:
receiving the electronic text from a plurality of sources;
determining an at least one term of interest to be identified in the electronic text;
identifying a plurality of locations within the electronic text including the at least one term of interest;
for each location within a plurality of locations, creating a snippet from a text segment around the at least one term of interest at the location within the electronic text;
creating multiple taxonomies for the at least one term of interest from the snippets, wherein the taxonomies include an at least one category; and
determining co-occurrences between the multiple taxonomies to determine associations between categories of a different taxonomies of the multiple taxonomies.
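The identifying and snippet-creating steps above can be sketched as follows. This is a minimal illustration, not the patented implementation: the function name `find_snippets`, the character-based `window` parameter, and the case-insensitive matching are all assumptions, since the claim only requires "a text segment around" the term of interest.

```python
import re

def find_snippets(text, term, window=30):
    """Return (offset, snippet) pairs for each occurrence of `term` in `text`.

    `window` is a hypothetical parameter: the number of characters kept on
    each side of the matched term.
    """
    pattern = re.compile(re.escape(term), re.IGNORECASE)
    snippets = []
    for match in pattern.finditer(text):
        # Clamp the segment to the bounds of the text.
        start = max(0, match.start() - window)
        end = min(len(text), match.end() + window)
        snippets.append((match.start(), text[start:end]))
    return snippets

# Illustrative texts standing in for "a plurality of sources".
documents = [
    "The new phone has a great battery but a weak camera.",
    "Battery life on this laptop is poor.",
]

for doc in documents:
    for offset, snippet in find_snippets(doc, "battery", window=12):
        print(offset, repr(snippet))
```

Each returned offset is one "location", and the accompanying string is the snippet that later steps would categorize.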
Abstract
Systems and methods for systematically analyzing an electronic text are described. In one embodiment, the method includes receiving the electronic text from a plurality of sources. The method also includes determining an at least one term of interest to be identified in the electronic text. The method further includes identifying a plurality of locations within the electronic text including the at least one term of interest. The method also includes for each location within a plurality of locations, creating a snippet from a text segment around the at least one term of interest at the location within the electronic text. The method further includes creating multiple taxonomies for the at least one term of interest from the snippets, wherein the taxonomies include an at least one category. The method also includes determining co-occurrences between the multiple taxonomies to determine associations between categories of a different taxonomies of the multiple taxonomies.
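One way to picture the co-occurrence step is as a table of counts between the categories of two taxonomies. The sketch below assumes each taxonomy is a mapping from a category name to the set of snippet identifiers assigned to it, and that a co-occurrence is a snippet appearing in both categories. The data layout, the example categories, and the function name `cooccurrence_table` are illustrative assumptions and are not taken from the source.

```python
from itertools import product

def cooccurrence_table(taxonomy_a, taxonomy_b):
    """Count snippets shared by every pair of categories across two taxonomies."""
    table = {}
    for (cat_a, ids_a), (cat_b, ids_b) in product(
        taxonomy_a.items(), taxonomy_b.items()
    ):
        # Set intersection gives the snippets that fall in both categories.
        table[(cat_a, cat_b)] = len(ids_a & ids_b)
    return table

# Two hypothetical taxonomies over the same six snippets (ids 0..5).
sentiment = {"positive": {0, 1, 4}, "negative": {2, 3}}
topic = {"battery": {0, 2, 3}, "camera": {1, 4}}

table = cooccurrence_table(sentiment, topic)
for pair, count in sorted(table.items()):
    print(pair, count)
```

A high count for a pair such as ("negative", "battery") would suggest an association between those categories of different taxonomies. A fuller analysis might compare each count against the value expected if the categories were independent.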
93 Citations
20 Claims
1. A computer-implemented method for systematically analyzing an electronic text, comprising:
receiving the electronic text from a plurality of sources;
determining an at least one term of interest to be identified in the electronic text;
identifying a plurality of locations within the electronic text including the at least one term of interest;
for each location within a plurality of locations, creating a snippet from a text segment around the at least one term of interest at the location within the electronic text;
creating multiple taxonomies for the at least one term of interest from the snippets, wherein the taxonomies include an at least one category; and
determining co-occurrences between the multiple taxonomies to determine associations between categories of a different taxonomies of the multiple taxonomies.
Dependent claims: 2, 3, 4, 5, 6, 7, 8
9. A system for systematically analyzing an electronic text, comprising:
a module to receive the electronic text from a plurality of sources;
a module to determine an at least one term of interest to be identified in the electronic text;
a module to identify a plurality of locations within the electronic text including the at least one term of interest;
a module to create for each location within a plurality of locations a snippet from a text segment around the at least one term of interest at the location within the electronic text;
a module to create multiple taxonomies for the at least one term of interest from the snippets, wherein the taxonomies include an at least one category; and
a module to determine co-occurrences between the multiple taxonomies to determine associations between categories of a different taxonomies of the multiple taxonomies.
Dependent claims: 10, 11, 12, 13, 14, 15, 16
17. A computer program product comprising a computer useable storage medium to store a computer readable program, wherein the computer readable program, when executed on a computer, causes the computer to perform operations comprising:
receiving the electronic text from a plurality of sources;
determining an at least one term of interest to be identified in the electronic text;
identifying a plurality of locations within the electronic text including the at least one term of interest;
for each location within a plurality of locations, creating a snippet from a text segment around the at least one term of interest at the location within the electronic text;
creating multiple taxonomies for the at least one term of interest from the snippets, wherein the taxonomies include an at least one category; and
determining co-occurrences between the multiple taxonomies to determine associations between categories of a different taxonomies of the multiple taxonomies;
determining co-occurrences between a category of a single taxonomy and the at least one term of interest to determine significance of the at least one term of interest; and
sorting the at least one term of interest by significance; and
outputting the sorted at least one term of interest.
Dependent claims: 18, 19, 20
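The last three operations of claim 17 (scoring terms against a category, sorting, and outputting) can be illustrated with a short sketch. The claim does not define "significance", so the measure used here, the ratio of observed co-occurrences to the number expected if term and category were independent, is an assumption chosen for illustration. The function name `rank_terms` and the sample data are likewise hypothetical.

```python
def rank_terms(term_snippets, category_snippets, total):
    """Sort terms by an assumed significance score against one category.

    term_snippets:     mapping of term -> set of snippet ids containing it
    category_snippets: set of snippet ids assigned to the category
    total:             total number of snippets
    """
    scores = {}
    for term, ids in term_snippets.items():
        if not ids or not category_snippets:
            scores[term] = 0.0
            continue
        observed = len(ids & category_snippets)
        # Count expected under independence of term and category.
        expected = len(ids) * len(category_snippets) / total
        scores[term] = observed / expected
    # Highest score first.
    return sorted(scores.items(), key=lambda item: item[1], reverse=True)

# Hypothetical data: six snippets, one category of a single taxonomy.
term_snippets = {"battery": {0, 2, 3}, "camera": {1, 4}, "screen": {2, 5}}
negative = {2, 3}

ranked = rank_terms(term_snippets, negative, total=6)
for term, score in ranked:
    print(f"{term}: {score:.2f}")
```

A score above 1 means the term appears in the category more often than independence would predict. Other measures, such as a chi-squared statistic, could be substituted without changing the sort-and-output structure.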
Specification