Natural language text analytics
First Claim
1. A method comprising:
- using a computer having a processor, and a memory having computer instructions to cause the processor to perform steps, including;
filtering a plurality of unfiltered records having unstructured data according to at least one criterion into at least a first group and a second group, said first group and said second group each comprise at least two records, wherein said first group is different than said second group, the groups being stored in the memory;
determining a first proportion of occurrence for a term by comparing a first number of records having at least one occurrence of said term in said first group to a first total number of records in said first group;
determining a second proportion of occurrence for said term by comparing a second number of records to determine a number of occurrences of said term in said second group to a second total number of records in said second group;
determining a first standard error range according to said first proportion of occurrence of said term, a first total number of records within said first group, and a level of confidence;
determining a second standard error range according to said second proportion of occurrence of said term, a second total number of records within said second group, and said level of confidence; and
indicating, via an overlap identifier, when said first standard error range and said second error range overlap.
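The last three steps of this claim can be sketched in Python. The claim does not state a formula for the standard error range, so this sketch assumes the common normal-approximation interval p ± z·sqrt(p(1 − p)/n), with z derived from the level of confidence. The function names, the substring-based term match, and the sample records are hypothetical and are not taken from the patent.

```python
import math
from statistics import NormalDist


def proportion_of_occurrence(records, term):
    """Fraction of records that contain the term at least once.

    Assumption: a case-insensitive substring match counts as an occurrence.
    """
    hits = sum(1 for text in records if term.lower() in text.lower())
    return hits / len(records)


def standard_error_range(p, n, confidence=0.95):
    """Return (low, high) around proportion p for a group of n records.

    Assumption: normal-approximation interval; the range is not clamped to [0, 1].
    """
    z = NormalDist().inv_cdf(0.5 + confidence / 2)
    margin = z * math.sqrt(p * (1 - p) / n)
    return (p - margin, p + margin)


def ranges_overlap(a, b):
    """True when the two (low, high) ranges share at least one point."""
    return a[0] <= b[1] and b[0] <= a[1]


# Hypothetical groups that have already been filtered by some criterion.
first_group = [
    "Battery drains fast",
    "The battery overheats",
    "Screen cracked",
    "Battery replaced twice",
]
second_group = [
    "Great screen",
    "Battery is fine",
    "Fast shipping",
    "Good camera",
]

p1 = proportion_of_occurrence(first_group, "battery")
p2 = proportion_of_occurrence(second_group, "battery")
range1 = standard_error_range(p1, len(first_group))
range2 = standard_error_range(p2, len(second_group))
overlap = ranges_overlap(range1, range2)  # plays the role of the overlap identifier
```

With groups this small the two ranges are wide and overlap, which under this reading suggests the difference between the two proportions may not be meaningful at the chosen level of confidence.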
Abstract
A method of text analytics includes filtering a plurality of unfiltered records having unstructured data into at least a first group and a second group. The first group and the second group each include at least two records, and the first group is different than the second group. The method includes determining a first proportion of occurrence for a term by comparing a first number of records having at least one occurrence of the term in the first group to a first total number of records in the first group, determining a second proportion of occurrence for the term by comparing a second number of records having at least one occurrence of the term in the second group to a second total number of records in the second group, and comparing the first proportion of occurrence to the second proportion of occurrence to yield a resultant comparison occurrence.
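As a rough illustration of the abstract, the following Python sketch filters records into two groups by one criterion and compares the proportion of occurrence of a term between them. The abstract does not say how the comparison is computed, so the sketch assumes a simple difference of proportions. The record fields (`region`, `text`), the function names, and the sample data are hypothetical.

```python
def filter_into_groups(records, criterion):
    """Split records into (first_group, second_group) using a predicate."""
    first = [r for r in records if criterion(r)]
    second = [r for r in records if not criterion(r)]
    return first, second


def proportion_of_occurrence(group, term):
    """Records with at least one occurrence of the term, divided by all records in the group."""
    hits = sum(1 for r in group if term.lower() in r["text"].lower())
    return hits / len(group)


# Hypothetical unfiltered records with free-text (unstructured) content.
records = [
    {"region": "east", "text": "Battery drains fast"},
    {"region": "east", "text": "The battery overheats"},
    {"region": "east", "text": "Screen cracked"},
    {"region": "east", "text": "Battery replaced twice"},
    {"region": "west", "text": "Great screen"},
    {"region": "west", "text": "Battery is fine"},
    {"region": "west", "text": "Fast shipping"},
    {"region": "west", "text": "Good camera"},
]

first_group, second_group = filter_into_groups(records, lambda r: r["region"] == "east")

first_proportion = proportion_of_occurrence(first_group, "battery")
second_proportion = proportion_of_occurrence(second_group, "battery")

# Assumed form of the "resultant comparison occurrence": a plain difference.
comparison = first_proportion - second_proportion
```

A ratio of the two proportions would be an equally plausible reading of the comparison step; the difference is used here only because it stays defined when the second proportion is zero.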
17 Citations
10 Claims
1. A method comprising:
using a computer having a processor, and a memory having computer instructions to cause the processor to perform steps, including;
filtering a plurality of unfiltered records having unstructured data according to at least one criterion into at least a first group and a second group, said first group and said second group each comprise at least two records, wherein said first group is different than said second group, the groups being stored in the memory;
determining a first proportion of occurrence for a term by comparing a first number of records having at least one occurrence of said term in said first group to a first total number of records in said first group;
determining a second proportion of occurrence for said term by comparing a second number of records to determine a number of occurrences of said term in said second group to a second total number of records in said second group;
determining a first standard error range according to said first proportion of occurrence of said term, a first total number of records within said first group, and a level of confidence;
determining a second standard error range according to said second proportion of occurrence of said term, a second total number of records within said second group, and said level of confidence; and
indicating, via an overlap identifier, when said first standard error range and said second error range overlap.
Dependent claims: 2, 3, 4, 5, 6
7. A non-transitory storage medium comprising:
instructions that are readable by a processor and cause said processor to;
filter a plurality of unfiltered records having unstructured data according to at least one criterion into at least a first group and a second group, said first group and said second group each comprise at least two records, wherein said first group is different than said second group;
determine a first proportion of occurrence for a term by comparing a first number of records having at least one occurrence of said term in said first group to a first total number of records in said first group;
determine a second proportion of occurrence for said term by comparing a second number of records to determine a number of occurrences of said term in said second group to a second total number of records in said second group;
compare said first proportion of occurrence to said second proportion of occurrence to yield a resultant comparison occurrences;
determine a first standard error range according to said first proportion of occurrence of said term, a first total number of records within said first group, and a level of confidence;
determine a second standard error range according to said second proportion of occurrence of said term, a second total number of records within said second group, and said level of confidence; and
indicate, via an overlap identifier, when said first standard error range and said second error range overlap.
8. A method comprising:
using a computer having a processor and a memory having computer instructions to cause the processor to perform steps, including;
allocating each record of a plurality of records a numerical value according to a sentiment to yield a plurality of sentiment records;
filtering said plurality of sentiment records according to a criterion to yield filtered records;
determining a sentiment value for a term according to a number of said filtered records having said term and said numerical value, wherein said sentiment value is selected from the group consisting of; a mean, a variance and a deviation; and
determining a proportion of occurrence of a term by comparing a number of said filtered records having at least one occurrence of said term to a total number of records in said filtered records.
Dependent claims: 9
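The sentiment steps of this claim can be sketched as follows. This is a minimal illustration under several assumptions that the claim does not specify: each record already carries a sentiment label, labels map to the scores -1, 0 and 1, "variance" and "deviation" are the population variance and population standard deviation, and a term occurrence is a case-insensitive substring match. All names, fields, and sample data are hypothetical.

```python
from statistics import mean, pstdev, pvariance

# Assumed mapping from a sentiment label to a numerical value.
SENTIMENT_SCORES = {"negative": -1, "neutral": 0, "positive": 1}


def allocate_sentiment(records):
    """Attach a numerical score to every record, yielding 'sentiment records'."""
    return [dict(r, score=SENTIMENT_SCORES[r["sentiment"]]) for r in records]


def term_statistics(filtered, term):
    """Sentiment values and proportion of occurrence for a term in the filtered records."""
    scores = [r["score"] for r in filtered if term.lower() in r["text"].lower()]
    if not scores:
        return None  # the term never occurs, so no sentiment value is defined
    return {
        "mean": mean(scores),
        "variance": pvariance(scores),
        "deviation": pstdev(scores),
        "proportion": len(scores) / len(filtered),
    }


# Hypothetical input records.
records = [
    {"channel": "email", "sentiment": "negative", "text": "Refund took weeks"},
    {"channel": "email", "sentiment": "positive", "text": "Refund arrived quickly"},
    {"channel": "email", "sentiment": "neutral", "text": "Order received"},
    {"channel": "email", "sentiment": "positive", "text": "Helpful support"},
    {"channel": "phone", "sentiment": "negative", "text": "Refund denied"},
]

sentiment_records = allocate_sentiment(records)
filtered = [r for r in sentiment_records if r["channel"] == "email"]
stats = term_statistics(filtered, "refund")
```

Here two of the four filtered records mention the term, one scored -1 and one scored 1, so the mean is zero while the variance shows that opinion about the term is split.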
10. A non-transitory storage medium comprising:
instructions that are readable by a processor and cause said processor to;
allocate each record of a plurality of records a numerical value according to a sentiment to yield a plurality of sentiment records;
filter said plurality of sentiment records according to a criterion to yield filtered records;
determine a sentiment value for a term according to a number of said filtered records having said term and said numerical value, wherein said sentiment value is selected from the group consisting of; a mean, a variance and a deviation; and
determining a proportion of occurrence of a term by comparing a number of said filtered records having at least one occurrence of said term to a total number of records in said filtered records.
Specification