×

Keyword Frequency Analysis System

  • US 20160154797A1
  • Filed: 12/01/2014
  • Published: 06/02/2016
  • Est. Priority Date: 12/01/2014
  • Status: Active Grant
First Claim
Patent Images

1. A keyword frequency analysis system, comprising:

  • a memory operable to store a plurality of sets of records, wherein each set of records is associated with a dimension and comprises a first keyword and a second keyword;

    an interface operable to;

    receive the plurality of sets of records;

    receive a request to determine whether the first keyword is a selected one of overrepresented or underrepresented in a first set of records, the request comprising a selection of a method to calculate an expected frequency of the first keyword;

    one or more hardware processors communicatively coupled to the interface and the memory and operable to;

    determine a frequency of the first keyword in each set of records;

    determine a frequency of the second keyword in each set of records;

    determine the method to calculate the expected frequency of the first keyword based on the selection of the method in the request to determine whether the first keyword is a selected one of overrepresented or underrepresented in the first set of records;

    calculate the expected frequency of the first keyword in the first set of records associated with a first dimension using the method, the expected frequency of the first keyword being a number of times the first keyword should appear in the first set of records, the expected frequency of the first keyword based on the frequency of the first keyword and the frequency of the second keyword;

    determine a difference between the frequency of the first keyword and the expected frequency;

    compare the difference to a threshold, the threshold indicating whether the difference is large enough to determine one of a selected group of overrepresentation or underrepresentation;

    in response to determining that the difference is not greater than the first threshold, communicate a message indicating that the first keyword is not overrepresented and not underrepresented;

    in response to determining that the difference is greater than the first threshold;

    determine whether the frequency of the first keyword is less than the expected frequency;

    in response to determining that the frequency of the first keyword is less than the expected frequency;

    determine that the first keyword is underrepresented in the first set of records;

    determine a degree of underrepresentation by comparing the threshold and the difference between the frequency of the first keyword and the expected frequency;

    translate the frequency of the first keyword, the frequency of the second keyword, the degree of underrepresentation, and the expected frequency into the keyword report, the keyword report comprising the expected frequency, the degree of underrepresentation, and the determination that the first keyword is underrepresented in the first set of records; and

    communicate the keyword report for display.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×