Automated collective term and phrase index

  • US 9,864,741 B2
  • Filed: 09/23/2015
  • Issued: 01/09/2018
  • Est. Priority Date: 09/23/2014
  • Status: Active Grant
First Claim
1. A method comprising:

    obtaining, by a computer system, data files from a knowledge corpus of an enterprise;

    identifying, by the computer system, key terms within the data files;

    determining, by the computer system, for each identified key term, a frequency of occurrence and location within the data files;

    generating, by the computer system, knowledge units from the data files based on the determined frequencies of occurrence and the determined locations of the key terms in the data files;

    selecting, by computing system, a knowledge unit from the generated knowledge units for extraction of n-grams;

    deriving, by the computing system, a term vector for the knowledge unit based at least on the determined frequencies of occurrence and the determined locations of the key terms in the knowledge unit;

    identifying, by the computing system, the key terms in the term vector based at least on the frequency of occurrence of each key term in the knowledge unit;

    extracting, by the computing system, n-grams using the key terms in the term vector;

    scoring, by the computing system, each of the extracted n-grams as a function of at least a frequency of occurrence of each of the n-grams across the knowledge corpus of the enterprise; and

    adding, by the computing system, one or more of the extracted n-grams to an index based on the scoring.
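
The later steps of the claim (term vector, key terms, n-gram extraction, scoring, indexing) can be illustrated with a minimal Python sketch. It is not the patented implementation. Every name in it (`tokenize`, `term_vector`, `key_terms`, `extract_ngrams`, `build_index`, `STOPWORDS`) and both thresholds (`min_freq`, `min_score`) are hypothetical. The sketch also assumes that a knowledge unit is simply one document's text, that a term is "key" when it occurs at least `min_freq` times in the unit, and that the score is the raw count of the n-gram across the corpus; the claim only requires the score to be a function of at least that frequency.

```python
import re
from collections import Counter

# Hypothetical stopword list; the claim does not specify one.
STOPWORDS = {"a", "an", "the", "to", "of", "and", "in", "before"}


def tokenize(text):
    """Lowercase the text and split it into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def term_vector(tokens):
    """Map each non-stopword term to the list of positions where it occurs.

    The list length is the term's frequency; the entries are its locations.
    """
    vector = {}
    for position, token in enumerate(tokens):
        if token not in STOPWORDS:
            vector.setdefault(token, []).append(position)
    return vector


def key_terms(vector, min_freq=2):
    """Assumption: a term is 'key' if it occurs at least min_freq times."""
    return {term for term, positions in vector.items() if len(positions) >= min_freq}


def extract_ngrams(tokens, keys, n=2):
    """Return n-grams that contain a key term and do not start or end on a stopword."""
    grams = []
    for i in range(len(tokens) - n + 1):
        gram = tuple(tokens[i:i + n])
        if gram[0] in STOPWORDS or gram[-1] in STOPWORDS:
            continue
        if any(token in keys for token in gram):
            grams.append(gram)
    return grams


def build_index(corpus, unit, n=2, min_freq=2, min_score=2):
    """Score the unit's n-grams by corpus-wide frequency and index the ones that pass."""
    corpus_counts = Counter()
    for document in corpus:
        tokens = tokenize(document)
        corpus_counts.update(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))

    unit_tokens = tokenize(unit)
    keys = key_terms(term_vector(unit_tokens), min_freq)

    index = {}
    for gram in set(extract_ngrams(unit_tokens, keys, n)):
        score = corpus_counts[gram]  # assumption: score is the raw corpus count
        if score >= min_score:
            index[" ".join(gram)] = score
    return index


if __name__ == "__main__":
    corpus = [
        "Reset the access token before the access token expires.",
        "An access token grants API access.",
        "Billing questions go to support.",
    ]
    print(build_index(corpus, corpus[0]))  # {'access token': 3}
```

In this toy corpus, "access" and "token" each occur twice in the selected unit, so they become key terms. The bigram "access token" occurs three times across the corpus and is indexed, while "token expires" occurs only once and is dropped by the score threshold.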

  • 4 Assignments