AUTOMATED ONTOLOGY DEVELOPMENT

US 20190325324A1
Filed: 07/01/2019
Published: 10/24/2019
Est. Priority Date: 02/06/2013
Status: Active Grant

Abstract

Systems and methods of automated ontology development include a corpus of communication data. The corpus of communication data includes communication data from a plurality of interactions and is processed. A plurality of terms are extracted from the corpus. Each term of the plurality is a plurality of words that identify a single concept within the corpus. An ontology is automatedly generated from the extracted terms.

40 Claims

1-20. Canceled

21. A method of automated ontology development for processing communication data via a computer system, wherein the ontology is a structural representation of language elements and relationships between those language elements within a domain stored in the memory of the computer system, the method comprising:
- processing a corpus of communication data, the corpus comprising communication data from a plurality of interactions;
  
  extracting a plurality of terms from the corpus, wherein each term of the plurality is a plurality of words that identify a single concept within the corpus;
  
  automatedly generating an ontology from the extracted term by at least creating two context vectors for each of the plurality of terms and comparing the context vectors for each of the plurality of terms to one another to categorize the terms into a plurality of relations, wherein a first of the two context vectors of a given term predicts terms that will appear to the left of the given term, wherein a second of the two context vectors predicts terms that will appear to the right of the given term; and
  
  storing the automatedly generated ontology in an ontology database in the memory of the computer system.
- Dependent claims:
  - 22. The method of claim 21, wherein processing the corpus further comprises:
    - receiving raw communication data; and
      
      applying a rank filter to select a portion of the raw communication data as the corpus of communication data.
  - 23. The method of claim 22, wherein the raw communication data comprises transcriptions of interactions, agent scripts, service manuals, and product manuals.
  - 24. The method of claim 21, wherein processing the corpus further comprises:
    - identifying scripts within the corpus, wherein scripts are recurring patterns of three or more words.
  - 25. The method of claim 24, wherein processing the corpus further comprises:
    - zoning the communication data to segment the communication data into meaning units.
  - 26. The method of claim 25, wherein the plurality of terms are extracted from the corpus on a meaning unit-by-meaning unit basis.
  - 27. The method of claim 21, wherein the plurality of interactions are customer service interactions and the ontology is tailored for use in analyzing customer service interactions.
  - 28. The method of claim 21, wherein the ontology comprises a plurality of terms, a plurality of relations, and a plurality of themes identified from the corpus.
  - 29. The method of claim 21, wherein the plurality of interactions is from multiple platforms.
  - 30. The method of claim 21, wherein the first of the two context vectors of a given term is a list of terms that predicts terms that will appear to the left of a given term, the second of the two context vectors is a second list of terms that predicts terms that will appear to the right of the given term, and each of the context vectors includes up to a predetermined number of potential terms in the first or second list of terms.
  - 31. The method of claim 21, wherein automatedly generating the ontology further comprises:
    - comparing the plurality of relations to one another to categorize the relations into a plurality of themes.
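
The two context vectors recited in claims 21 and 30 can be illustrated with a minimal sketch. It is not the patented implementation: it assumes each multi-word term has already been merged into a single token (for example `credit_card`), that a context vector is a capped count of the tokens immediately to the left or right of the term, and that vectors are compared by cosine similarity. The function names `context_vectors`, `cosine`, and `similarity` and the `top_k` parameter are hypothetical.

```python
from collections import Counter
from math import sqrt


def context_vectors(sentences, term, top_k=5):
    """Return (left, right) counts of the tokens adjacent to `term`.

    Assumption: adjacency counts stand in for the claim's "predicts terms
    that will appear to the left/right"; the claims do not fix a formula.
    """
    left, right = Counter(), Counter()
    for words in sentences:
        for i, word in enumerate(words):
            if word != term:
                continue
            if i > 0:
                left[words[i - 1]] += 1
            if i + 1 < len(words):
                right[words[i + 1]] += 1
    # Keep at most top_k entries per side, echoing the "predetermined
    # number of potential terms" in claim 30.
    return (Counter(dict(left.most_common(top_k))),
            Counter(dict(right.most_common(top_k))))


def cosine(a, b):
    """Cosine similarity of two sparse count vectors (0.0 if either is empty)."""
    dot = sum(a[k] * b[k] for k in a.keys() & b.keys())
    norm = sqrt(sum(v * v for v in a.values())) * sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0


def similarity(sentences, term_a, term_b):
    """Average the left-vector and right-vector similarity of two terms."""
    left_a, right_a = context_vectors(sentences, term_a)
    left_b, right_b = context_vectors(sentences, term_b)
    return (cosine(left_a, left_b) + cosine(right_a, right_b)) / 2


sentences = [
    ["my", "credit_card", "expired"],
    ["my", "debit_card", "expired"],
    ["the", "agent", "called"],
]
print(similarity(sentences, "credit_card", "debit_card"))
print(similarity(sentences, "credit_card", "agent"))
```

Under these assumptions, terms that share left and right neighbors score high and could be grouped into the same relation, while terms with disjoint contexts score zero.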

32. A method of automated ontology development, the method comprising:
- processing a corpus of communication data, the corpus comprising communication data from a plurality of interactions, by zoning the communication data to segment the communication data into a plurality of meaning units;
  
  extracting a plurality of terms from each of the plurality of meaning units, wherein each term of the plurality is a plurality of words that identify a single concept within the corpus;
  
  automatedly generating an ontology that comprises the extracted terms by at least creating two context vectors for each of the plurality of terms and comparing the context vectors for each of the plurality of terms to one another to categorize the terms into a plurality of relations, wherein a first of the two context vectors of a given term predicts terms that will appear to the left of the given term, wherein a second of the two context vectors predicts terms that will appear to the right of the given term; and
  
  storing the automatedly generated ontology in an ontology database.
- Dependent claims:
  - 33. The method of claim 32, wherein processing the corpus further comprises:
    - receiving raw communication data; and
      
      applying a rank filter to select a portion of the raw communication data as the corpus of communication data.
  - 34. The method of claim 33, wherein the rank filter selects data files from the raw communication data that include a threshold of identified related terms to the domain of the ontology that is to be developed.
  - 35. The method of claim 34, wherein the raw communication data comprises interaction data from the interactions from multiple platforms including interactions made via one or more of by phone, email, internet chat, text message, web page comment, social media interaction, customer surveys, an audio recording, streaming audio, a transcription of spoken content, or written correspondence.
  - 36. The method of claim 32, wherein automatedly generating the ontology further comprises:
    - comparing the plurality of relations to one another to categorize the relations into a plurality of themes.
  - 37. The method of claim 36, wherein the ontology further comprises the plurality of relations and the plurality of themes.
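
The preprocessing steps in claims 32 to 34 (rank filtering raw data, then zoning it into meaning units) can be sketched as follows. This is an illustration under stated assumptions, not the method disclosed in the specification: the rank filter is assumed to count how many distinct domain terms occur in a file, and zoning is approximated by splitting on sentence-ending punctuation. The names `rank_filter`, `zone`, `domain_terms`, and `threshold` are hypothetical.

```python
import re


def rank_filter(documents, domain_terms, threshold=2):
    """Keep documents that mention at least `threshold` distinct domain terms.

    Assumptions: `domain_terms` are lowercase, and plain substring matching
    is good enough for a sketch (so "bill" would also match "billing").
    """
    selected = []
    for doc in documents:
        text = doc.lower()
        hits = sum(1 for term in domain_terms if term in text)
        if hits >= threshold:
            selected.append(doc)
    return selected


def zone(text):
    """Split text into meaning units.

    Assumption: sentence punctuation marks the boundary of a meaning unit;
    the claims do not say how boundaries are found.
    """
    return [unit.strip() for unit in re.split(r"[.?!]+", text) if unit.strip()]


raw = [
    "I want to cancel my account. The billing was wrong!",
    "Nice weather today.",
]
corpus = rank_filter(raw, ["account", "billing", "refund"], threshold=2)
units = [unit for doc in corpus for unit in zone(doc)]
print(units)
```

Terms would then be extracted from each element of `units` separately, matching the meaning unit-by-meaning unit extraction in claim 32.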

38. A system for automated ontology development, the system comprising:
- a communication data database populated with communication data;
  
  a processor communicatively connected to the database of communication data and communicatively connected to a computer readable medium programmed with computer readable code that upon execution by the processor causes the processor to;
  
  process a corpus of communication data received from the database;
  
  extract a plurality of terms from the corpus, wherein each term of the plurality is a plurality of words that identify a single concept within the corpus; and
  
  automatedly generate an ontology from the extracted terms by at least creating two context vectors for each of the plurality of terms and comparing the context vectors for each of the plurality of terms to one another to categorize the terms into a plurality of relations, wherein a first of the two context vectors of a given term predicts terms that will appear to the left of the given term, wherein a second of the two context vectors predicts terms that will appear to the right of the given term; and
  
  an ontology database upon which the processor stores the automatedly generated ontology.
- Dependent claims:
  - 39. The system of claim 38, wherein the communication data comprises transcriptions of interactions, agent scripts, service manuals, and product manuals.
  - 40. The system of claim 38, further comprising:
    - a script database communicatively connected to the processor; and
      
      wherein execution of the computer readable code by the processor further causes the processor to;
      
      surface a plurality of scripts from the communication data;
      
      store the plurality of scripts at the script database; and
      
      apply the plurality of scripts from the script database to the corpus of communication data to identify scripts within the corpus of communication data.
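
Claim 24 defines scripts as recurring patterns of three or more words, and claim 40 surfaces, stores, and applies them. A minimal sketch of that flow is given below. It assumes fixed-length word trigrams, whitespace tokenization, and an in-memory Python set standing in for the script database; `surface_scripts`, `find_scripts`, `n`, and `min_count` are hypothetical names, not taken from the patent.

```python
from collections import Counter


def surface_scripts(utterances, n=3, min_count=2):
    """Return the word n-grams that recur in at least `min_count` utterances."""
    counts = Counter()
    for utterance in utterances:
        words = utterance.lower().split()
        # A set counts each n-gram at most once per utterance.
        grams = {tuple(words[i:i + n]) for i in range(len(words) - n + 1)}
        counts.update(grams)
    return {gram for gram, count in counts.items() if count >= min_count}


def find_scripts(utterance, scripts, n=3):
    """Return the start positions where a stored script occurs in `utterance`."""
    words = utterance.lower().split()
    return [i for i in range(len(words) - n + 1)
            if tuple(words[i:i + n]) in scripts]


utterances = [
    "thank you for calling acme",
    "thank you for your patience",
    "how can i help",
]
script_db = surface_scripts(utterances)  # stand-in for the script database
print(script_db)
print(find_scripts("well thank you for waiting", script_db))
```

Longer scripts could be handled by repeating the pass with larger `n`; the sketch stops at the three-word minimum named in claim 24.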

Current Assignee: Verint Systems Incorporated
Original Assignee: Verint Systems Limited (Verint Systems Incorporated)
Inventors: Romano, Roni; Horesh, Yair; Dreyfuss, Jeremie

Granted Patent: US 10,679,134 B2

CPC Class Codes

G06F 16/367 Ontology

G06N 5/022 Knowledge engineering; Know...
