SYSTEMS AND METHODS FOR IDENTIFYING CONCEPTS AND KEYWORDS FROM SPOKEN WORDS IN TEXT, AUDIO, AND VIDEO CONTENT

US 20130311181A1
Filed: 07/29/2013
Published: 11/21/2013
Est. Priority Date: 09/21/2009
Status: Abandoned Application

First Claim

Patent Images

1. A system for identifying, summarizing, and communicating topics and keywords included within spoken content, which comprises a server that is configured to:

(a) receive one or more input files containing spoken content from an external source;

(b) process the input files using speech-to-text transcription when the spoken content is formatted as a video or audio file; and

(c) apply an algorithm to the transcribed text in order to analyze the spoken content, wherein the algorithm calculates a total score for each word included within the transcribed text, wherein the total score is calculated using a plurality of metrics which comprise;

(i) a length of each word in relation to a mean length of words;

(ii) frequency of letter groups used within each word;

(iii) frequency of repetition of each word and word sequences;

(iv) a part of speech that is represented by each word; and

(v) membership of each word within a custom set of words.

View all claims

1 Assignment

Timeline View

Assignment View

0 Petitions

Accused Products

Abstract

Systems for identifying, summarizing, and communicating topics and keywords included within an input file are disclosed. The systems include a server that receives one or more input files from an external source; conducts a speech-to-text transcription (when the input file is an audio or video file); and applies an algorithm to the text in order to analyze the content therein. The algorithm calculates a total score for each word included within the text, which is calculated using a variety of metrics that include: a length of each word in relation to a mean length of words, the frequency of letter groups used within each word, the frequency of repetition of each word and word sequences, a part of speech that is represented by each word, and membership of each word within a custom set of words. The systems are further capable of generating a graphical representation of each input file, which depicts those parts of the input file that exhibit a higher total score from those that do not. In addition, the systems allow users to publish commentary—through an email interface—to such graphical representations of the input files.

27 Citations

View as Search Results

10 Claims

1. A system for identifying, summarizing, and communicating topics and keywords included within spoken content, which comprises a server that is configured to:
- (a) receive one or more input files containing spoken content from an external source;
  
  (b) process the input files using speech-to-text transcription when the spoken content is formatted as a video or audio file; and
  
  (c) apply an algorithm to the transcribed text in order to analyze the spoken content, wherein the algorithm calculates a total score for each word included within the transcribed text, wherein the total score is calculated using a plurality of metrics which comprise;
  
  (i) a length of each word in relation to a mean length of words;
  
  (ii) frequency of letter groups used within each word;
  
  (iii) frequency of repetition of each word and word sequences;
  
  (iv) a part of speech that is represented by each word; and
  
  (v) membership of each word within a custom set of words.
- View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
- - 2. The system of claim 1, wherein the system calculates a sub-score for each of the plurality of metrics, whereupon the total score is calculated as a sum of the sub-scores.
  - 3. The system of claim 1, wherein each of the plurality of metrics may be:
    - (a) multiplied by a weighting factor;
      
      (b) adjusted based on metadata contained within the input files, wherein said metadata are selected from a group consisting of intensity, loudness metrics, confidence level in transcription, clarity of word, speed of speech, location within a speech, and combinations of the foregoing;
      
      or(c) a combination of (a) and (b).
  - 4. The system of claim 3, wherein the server is further configured to generate a graphical user interface that portrays a beginning and an end of each input file or excerpt thereof.
  - 5. The system of claim 4, wherein the graphical user interface that portrays a beginning and an end of each input file visually identifies and labels segments of the input file that are correlated with a higher or lower total score that is calculated by the algorithm.
  - 6. The system of claim 5, wherein the server is operably connected to a centralized website within which a plurality of users may access, view, and publish comments to the graphical user interface that portrays a beginning and an end of each input file.
  - 7. The system of claim 5, wherein the server is operably connected to, or in communication with, a telecommunications, VOIP, or Asterisk PBX system.
  - 8. The system of claim 5, wherein the server is configured to email a defined number of users a message, which allows recipients of said email to access, view, and publish comments to a graphical user interface that portrays a beginning and an end of each input file.
  - 9. The system of claim 8, wherein the defined number of users may add or delete comments, and publish such comments to the graphical user interface, through an email interface by including commands within a reply message to said email, wherein said commands specify (i) whether a comment is being added or deleted and (ii) a numeric position within the graphical user interface where the comment should be added or deleted.
  - 10. The system of claim 9, wherein the defined number of users may add commentary to the comments published by other users through the email interface.

Specification

Resources

Litigation Campaign Assessment

Current Assignee
VoiceBase, Inc. (LivePerson Incorporated)
Original Assignee
VoiceBase, Inc. (LivePerson Incorporated)
Inventors
Bachtiger, Walter, Jannink, Jan, Blazensky, Jay

Application Number

US13/953,635
Publication Number

US 20130311181A1
Time in Patent Office

Days
Field of Search
US Class Current

704/235
CPC Class Codes

G06F 16/345   Summarisation for human users

G06F 16/685   using automatically derived...

G10L 15/26   Speech to text systems G10L...

SYSTEMS AND METHODS FOR IDENTIFYING CONCEPTS AND KEYWORDS FROM SPOKEN WORDS IN TEXT, AUDIO, AND VIDEO CONTENT

First Claim

1 Assignment

0 Petitions

Accused Products

Abstract

27 Citations

10 Claims

Specification

Use Cases

Quick Links

Others

SYSTEMS AND METHODS FOR IDENTIFYING CONCEPTS AND KEYWORDS FROM SPOKEN WORDS IN TEXT, AUDIO, AND VIDEO CONTENT

First Claim

1 Assignment

Subscription Required

Subscription Required

0 Petitions

Subscription Required

Accused Products

Subscription Required

Abstract

27 Citations

10 Claims

Specification

Subscription Required

Use Cases

Quick Links

Others