Speech analytics system and system and method for determining structured speech
First Claim
1. A method for generating a database index for a transcript based on automatic identification by a communication processing system of different types of structured speech in the transcript, the method comprising:
receiving, by a communication processing system, a transcript of an audio recording created by a speech analytics system;
analyzing, by the communication processing system, text in the transcript to determine repetitions within the text that are indicative of structured speech;
calculating, by the communication processing system, a duration distribution of the repetitions within the text to ascertain, by the communication processing system, whether a first segment of the transcript comprises a first type of structured speech, wherein the first type of structured speech includes interactive voice response (IVR) generated speech;
calculating, by the communication processing system, a length of the repetitions within the text to ascertain, by the communication processing system, whether a second segment of the transcript comprises a second or third type of structured speech, wherein the communication processing system determines that the second segment comprises the third type of structured speech as opposed to the second type of structured speech when the length of the repetitions found in the text is greater than a predetermined threshold, wherein the second type of structured speech includes scripts spoken by agents and the third type of speech includes figures of speech; and
generating, by the communication processing system, a database index for the transcript such that the first segment is marked in a transcript database as comprising the first type of structured speech and the second segment is marked in the transcript database as comprising the second or third type of structured speech.
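The steps recited in claim 1 can be illustrated with a minimal Python sketch. It is not the patented implementation. The `Repetition` structure, the threshold values, and the use of a small standard deviation of durations as the IVR signal are all assumptions made for illustration; the claim states only that a duration distribution is calculated and that the length threshold is predetermined. The length rule follows the claim as written: a repetition longer than the threshold is labeled as the third type.

```python
from dataclasses import dataclass
from statistics import pstdev
from typing import Dict, List


@dataclass
class Repetition:
    """A repeated phrase found in a transcript (hypothetical structure)."""
    text: str
    start_word: int         # index of the phrase's first word in the transcript
    durations: List[float]  # spoken duration in seconds, one per occurrence


# Illustrative values only; the claim does not specify them.
DURATION_SPREAD_MAX = 0.05  # seconds
LENGTH_THRESHOLD = 6        # words


def classify(rep: Repetition) -> str:
    """Return a structured-speech label for one repetition."""
    # Assumption: a recorded IVR prompt plays back identically each time,
    # so the distribution of its durations is very narrow.
    if len(rep.durations) > 1 and pstdev(rep.durations) <= DURATION_SPREAD_MAX:
        return "ivr"
    length = len(rep.text.split())
    # Per the claim: length greater than the threshold means the third type
    # (figure of speech); otherwise the second type (agent script).
    return "figure_of_speech" if length > LENGTH_THRESHOLD else "agent_script"


def build_index(reps: List[Repetition]) -> List[Dict[str, object]]:
    """Build index entries that mark each segment with its type."""
    return [
        {
            "start_word": rep.start_word,
            "length": len(rep.text.split()),
            "type": classify(rep),
        }
        for rep in reps
    ]
```

Each returned entry could be stored alongside the transcript so that a later search can skip or select segments by type; the storage layer is outside this sketch.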
Abstract
A method for converting speech to text in a speech analytics system is provided. The method includes receiving audio data containing speech made up of sounds from an audio source, processing the sounds with a phonetic module resulting in symbols corresponding to the sounds, and processing the symbols with a language module and occurrence table resulting in text. The method also includes determining a probability of correct translation for each word in the text, comparing the probability of correct translation for each word in the text to the occurrence table, and adjusting the occurrence table based on the probability of correct translation for each word in the text.
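The feedback step in the abstract, adjusting the occurrence table from per-word probabilities, might look roughly like the sketch below. The abstract does not give the table layout or the adjustment rule, so the dictionary representation, the `rate` parameter, and the moving-average update are assumptions chosen only to make the idea concrete.

```python
from typing import Dict, Iterable, Tuple


def adjust_occurrence_table(
    table: Dict[str, float],
    word_probs: Iterable[Tuple[str, float]],
    rate: float = 0.1,
) -> Dict[str, float]:
    """Return a copy of `table` nudged toward the observed word probabilities.

    `table` maps a word to a stored weight (hypothetical layout), and
    `word_probs` holds (word, probability of correct translation) pairs.
    """
    updated = dict(table)
    for word, prob in word_probs:
        # A word not yet in the table starts at its observed probability.
        current = updated.get(word, prob)
        # Compare the observed probability with the stored weight and move
        # the stored weight a fraction `rate` of the way toward it.
        updated[word] = current + rate * (prob - current)
    return updated
```

A small `rate` keeps a single low-confidence transcription from swinging the table; a larger one adapts faster to a new speaker or vocabulary.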
6 Claims
Specification