System and method for multi-modal audio mining of telephone conversations
First Claim
1. A system for multi-modal audio mining of telephone data, the system comprising:
- one or more circuits and/or processors configured to;
retrieve a data record from a call database;
augment the retrieved data record with subscriber information and spatial information;
analyze the augmented data record after it has been augmented to verify that each data element of the augmented data record is formatted consistent with a relevant format of a plurality of application specific data warehouses;
map each data element within the augmented data record to the plurality of application specific data warehouses; and
provide a multi-modal query and visualization of the augmented data record to identify events of interest,wherein the augmented data record includes the spatial information, the subscriber information, a transcript of the monitored telephone conversation, a plurality of characteristics of the monitored telephone conversation and the retrieved data that includes audio of the monitored telephone conversation and metadata information of the monitored telephone conversation.
7 Assignments
0 Petitions
Accused Products
Abstract
A system and method for the automated monitoring of inmate telephone calls as well as multi-modal search, retrieval and playback capabilities for said calls. A general term for such capabilities is multi-modal audio mining. The invention is designed to provide an efficient means for organizations such as correctional facilities to identify and monitor the contents of telephone conversations and to provide evidence of possible inappropriate conduct and/or criminal activity of inmates by analyzing monitored telephone conversations for events, including, but not limited to, the addition of third parties, the discussion of particular topics, and the mention of certain entities.
364 Citations
17 Claims
-
1. A system for multi-modal audio mining of telephone data, the system comprising:
-
one or more circuits and/or processors configured to; retrieve a data record from a call database; augment the retrieved data record with subscriber information and spatial information; analyze the augmented data record after it has been augmented to verify that each data element of the augmented data record is formatted consistent with a relevant format of a plurality of application specific data warehouses; map each data element within the augmented data record to the plurality of application specific data warehouses; and provide a multi-modal query and visualization of the augmented data record to identify events of interest, wherein the augmented data record includes the spatial information, the subscriber information, a transcript of the monitored telephone conversation, a plurality of characteristics of the monitored telephone conversation and the retrieved data that includes audio of the monitored telephone conversation and metadata information of the monitored telephone conversation. - View Dependent Claims (2, 3, 4, 5, 6)
-
-
7. A system for data-mining a monitored telephone conversation, the system comprising:
-
a data transformer configured to receive data from a data source, the received data including audio of a monitored telephone conversation and metadata information of the monitored telephone conversation; a data augmenter configured to augment the received data with telephone subscriber data and spatial information data; a speech recognizer configured to transcribe the monitored telephone conversation, detect a plurality of characteristics of the monitored telephone conversation, and associate the detected plurality of characteristics with a transcript of the monitored telephone conversation; and a data mapper configured to generate a data record for the monitored telephone conversation and store the data record to a multimedia data warehouse, the data record including at least the transcript of the monitored telephone conversation, the plurality of characteristics of the monitored telephone conversation and the augmented received data that includes the audio, the metadata information, the telephone subscriber data and the spatial information data. - View Dependent Claims (8, 9, 10, 11, 12)
-
-
13. A method for multi-modal audio mining of telephone data, the method comprising:
-
retrieving a data record from a call database; augmenting the retrieved data record with subscriber information and spatial information; analyzing the augmented data record to verify that each data element of the augmented data record is formatted consistent with a relevant format of a plurality of application specific data warehouses; mapping each data element within the augmented data record to the plurality of application specific data warehouses; and providing a multi-modal query and visualization of the augmented data record to identify events of interest, wherein the augmented data record includes the spatial information, the subscriber information, a transcript of a monitored telephone conversation, a plurality of characteristics of the monitored telephone conversation and the retrieved data that includes audio of the monitored telephone conversation and metadata information of the monitored telephone conversation. - View Dependent Claims (14, 15, 16, 17)
-
Specification