Data processing system for autonomously building speech identification and tagging data
First Claim
1. A method for autonomously creating a corpus of a conversation, the method comprising:
- monitoring a conversation between one or more speakers;
identifying the spoken languages of the conversation;
identifying one or more topics being discussed within the conversation;
in response to identifying the topics being discussed, creating a plurality of metadata tags for each topic of the conversation, wherein the metadata tags include one or more of;
a description of the speakers for a portion of the conversation,a description of the languages spoken for a portion of the conversation,a summary of the topic of the conversation for a portion of the conversation,a plurality of links to other related topics of the conversation, and a plurality of links to other related topics of a previously analyzed conversation;
storing the metadata tags in a link database;
determining a spoken emotional pattern of an autonomously selected topic of the conversation;
creating a corpus of the conversation, wherein the corpus includes a text transcription of the conversation, and also includes an identification of the spoken emotional pattern and metadata tags of the conversation.
3 Assignments
0 Petitions
Accused Products
Abstract
A method, system, and computer program product for autonomously transcribing and building tagging data of a conversation. A corpus processing agent monitors a conversation and utilizes a speech recognition agent to identify the spoken languages, speakers, and emotional patterns of speakers of the conversation. While monitoring the conversation, the corpus processing agent determines emotional patterns by monitoring voice modulation of the speakers and evaluating the context of the conversation. When the conversation is complete, the corpus processing agent determines synonyms and paraphrases of spoken words and phrases of the conversation taking into consideration any localized dialect of the speakers. Additionally, metadata of the conversation is created and stored in a link database, for comparison with other processed conversations. A corpus, a transcription of the conversation containing metadata links, is then created. The corpus processing agent also determines the frequency of spoken keywords and phrases and compiles a popularity index.
283 Citations
18 Claims
-
1. A method for autonomously creating a corpus of a conversation, the method comprising:
-
monitoring a conversation between one or more speakers; identifying the spoken languages of the conversation; identifying one or more topics being discussed within the conversation; in response to identifying the topics being discussed, creating a plurality of metadata tags for each topic of the conversation, wherein the metadata tags include one or more of; a description of the speakers for a portion of the conversation, a description of the languages spoken for a portion of the conversation, a summary of the topic of the conversation for a portion of the conversation, a plurality of links to other related topics of the conversation, and a plurality of links to other related topics of a previously analyzed conversation; storing the metadata tags in a link database; determining a spoken emotional pattern of an autonomously selected topic of the conversation; creating a corpus of the conversation, wherein the corpus includes a text transcription of the conversation, and also includes an identification of the spoken emotional pattern and metadata tags of the conversation. - View Dependent Claims (2, 3, 4, 5, 6)
-
-
7. A corpus processing agent comprising:
-
a processor; a memory coupled to the processor; a audio capture device; a processing logic for; monitoring a conversation between one or more speakers; identifying the spoken languages of the conversation; identifying one or more topics being discussed within the conversation; in response to identifying the topics being discussed, creating a plurality of metadata tags for each topic of the conversation, wherein the metadata tags include one or more of; a description of the speakers for a portion of the conversation, a description of the languages spoken for a portion of the conversation, a summary of the topic of the conversation for a portion of the conversation, a plurality of links to other related topics of the conversation, and a plurality of links to other related topics of a previously analyzed conversation; storing the metadata tags in a link database; determining a spoken emotional pattern of an autonomously selected topic of the conversation; creating a corpus of the conversation, wherein the corpus includes a text transcription of the conversation, and also includes an identification of the spoken emotional pattern and metadata tags of the conversation. - View Dependent Claims (8, 9, 10, 11, 12)
-
-
13. A computer program product having a plurality of instructions embodied therein, wherein the plurality of instructions, when executed by a processing device, allows a machine to:
-
monitor a conversation between one or more speakers; identify the spoken languages of the conversation; identify one or more topics being discussed within the conversation; in response to identifying the topics being discussed, create a plurality of metadata tags for each topic of the conversation, wherein the metadata tags include one or more of; a description of the speakers for a portion of the conversation, a description of the languages spoken for a portion of the conversation, a summary of the topic of the conversation for a portion of the conversation, a plurality of links to other related topics of the conversation, and a plurality of links to other related topics of a previously analyzed conversation; store the metadata tags in a link database; determine a spoken emotional pattern of an autonomously selected topic of the conversation; create a corpus of the conversation, wherein the corpus includes a text transcription of the conversation, and also includes an identification of the spoken emotional pattern and metadata tags of the conversation. - View Dependent Claims (14, 15, 16, 17, 18)
-
Specification