Time ordered indexing of audio data
First Claim
Patent Images
1. A method, comprising:
- identifying attributes including one or more types of accents and one or more types of human languages from a multi-party audio information stream;
encoding each identified attribute from the audio information stream into a time ordered index, each of the identified attributes sharing a common time reference;
comparing results from different human language models at approximately the same time to generate an integrated time ordered index of the identified attributes; and
triggering an event to occur upon an identification of unique voice characteristics of a speaker in less than five seconds.
3 Assignments
0 Petitions
Accused Products
Abstract
Methods and apparatuses in which attributes including one or more types of accents and one or more types of human languages from an audio information stream are identified. Each identified attribute from the audio information stream is encoded into a time ordered index. Each of the identified attributes shares a common time reference. Different human language models are compared at approximately the same time to generate an integrated time ordered index.
224 Citations
32 Claims
-
1. A method, comprising:
-
identifying attributes including one or more types of accents and one or more types of human languages from a multi-party audio information stream; encoding each identified attribute from the audio information stream into a time ordered index, each of the identified attributes sharing a common time reference; comparing results from different human language models at approximately the same time to generate an integrated time ordered index of the identified attributes; and triggering an event to occur upon an identification of unique voice characteristics of a speaker in less than five seconds. - View Dependent Claims (2, 3)
-
-
4. A machine-readable storage medium that stores instructions, which when executed by a machine, cause the machine to perform operations comprising:
-
identifying attributes including one or more types of accents and one or more types of human languages from a multi-party audio information stream; encoding each identified attribute from the audio information stream into a time ordered index, each of the identified attributes sharing a common time reference; comparing results from different human language models at approximately the same time to generate an integrated time ordered index of the identified attributes; and correlating a first identified attribute of the information stream with a second identified attribute having a similar time code, wherein the similar time code comprises the first identified attribute possessing a start time approximately the same as the second identified attribute or an overlapping of the durations associated with the first identified and the second identified attribute. - View Dependent Claims (5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17)
-
-
18. An apparatus, comprising:
-
means for identifying attributes including one or more types of accents and one or more types of human languages from a multi-party audio information stream; means for encoding each identified attribute from the audio information stream into a time ordered index, each of the identified attributes sharing a common time reference; means for comparing results from different human language models at approximately the same time to generate an integrated time ordered index of the identified attributes; and means for correlating a first identified attribute of the information stream with a second identified attribute having a similar time code, wherein the similar time code comprises the first identified attribute possessing a start time approximately the same as the second identified attribute or an overlapping of the durations associated with the first identified and the second identified attribute. - View Dependent Claims (19)
-
-
20. A machine-readable storage medium that stores instructions, which when executed by a machine, cause the machine to perform operations comprising:
-
converting spoken words in an information stream to written text, the information stream containing audio information; generating a separate encoded file for every word, wherein each encoded file shares a common time; and generating a link to relevant material based upon the spoken words and synchronizing a display of the link in less than five seconds from analyzing the information stream.
-
-
21. An apparatus comprising:
-
a software engine having one or more attribute filters to detect attributes from a multi-party audio information stream, identify the attributes, and assign a time ordered indication with each of the identified attributes, the software engine having an index control module to facilitate an integrated time order indexing of the identified attributes a computer readable storage medium to store the software engine; and a manipulation module to perform operations on a first set of attributes in order to manipulate a second set of attributes. - View Dependent Claims (22, 23, 24, 25, 26)
-
-
27. An apparatus comprising:
-
a software engine having one or more attribute filters to detect attributes from a multi-party audio information stream, identify the attributes, and assign a time ordered indication with each of the identified attributes, the software engine having an index control module to facilitate an integrated time order indexing of the identified attributes a computer readable storage medium to store the software engine; and a triggering and synchronization module to dynamically trigger a link and synchronize the appearance of the link based upon a transcribed text from the information stream. - View Dependent Claims (28, 29, 30, 31, 32)
-
Specification