Apparatus and method for providing voice recognition for multiple speakers
Abstract
An individual voice recognition unit is used for each speaker in a conference to automatically transcribe that speaker's contribution to the conference. The outputs of the voice recognition units are then merged on a time basis to produce a textual transcription of the entire telecommunication conference call.
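The time-based merge described in the abstract can be illustrated with a minimal sketch. It is not taken from the patent: the `Utterance` record, the `merge_streams` and `format_transcript` helpers, and the use of seconds from the start of the conference as the time stamp are all assumptions made for illustration. The sketch also assumes that each per-speaker stream is already in time order.

```python
import heapq
from typing import Iterable, List, NamedTuple


class Utterance(NamedTuple):
    """One recognized fragment of text (hypothetical record layout)."""
    timestamp: float  # assumed unit: seconds since the start of the conference
    speaker: str
    text: str


def merge_streams(streams: Iterable[List[Utterance]]) -> List[Utterance]:
    """Combine per-speaker streams into one list ordered by time stamp.

    Each input stream must already be sorted by time stamp; heapq.merge
    then interleaves them without re-sorting the whole conference.
    """
    return list(heapq.merge(*streams, key=lambda u: u.timestamp))


def format_transcript(utterances: Iterable[Utterance]) -> str:
    """Render the combined text, one line per utterance."""
    return "\n".join(
        f"[{u.timestamp:.1f}s] {u.speaker}: {u.text}" for u in utterances
    )


if __name__ == "__main__":
    alice = [Utterance(0.0, "Alice", "Hello."), Utterance(5.2, "Alice", "Agreed.")]
    bob = [Utterance(2.5, "Bob", "Hi, shall we start?")]
    print(format_transcript(merge_streams([alice, bob])))
```

Merging presorted streams keeps the cost proportional to the total number of utterances times the logarithm of the number of speakers, which is one reasonable design choice; a plain sort of the concatenated streams would give the same result.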
19 Claims
1. A method for converting all audio information from a plurality of participants to combined textual information, comprising the steps of:
performing independent voice recognition operations on audio information from each of the plurality of participants thereby creating a stream of textual information for each of the plurality of participants;
inserting time stamps into each stream of textual information; and
combining the streams of textual information and time stamps into combined textual information on the basis of the time stamps in each of the plurality of streams of textual information whereby the combined textual information is textual information of the all audio information.
Dependent claims: 2, 3, 4, 5, 6, 7, 8
9. A processor-readable medium comprising processor-executable instructions configured for:
performing independent voice recognition operations on audio information from each of the plurality of participants thereby creating a stream of textual information for each of the plurality of participants;
inserting time stamps into each stream of textual information; and
combining the streams of textual information and time stamps into combined textual information on the basis of the time stamps in each of the plurality of streams of textual information whereby the combined textual information is textual information of the audio.
Dependent claims: 10, 11, 12, 13, 14, 15, 16
17. An apparatus for converting all audio information from a plurality of participants to combined information, comprising:
a plurality of voice recognition units each assigned to one of the plurality of participants whereby audio information from each assigned participant is processed to produce a textual stream with time stamps; and
a controller responsive to the textual streams with time stamps for combining the streams into combined textual information.
Dependent claims: 18, 19
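The arrangement of claim 17 can be pictured with a short sketch in which one recognition unit per participant stamps its text and a controller keeps the combined text in time order as entries arrive. Everything named here is hypothetical: `RecognitionUnit`, `Controller`, the `recognize` callable standing in for a real voice recognition engine, and the `captured_at` time stamp supplied by the caller are illustrative assumptions, not elements disclosed by the patent.

```python
import bisect
from dataclasses import dataclass, field
from typing import Callable, List, Tuple

# (time stamp in seconds, participant, text); tuples compare by time stamp first.
Entry = Tuple[float, str, str]


@dataclass
class Controller:
    """Keeps the combined text in time order as stamped entries arrive."""
    combined: List[Entry] = field(default_factory=list)

    def receive(self, entry: Entry) -> None:
        # insort places the entry at its sorted position, so entries may
        # arrive from different units in any order.
        bisect.insort(self.combined, entry)


@dataclass
class RecognitionUnit:
    """One unit per participant; `recognize` is a hypothetical stand-in."""
    participant: str
    recognize: Callable[[bytes], str]
    controller: Controller

    def process(self, audio: bytes, captured_at: float) -> None:
        text = self.recognize(audio)
        if text:  # skip audio for which nothing was recognized
            self.controller.receive((captured_at, self.participant, text))


if __name__ == "__main__":
    # A fake recognizer that just decodes bytes, used only for demonstration.
    fake = lambda audio: audio.decode("ascii")
    controller = Controller()
    unit_a = RecognitionUnit("A", fake, controller)
    unit_b = RecognitionUnit("B", fake, controller)
    unit_a.process(b"first", 1.0)
    unit_a.process(b"third", 4.0)
    unit_b.process(b"second", 2.5)
    for entry in controller.combined:
        print(entry)
```

Inserting each entry at its sorted position lets the controller accept text from the units as it is produced, rather than waiting for the conference to end before combining the streams.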
Specification