CONFERENCE TRANSCRIPTION BASED ON CONFERENCE DATA
First Claim
1. A method comprising:
- receiving conference data from at least one of a plurality of conference participants;
sending text associated with the conference data to a speech recognition engine;
receiving a plurality of input media streams from the plurality of conference participants;
generating an output media stream from the plurality of input media streams; and
sending the output media stream to the plurality of conference participants and to the speech recognition engine,wherein the conference data includes a shared material or a conference roster.
1 Assignment
0 Petitions
Accused Products
Abstract
In one implementation, a collaboration server is a conference bridge or other network device configured to host an audio and/or video conference among a plurality of conference participants. The collaboration server sends conference data and a media stream including speech to a speech recognition engine. The conference data may include the conference roster or text extracted from documents or other files shared in the conference. The speech recognition engine updates a default language model according to the conference data and transcribes the speech in the media stream based on the updated language model. In one example, the performance of default language model, the updated language model, or both may be tested using a confidence interval or submitted for approval of the conference participant.
-
Citations
20 Claims
-
1. A method comprising:
-
receiving conference data from at least one of a plurality of conference participants; sending text associated with the conference data to a speech recognition engine; receiving a plurality of input media streams from the plurality of conference participants; generating an output media stream from the plurality of input media streams; and sending the output media stream to the plurality of conference participants and to the speech recognition engine, wherein the conference data includes a shared material or a conference roster. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
-
10. An apparatus comprising:
-
a memory storing conference data received from at least one of a plurality of conference participants; a controller configured to obtain text based on the conference data and configured to generate an output media stream from a plurality of input media streams received from the plurality of conference participants; and a communication interface configured to send the output media stream and the text to a speech recognition engine. - View Dependent Claims (11, 12, 13, 14, 15, 16, 17, 18)
-
-
19. A non-transitory computer readable storage medium comprising instructions configured to:
-
receive shared data associated with a conference from at least one of a plurality of conference participants, the shared data being other than audio data; extract text from the shared data; update a default language model based on the text; and transcribe at least a portion of a media stream from the conference using the updated language model. - View Dependent Claims (20)
-
Specification