Speech recognition and transcription among users having heterogeneous protocols
Abstract
A system for facilitating free form dictation, including directed dictation and constrained recognition and/or structured transcription among users having heterogeneous protocols for generating, transcribing, and exchanging recognized and transcribed speech. The system includes a system transaction manager having a “system protocol,” to receive a speech information request from an authorized user. The speech information request is generated using a user interface capable of bi-directional communication with the system transaction manager and supporting dictation applications. A speech recognition and/or transcription engine (ASR), in communication with the system transaction manager, receives the speech information request, generates a transcribed response, and transmits the response to the system transaction manager. The system transaction manager routes the response to one or more of the users. In another embodiment, the system employs a virtual sound driver for streaming free form dictation to any ASR.
36 Citations
41 Claims
1. A system for facilitating the exchange of speech recognition and transcription among users, the system comprising:
(a) at least one system transaction manager and at least one post processing manager, both using a uniform system protocol wherein the transaction manager is i) adapted to receive a speech information request from at least one user employing a first user legacy protocol and flag the information request as requiring post processing, and, ii) configured to route a response to the speech information request to one or more users employing a second user legacy protocol, the speech information request comprised of spoken text and commands, including spoken commands, wherein the response comprises at least a transcription of spoken text and the post processed information requested, and wherein the post processing manager is configured to i) receive structured transcription from a speech recognition and/or transcription engine, ii) operate upon the transcribed response, including spoken commands in accordance with the speech information request, and, iii) route the requested response to a post processing application if specified in the speech information request;
(b) at least one application service adapter configured to provide bi-directional communication conversion between the first user legacy protocol and the uniform system protocol and between the second user legacy protocol and the uniform system protocol and capable of bi-directional communication with the system transaction manager; and
(c) at least one speech recognition and/or transcription engine communicating with the system transaction manager, wherein the speech recognition and/or transcription engine is configured to receive the flagged speech information request containing spoken text and commands, including spoken commands, from the system transaction manager to generate a transcription in response to the speech information request and to route the response comprised of transcribed spoken text and commands, including transcribed spoken commands to the post processing manager.
Dependent Claims: 2, 3, 4, 5, 6, 7, 8, 9, 10, 11
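The bi-directional conversion performed by the application service adapter in element (b) can be pictured with a minimal sketch. Everything named here is an assumption made for illustration only: the patent text above does not define any wire format, so `UniformMessage`, the `key=value;` legacy format, and the JSON legacy format are hypothetical stand-ins for the uniform system protocol and the two user legacy protocols.

```python
import json
from dataclasses import dataclass


@dataclass
class UniformMessage:
    """Hypothetical stand-in for a message in the uniform system protocol."""
    user: str
    body: str


class KeyValueAdapter:
    """Adapter for an assumed first legacy protocol: 'key=value;key=value' strings."""

    def to_uniform(self, raw: str) -> UniformMessage:
        # Split "user=alice;body=..." into a dict, ignoring empty segments.
        fields = dict(pair.split("=", 1) for pair in raw.split(";") if pair)
        return UniformMessage(user=fields["user"], body=fields["body"])

    def from_uniform(self, msg: UniformMessage) -> str:
        return f"user={msg.user};body={msg.body}"


class JsonAdapter:
    """Adapter for an assumed second legacy protocol: JSON objects."""

    def to_uniform(self, raw: str) -> UniformMessage:
        data = json.loads(raw)
        return UniformMessage(user=data["user"], body=data["body"])

    def from_uniform(self, msg: UniformMessage) -> str:
        # sort_keys makes the serialized form deterministic.
        return json.dumps({"user": msg.user, "body": msg.body}, sort_keys=True)


# A request arrives in the first legacy format and leaves in the second.
incoming = KeyValueAdapter().to_uniform("user=alice;body=dictation one")
outgoing = JsonAdapter().from_uniform(incoming)
```

In this sketch the transaction manager would only ever handle `UniformMessage` objects, which is one way to read the claim's requirement that neither user needs to know the other's protocol.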
12. A method of providing transcribed speech, including spoken text, embedded and spoken commands among users, the method comprising:
(a) generating a speech information request wherein the speech information request is comprised of spoken text and commands, including spoken commands obtained using a first user legacy protocol;
(b) routing the speech information request through a user application service adapter capable of bi-directional communication with a system transaction manager using a uniform system protocol, wherein the system transaction manager comprises at least one post processing manager;
(c) flagging information requests requiring post processing;
(d) generating a response to the speech information request using at least one speech recognition and/or transcription engine, the response comprised of a transcription of spoken text and spoken commands;
(e) routing the requested response to the post processing manager, wherein the post processing manager is configured to receive the transcription from a speech recognition and/or transcription engine;
(f) operating upon the transcribed speech, including spoken commands in accordance with the speech information request;
(g) routing the requested response to a post processing application, if designated in the speech information request;
(h) generating a post processed response wherein the response is comprised of transcribed spoken text and content in accordance with the request, including transcribed spoken commands; and
(i) routing the post processed response to a user having a second legacy protocol through a user application service adapter capable of bi-directional communication with a system transaction manager.
Dependent Claims: 13, 14, 15, 16, 17, 18
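Steps (c) through (h) of the method can be traced in a small sketch. It is an illustration under stated assumptions, not the patented implementation: the "audio" is modeled as already-segmented phrases, the engine is a stub that only tags each phrase as text or command, and the command vocabulary (`SPOKEN_COMMANDS`), the flagging rule, and the `uppercase` post-processing application are all hypothetical. The adapter steps (b) and (i) are left out.

```python
from dataclasses import dataclass
from typing import Callable, Optional

# Hypothetical spoken-command vocabulary; the claims do not enumerate one.
SPOKEN_COMMANDS = {"period": ".", "comma": ",", "new paragraph": "\n\n"}

# Hypothetical registry of post-processing applications.
POST_APPS: dict[str, Callable[[str], str]] = {"uppercase": str.upper}


@dataclass
class SpeechRequest:
    utterances: list[str]            # stand-in for audio: segmented phrases
    post_app: Optional[str] = None   # post-processing application, if designated


def needs_post_processing(req: SpeechRequest) -> bool:
    """Step (c), assumed rule: flag if commands are present or an app is named."""
    return req.post_app is not None or any(u in SPOKEN_COMMANDS for u in req.utterances)


def stub_engine(req: SpeechRequest) -> list[tuple[str, str]]:
    """Step (d): a stub engine that tags each phrase as 'command' or 'text'."""
    return [("command" if u in SPOKEN_COMMANDS else "text", u) for u in req.utterances]


def apply_commands(tokens: list[tuple[str, str]]) -> str:
    """Steps (e)-(f): the post-processing manager operates on spoken commands."""
    out = ""
    for kind, value in tokens:
        if kind == "command":
            out = out.rstrip(" ") + SPOKEN_COMMANDS[value]
        else:
            if out and not out.endswith("\n"):
                out += " "
            out += value
    return out


def handle_request(req: SpeechRequest) -> str:
    flagged = needs_post_processing(req)
    tokens = stub_engine(req)
    if not flagged:
        return " ".join(value for _, value in tokens)
    text = apply_commands(tokens)
    app = POST_APPS.get(req.post_app) if req.post_app else None  # step (g)
    return app(text) if app else text                            # step (h)
```

For example, `handle_request(SpeechRequest(["hello", "comma", "world"], post_app="uppercase"))` yields `"HELLO, WORLD"` in this sketch: the spoken command becomes punctuation, and the designated application then transforms the result.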
19. A system for facilitating the exchange of streamed speech recognition and transcription among users, the system comprising:
(a) at least one system transaction manager using a uniform system protocol, including at least one post processing manager, wherein the transaction manager is i) adapted to receive a streamed speech information request from at least one user employing a first user legacy protocol and flag the information request as requiring post processing, and, ii) configured to route a requested response to a speech information request to one or more users employing a second user legacy protocol, the speech information request comprised of spoken text and commands, including spoken commands, wherein the requested response comprises a transcription of spoken text and the post processed information requested, and wherein the post processing manager is configured to i) receive structured transcription from a speech recognition and/or transcription engine, ii) operate upon the transcribed speech, including spoken commands in accordance with the speech information request, and, iii) route the requested response to a post processing application, if designated in the speech information request;
(b) at least one application service adapter configured to provide bi-directional communication between the first user legacy protocol and the uniform system protocol, and between the second user legacy protocol and the uniform system protocol, and capable of bi-directional communication with the system transaction manager; and
(c) at least one speech recognition and/or transcription engine communicating with the system transaction manager, wherein the speech recognition and/or transcription engine is configured to receive the flagged streamed speech information request containing spoken text and commands, including spoken commands, from the system transaction manager, to generate a transcription in response to the speech information request and to route the response comprised of transcribed spoken text and transcribed spoken commands to the post processing manager.
Dependent Claims: 20, 21, 22, 23, 24, 25, 26, 27, 28, 29
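What distinguishes the streamed variant is that the engine receives the request incrementally rather than as one complete recording. The sketch below illustrates only that idea and rests on loud assumptions: the "audio" is ASCII text encoded as bytes, the engine is a stub that emits a word each time it sees a space, and `stream_chunks` and `transcribe_stream` are hypothetical names that do not come from the patent.

```python
from typing import Iterable, Iterator


def stream_chunks(audio: bytes, chunk_size: int) -> Iterator[bytes]:
    """Yield fixed-size chunks, as a streaming source might deliver them."""
    for start in range(0, len(audio), chunk_size):
        yield audio[start:start + chunk_size]


def transcribe_stream(chunks: Iterable[bytes]) -> Iterator[str]:
    """Stub streaming engine: emit each word as soon as it is complete.

    A word may straddle a chunk boundary, so the unfinished tail is kept
    in a buffer until a later chunk completes it.
    """
    buffer = b""
    for chunk in chunks:
        buffer += chunk
        # Everything before the last space is complete; the rest stays buffered.
        *complete, buffer = buffer.split(b" ")
        for word in complete:
            if word:
                yield word.decode("ascii")
    if buffer:
        yield buffer.decode("ascii")


words = list(transcribe_stream(stream_chunks(b"take two tablets daily", 5)))
```

Because both functions are generators, transcription output can begin before the last chunk has arrived, which is the property a streamed request relies on.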
30. A system for facilitating streamed speech recognition and/or structured transcription among users having heterogeneous system protocols, the system comprising:
(a) at least one system transaction manager and at least one post processing manager, both using a uniform system protocol, wherein the transaction manager is i) adapted to receive a streamed speech information request from at least one user employing a first user legacy protocol, and flag the information request requiring post processing, and, ii) configured to route a requested response to one or more users employing a second user legacy protocol, the speech information request comprised of free form dictation of speech, including spoken text and commands, including spoken commands, wherein the requested response comprises a transcription of spoken text and the post processed information requested, and, wherein the post processing manager is configured to i) receive structured transcription from a speech recognition and/or transcription engine, ii) operate upon the transcribed speech, including spoken commands in accordance with the speech information request, and, iii) route the requested response to a post processing application, if requested in the speech information request;
(b) a user interface, including an application service adapter configured to provide bi-directional conversion between the first user legacy protocol and the uniform system protocol and between the second user legacy protocol and the uniform system protocol, and, capable of bi-directional communication with the system transaction manager and supporting dictation applications, including prompts to direct user dictation in response to user system protocol commands and system transaction manager commands, the user interface being in bi-directional communication with the system transaction manager; and
(c) at least one speech recognition and/or transcription engine for constrained speech recognition, communicating with the system transaction manager, wherein the speech recognition and/or transcription engine is configured to receive the flagged speech information request containing spoken text and commands, including spoken commands from the system transaction manager, to generate a structured transcription in response to the speech information request and to route the response comprised of structured transcribed spoken text and transcribed spoken commands to the post processing manager.
Dependent Claims: 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41
Specification