System and method for providing remote automatic speech recognition services via a packet network
First Claim
Patent Images
1. A method of operating an automatic speech recognition service remotely accessible by a client over a packet network, comprising the steps of:
- a. receiving from the client over the packet network information corresponding to a grammar used for speech recognition;
b. receiving from the client over the packet network information representing speech;
c. recognizing the received speech information by applying an automatic speech recognition algorithm in accordance with the grammar; and
d. sending information based upon the recognized speech over the packet network to the client.
4 Assignments
0 Petitions
Accused Products
Abstract
A system and method of operating an automatic speech recognition service using a client-server architecture is used to make ASR services accessible at a client location remote from the location of the main ASR engine. The present invention utilizes client-server communications over a packet network, such as the Internet, where the ASR server receives a grammar from the client, receives information representing speech from the client, performs speech recognition, and returns information based upon the recognized speech to the client.
289 Citations
66 Claims
-
1. A method of operating an automatic speech recognition service remotely accessible by a client over a packet network, comprising the steps of:
-
a. receiving from the client over the packet network information corresponding to a grammar used for speech recognition;
b. receiving from the client over the packet network information representing speech;
c. recognizing the received speech information by applying an automatic speech recognition algorithm in accordance with the grammar; and
d. sending information based upon the recognized speech over the packet network to the client. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17)
-
-
18. A system for operating an automatic speech recognition service remotely accessible by a client over a packet network, comprising:
-
a. as programmable processor;
b. memory;
c. an audio input device; and
d. a communications interface for establishing a communications link with the client over the packet network;
wherein said processor is programmed to execute a series of steps comprising;
i. receiving from the client over the packet network information corresponding to a grammar used for speech recognition;
ii. receiving from the client over the packet network information representing speech;
iii. recognizing the received speech information by applying an automatic speech recognition algorithm in accordance with the grammar; and
iv. sending information based upon the recognized speech over the packet network to the client. - View Dependent Claims (19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34)
-
-
35. An article of manufacture, comprising a computer-readable medium having stored thereon instructions for operating an automatic speech recognition service remotely accessible by a client over a packet network, said instructions which, when performed by a processor, cause the processor to execute a series of steps comprising:
-
a. receiving from the client over the packet network information corresponding to a grammar used for speech recognition;
b. receiving from the client over the packet network information representing speech;
c. recognizing the received speech information by applying an automatic speech recognition algorithm in accordance with the grammar; and
d. sending information based upon the recognized speech over the packet network to the client. - View Dependent Claims (36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51)
-
-
52. A method for recognizing speech over a packet network, comprising:
-
receiving a grammar from a client;
receiving a plurality of digitized speech packets from the client;
processing the plurality of digitized speech packets using a speech recognition algorithm and the grammer to produce a textual representation of the plurality of digtized speech packets; and
sending information based upon the textual representation to the client. - View Dependent Claims (53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66)
receiving a network address associated with the grammar from the client; and
receiving the grammar from the network address.
-
-
58. The method of claim 57, wherein said network address comprises a uniform resource locator.
-
59. The method of claim 52, wherein said plurality of digitized speech packets comprises streaming digitized audio.
-
60. The method of claim 52, wherein said plurality of digitized speech packets comprises a plurality of compressed digitized speech packets.
-
61. The method of claim 52, wherein said plurality of digitized speech packets comprises a plurality of speech features extracted by the client.
-
62. The method of claim 52, wherein said information comprises an initial version of said textual representation.
-
63. The method of claim 52, wherein said information comprises streaming text.
-
64. The method of claim 52, wherein said information comprises a final version of said textual representation.
-
65. The method of claim 52, wherein said information comprises digitized speech packets.
-
66. The method of claim 52, further comprising the step of activating a grammar rule in response to a request received from the client.
Specification