Speech recognition method, device and system based on artificial intelligence
First Claim
1. A speech recognition method based on artificial intelligence, comprising:
- collecting speech data to be recognized in a speech recognition process at a client device;
sending uplink data stream from the client device to a server via an uplink connection to the server, wherein the uplink data stream comprises the speech data; and
receiving, at the client device, downlink data stream sent by the server via a downlink connection to the server in parallel with sending, from the client device, the uplink data stream to the server, wherein the downlink data stream comprises result data, and the result data is obtained by the server performing speech recognition according to the speech data;
wherein each of a uniform resource locator (URL) of the uplink connection and a URL of the downlink connection comprises a session identification of the speech recognition process, such that the server determines a correspondence relationship between the uplink connection and the downlink connection according to the session identifications, the URL of the uplink connection being distinct from the URL of the downlink connection.
1 Assignment
0 Petitions
Accused Products
Abstract
The present disclosure provides a speech recognition method, device and system based on artificial intelligence. The method includes: collecting speech data to be recognized in a speech recognition process; sending uplink data stream to a server via an uplink connection to the server, in which the uplink data stream includes the speech data; and receiving downlink data stream sent by the server via a downlink connection to the server in parallel with sending the uplink data stream to the server, in which the downlink data stream includes result data, and the result data is obtained by the server performing speech recognition according to the speech data.
-
Citations
12 Claims
-
1. A speech recognition method based on artificial intelligence, comprising:
-
collecting speech data to be recognized in a speech recognition process at a client device; sending uplink data stream from the client device to a server via an uplink connection to the server, wherein the uplink data stream comprises the speech data; and receiving, at the client device, downlink data stream sent by the server via a downlink connection to the server in parallel with sending, from the client device, the uplink data stream to the server, wherein the downlink data stream comprises result data, and the result data is obtained by the server performing speech recognition according to the speech data; wherein each of a uniform resource locator (URL) of the uplink connection and a URL of the downlink connection comprises a session identification of the speech recognition process, such that the server determines a correspondence relationship between the uplink connection and the downlink connection according to the session identifications, the URL of the uplink connection being distinct from the URL of the downlink connection. - View Dependent Claims (2, 3, 4)
-
-
5. A speech recognition method based on artificial intelligence, comprising:
-
receiving, at a server, an uplink data stream sent by a client via an uplink connection to the client; performing, by the server, speech recognition on speech data in the uplink data stream to obtain result data; and sending, from the server, downlink data stream to the client via a downlink connection to the client in parallel with receiving uplink data stream sent by the client, wherein the downlink data stream comprises the result data; wherein sending downlink data stream to the client via the downlink connection to the client comprises; acquiring the downlink connection with a URL containing a session identification same as a session identification contained in a URL of the uplink connection, wherein the session identifications are corresponding to speech recognition processes one by one, the URL of the uplink connection being distinct from the URL of the downlink connection; and sending the downlink data stream to the client via the acquired downlink connection. - View Dependent Claims (6, 7, 8)
-
-
9. A speech recognition device based on artificial intelligence, comprising:
-
a processor; and a memory, configured to store one or more software modules executable by the processor, wherein the one or more software modules comprise; a collecting module, configured to collect speech data to be recognized in a speech recognition process at a client device; a sending module, configured to send uplink data stream from the client device to a server via an uplink connection to the server, wherein the uplink data stream comprises the speech data; and a receiving module, configured to receive, at the client device, downlink data stream sent by the server a downlink connection to the server in parallel with sending, from the client device, the uplink data stream to the server, wherein the downlink data stream comprises result data, and the result data is obtained by the server performing speech recognition according to the speech data; wherein each of a URL of the uplink connection and a URL of the downlink connection comprises a session identification of the speech recognition process, such that the server determines a correspondence relationship between the uplink connection and the downlink connection according to the session identifications, the URL of the uplink connection being distinct from the URL of the downlink connection. - View Dependent Claims (10, 11, 12)
-
Specification