Speech recognition method and apparatus using device information
First Claim
Patent Images
1. A server comprising:
- a memory configured to store an acoustic model (AM) database for storing at least one AM, the AM database comprising a general AM and at least one device-adapted AM;
a receiver configured to obtain, from a device located outside the server, a device ID, which is encrypted, for identifying the device and speech data; and
a processor configured to, based on identifying that a device-adapted AM corresponding to the device ID is stored in the AM database, perform speech recognition on the speech data by using the device-adapted AM, and based on identifying that the device-adapted AM corresponding to the device ID is not stored in the AM database, perform speech recognition on the speech data by using the general AM, and output a result of the speech recognition,wherein the memory is further configured to store a usage log database for storing a usage data log that includes the result of the speech recognition and the speech data corresponding to the result of the speech recognition,wherein the processor is further configured to select a device ID that needs device adaptation, by monitoring the usage log database, remove speech data that is unnecessary for the device adaptation from the usage data log corresponding to the selected device ID, and generate a device-adapted AM corresponding to the selected device ID, by using speech data of the usage data log from which the speech data that is unnecessary is removed,wherein the processor is further configured to identify that the speech data is unnecessary based on a modification record of the result of the speech recognition, andwherein the modification record of the result of the speech recognition includes a more reliable result obtained by user'"'"'s modification upon identifying that a more reliable result of the speech recognition exists.
1 Assignment
0 Petitions
Accused Products
Abstract
A speech recognition method includes: storing at least one acoustic model (AM); obtaining, from a device located outside the ASR server, a device ID for identifying the device; obtaining speech data from the device; selecting an AM based on the device ID; performing speech recognition on the speech data by using the selected AM; and outputting a result of the speech recognition.
37 Citations
8 Claims
-
1. A server comprising:
-
a memory configured to store an acoustic model (AM) database for storing at least one AM, the AM database comprising a general AM and at least one device-adapted AM; a receiver configured to obtain, from a device located outside the server, a device ID, which is encrypted, for identifying the device and speech data; and a processor configured to, based on identifying that a device-adapted AM corresponding to the device ID is stored in the AM database, perform speech recognition on the speech data by using the device-adapted AM, and based on identifying that the device-adapted AM corresponding to the device ID is not stored in the AM database, perform speech recognition on the speech data by using the general AM, and output a result of the speech recognition, wherein the memory is further configured to store a usage log database for storing a usage data log that includes the result of the speech recognition and the speech data corresponding to the result of the speech recognition, wherein the processor is further configured to select a device ID that needs device adaptation, by monitoring the usage log database, remove speech data that is unnecessary for the device adaptation from the usage data log corresponding to the selected device ID, and generate a device-adapted AM corresponding to the selected device ID, by using speech data of the usage data log from which the speech data that is unnecessary is removed, wherein the processor is further configured to identify that the speech data is unnecessary based on a modification record of the result of the speech recognition, and wherein the modification record of the result of the speech recognition includes a more reliable result obtained by user'"'"'s modification upon identifying that a more reliable result of the speech recognition exists.
-
-
2. A speech recognition method comprising:
-
storing an acoustic model (AM) database comprising a general AM and at least one device-adapted AM; obtaining, from a device located outside a server, a device ID, which is encrypted, for identifying the device; obtaining speech data from the device; based on identifying that a device-adapted AM corresponding to the device ID is stored in the AM database, performing speech recognition on the speech data by using the device-adapted AM, and based on identifying that the device-adapted AM corresponding to the device ID is not stored in the AM database, performing speech recognition on the speech data by using the general AM; outputting a result of the speech recognition; storing a usage data log that includes the result of the speech recognition and the speech data corresponding to the result of the speech recognition in a usage log database; selecting a device ID that needs device adaptation, by monitoring the usage log database and removing speech data that is unnecessary for the device adaptation from the usage data log corresponding to the selected device ID; and generating a device-adapted AM corresponding to the selected device ID, by using speech data of the usage data log from which the speech data that is unnecessary is removed, wherein the removing the speech data that is unnecessary comprises identifying that the speech data is unnecessary based on a modification record of the result of the speech recognition, and wherein the modification record of the result of the speech recognition includes a more reliable result obtained by user'"'"'s modification upon identifying that a more reliable result of the speech recognition exists. - View Dependent Claims (3)
-
-
4. A device comprising:
-
a memory configured to store a device ID, which is encrypted, for identifying a device; and at least one processor configured to execute instructions stored in the memory to implement; an input interface configured to obtain an input of a speech for speech recognition; a speech generator configured to generate speech data by processing the speech; a transmitter configured to transmit the device ID and the speech data to an automatic speech recognition (ASR) server; and a receiver configured to obtain a result of the speech recognition performed on the speech data from the ASR server, the result of the speech recognition comprising an identification of whether a device-adapted acoustic model (AM) corresponding to the device ID is stored in an acoustic model AM database in the server, wherein the speech recognition further comprises selecting a device ID that needs device adaptation, by monitoring a usage log database, remove speech data unnecessary for the device adaptation from the usage data log corresponding to the selected device ID, and generating a device-adapted AM corresponding to the selected device ID, by using speech data of the usage data log from which the unnecessary speech data is removed, wherein the processor is further configured to identify that the speech data is unnecessary based on a modification record of the result of the speech recognition, and wherein the modification record of the result of the speech recognition includes a more reliable result obtained by user'"'"'s modification upon identifying that a more reliable result of the speech recognition exists. - View Dependent Claims (5, 6, 7, 8)
-
Specification