Electronic device for performing operation corresponding to voice input
First Claim
1. An electronic device comprising:
- a housing;
a speaker disposed in a first area of the housing;
a microphone disposed in a second area of the housing;
a display disposed in a third area of the housing;
a communication circuit disposed inside the housing or attached to the housing;
at least one processor disposed inside the housing and operatively connected to the speaker, the microphone, the display, and the communication circuit; and
a memory disposed inside the housing, operatively connected to the at least one processor, and configured to store a plurality of application programs and instructions,wherein the instructions, when executed, cause the at least one processor to;
receive, through the microphone, a user utterance including a request to perform a task by using at least one of the plurality of application programs,transmit, through the communication circuit, first data associated with the user utterance to an external server,receive, from the external server through the communication circuit, second data including a text generated by an automatic speech recognition (ASR) module,in response to receiving the second data including the text, identify the at least one of the plurality of application programs based on at least part of the second data including the text,prepare to execute the identified at least one of the plurality of application programs while the external server creates or determines a sequence of states of the electronic device,after identifying the at least one of the plurality of application programs, receive, from the external server through the communication circuit, a first response including information associated with the sequence of states of the electronic device to perform at least part of the task, andin response to receiving the first response, perform the at least part of the task by controlling the electronic device to enter the sequence of the states by using the at least one of the plurality of application programs.
1 Assignment
0 Petitions
Accused Products
Abstract
An electronic device is provided. The electronic device includes a housing, a speaker, a microphone, a display, a communication circuit, at least one processor operatively connected to the speaker, the microphone, the display, and the communication circuit, and a memory operatively connected to the at least one processor, and storing instructions to cause the at least one processor to receive a first user utterance, including a request to perform a task by using an application program, through the microphone, transmit first data associated with the first user utterance to an external server, receive second data including a text generated by an automatic speech recognition (ASR) module, from the external server, identify the application program based on the second data, receive a first response including information of a sequence of states of the electronic device from the external server to perform at least part of the task, and perform the at least part of the task by controlling the electronic device to enter the sequence of the states by using the application program.
42 Citations
20 Claims
-
1. An electronic device comprising:
-
a housing; a speaker disposed in a first area of the housing; a microphone disposed in a second area of the housing; a display disposed in a third area of the housing; a communication circuit disposed inside the housing or attached to the housing; at least one processor disposed inside the housing and operatively connected to the speaker, the microphone, the display, and the communication circuit; and a memory disposed inside the housing, operatively connected to the at least one processor, and configured to store a plurality of application programs and instructions, wherein the instructions, when executed, cause the at least one processor to; receive, through the microphone, a user utterance including a request to perform a task by using at least one of the plurality of application programs, transmit, through the communication circuit, first data associated with the user utterance to an external server, receive, from the external server through the communication circuit, second data including a text generated by an automatic speech recognition (ASR) module, in response to receiving the second data including the text, identify the at least one of the plurality of application programs based on at least part of the second data including the text, prepare to execute the identified at least one of the plurality of application programs while the external server creates or determines a sequence of states of the electronic device, after identifying the at least one of the plurality of application programs, receive, from the external server through the communication circuit, a first response including information associated with the sequence of states of the electronic device to perform at least part of the task, and in response to receiving the first response, perform the at least part of the task by controlling the electronic device to enter the sequence of the states by using the at least one of the plurality of application programs. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. An electronic device comprising:
-
a housing; a speaker disposed in a first area of the housing; a microphone disposed in a second area of the housing; a display disposed in a third area of the housing; a communication circuit disposed inside the housing or attached to the housing; at least one processor disposed inside the housing and operatively connected to the speaker, the microphone, the display, and the communication circuit; and a memory disposed inside the housing, operatively connected to the at least one processor, and configured to store a plurality of application programs and instructions, wherein the instructions, when executed, cause the at least one processor to; receive, through the microphone, a user utterance, transmit, through the communication circuit, the user utterance to an external server, receive, through the communication circuit from the external server, text data corresponding to the user utterance, in response to receiving the text data, identify a first application program of the plurality of application programs based on at least part of the text data, in response to identifying the first application program, establish communication between the first application program and a control program capable of executing the plurality of application programs while the external server creates or determines a sequence of states of the electronic device, after establishing the communication between the first application program and the control program, receive, through the communication circuit from the external server, the sequence of states of the electronic device, compare the first application program and a second application program executable based on the sequence of states, and execute the first application program or the second application program based on a result of the comparing. - View Dependent Claims (9, 10, 11, 12, 13, 14, 15)
-
-
16. An electronic device comprising:
-
a housing; a speaker disposed in a first area of the housing; a microphone disposed in a second area of the housing; a display disposed in a third area of the housing; a communication circuit disposed inside the housing or attached to the housing; at least one processor disposed inside the housing and operatively connected to the speaker, the microphone, the display, and the communication circuit; and a memory disposed inside the housing, operatively connected to the at least one processor, and configured to store a plurality of application programs and instructions, wherein the instructions, when executed, cause the at least one processor to; receive, through the microphone, a first user utterance including a request to perform a task by using at least one of the plurality of application programs, in response to receiving the user utterance, compare the first user utterance with a second user utterance stored in the memory, and when a result of the comparing indicates that a degree of matching between the first user utterance and the second user utterance is at a threshold level, perform at least part of the task based on a sequence of states of the electronic device corresponding to the second user utterance. - View Dependent Claims (17, 18, 19, 20)
-
Specification