Server for processing speech, control method thereof, image processing apparatus, and control method thereof
First Claim
1. A system comprising:
- a server; and
at least one image processing apparatus,wherein the server comprises;
a communicator configured to communicate with the at least one image processing apparatus;
a processor configured to;
based on an input of a speech of a user being received from the at least one image processing apparatus, control the communicator to;
transmit, to the at least one image processing apparatus, a plurality of retrieval results corresponding to the input of the speech of the user,wherein the at least one image processing apparatus comprises;
a display;
a user interface comprising;
a speech input interface configured to receive the input of a speech of a user; and
a non-speech input interface configured to receive a non-speech user input;
a communicator configured to communicate with the server; and
a processor configured to;
based on the input of the speech being received through the speech input interface,control the communicator of the at least one image processing apparatus to transmit the input of the speech to the server;
control the display to display the plurality of retrieval results to be selected, based on the plurality of retrieval results corresponding to the input of the speech being received from the server;
based on the non-speech user input being received through the non-speech input interface,identify whether the non-speech user input is related to the input of the speech of the user;
based on the non-speech user input not being related to the input of the speech of the user, perform an operation corresponding to the non-speech user input, independently of the input of the speech; and
based on the non-speech user input being related to the input of the speech of the user, perform an operation on one of the plurality of retrieval results selected by the non-speech user input, based on a result of processing the input of the speech.
1 Assignment
0 Petitions
Accused Products
Abstract
A system includes a server and an image processing apparatus, and the server is provided that includes a communication interface, a storage, and a processor. The communication interface is configured to communicate with the image processing apparatus. The storage is configured to store data. The processor may provide a result of processing a first event that includes a speech of a user to the image processing apparatus in response to the first event being received from the image processing apparatus, store a record of the first event in the storage according to processing of the first event, determine a relation between the first and second events that includes a user input by a non-speech method in response to the second event being received from the image processing apparatus, and process the second event based on the record of the first event stored in the storage in response to the relation.
-
Citations
14 Claims
-
1. A system comprising:
-
a server; and at least one image processing apparatus, wherein the server comprises; a communicator configured to communicate with the at least one image processing apparatus; a processor configured to; based on an input of a speech of a user being received from the at least one image processing apparatus, control the communicator to; transmit, to the at least one image processing apparatus, a plurality of retrieval results corresponding to the input of the speech of the user, wherein the at least one image processing apparatus comprises; a display; a user interface comprising; a speech input interface configured to receive the input of a speech of a user; and a non-speech input interface configured to receive a non-speech user input; a communicator configured to communicate with the server; and a processor configured to; based on the input of the speech being received through the speech input interface, control the communicator of the at least one image processing apparatus to transmit the input of the speech to the server; control the display to display the plurality of retrieval results to be selected, based on the plurality of retrieval results corresponding to the input of the speech being received from the server; based on the non-speech user input being received through the non-speech input interface, identify whether the non-speech user input is related to the input of the speech of the user; based on the non-speech user input not being related to the input of the speech of the user, perform an operation corresponding to the non-speech user input, independently of the input of the speech; and based on the non-speech user input being related to the input of the speech of the user, perform an operation on one of the plurality of retrieval results selected by the non-speech user input, based on a result of processing the input of the speech. - View Dependent Claims (2, 3, 4)
-
-
5. A control method of a server, the control method comprising:
-
receiving, from an image processing apparatus, an input of a speech of a user; transmitting, to the image processing apparatus, a plurality of retrieval results corresponding to the input of the speech to be displayed on the image processing apparatus; receiving, by the image processing apparatus, at least one image from the server the plurality of retrieval results corresponding to the input of the speech; displaying, on the image processing apparatus, the plurality of retrieval results to be selected on a display of the image processing apparatus; receiving a non-speech user input through a non-speech input interface of the image processing apparatus; based on the non-speech user input being received through the non-speech input interface, identifying, by the image processing apparatus, whether the non-speech user input is related to the input of the speech of the user; based on the non-speech user input not being related to the input of the speech of the user, performing, by the image processing apparatus, an operation corresponding to the non-speech user input, independently of the input of the speech; and based on the non-speech user input being related to the input of the speech of the user, performing, by the image processing apparatus, an operation on one of the plurality of retrieval results selected by the non-speech user input, based on a result of processing the input of the speech. - View Dependent Claims (6, 7, 8)
-
-
9. An image processing apparatus comprising:
-
a display; a user interface comprising; a speech input interface configured to receive an input of a speech of a user; and a non-speech input interface configured to receive a non-speech user input; a communicator configured to communicate with a server that performs retrieval processing corresponding to the input of the speech; and a processor configured to; based on the input of the speech being received through the speech input interface, control the communicator to transmit the input of the speech to the server; control the display to display a received plurality of retrieval results to be selected, based on the plurality of retrieval results corresponding to the input of the speech being received from the server; based on the non-speech user input being received through the non-speech input interface, identify whether the non-speech user input is related to the input of the speech of the user; based on the non-speech user input not being related to the input of the speech of the user, perform an operation corresponding to the non-speech user input, independently of the input of the speech; and based on the non-speech user input being related to the input of the speech of the user, perform an operation on one of the plurality of retrieval results is selected by the non-speech user input, based on a result of processing the input of the speech. - View Dependent Claims (10, 11)
-
-
12. A control method of an image processing apparatus, the control method comprising:
-
receiving an input of a speech of a user; communicating with a server that performs retrieval processing on the input of the speech; transmitting the input of the speech to the server based on the input of the speech being received; receiving from the server a plurality of retrieval results on the input of the speech; displaying the plurality of retrieval results for selection on a display of the image processing apparatus; receiving a non-speech user input through a non-speech input interface of the image processing apparatus; based on the non-speech user input being received through the non-speech input interface, identifying whether the non-speech user input is related to the input of the speech of the user; based on the non-speech user input not being related to the input of the speech of the user, performing an operation corresponding to the non-speech user input, independently of the input of the speech; and based on the non-speech user input being related to the input of the speech of the user, performing an operation on one of the plurality of retrieval results selected by the non-speech user input, based on a result of processing the input of the speech. - View Dependent Claims (13, 14)
-
Specification