Image manipulation
First Claim
Patent Images
1. An image manipulation apparatus comprising:
- means for reproducing an image;
a speech recognition user interface for allowing a user to input a speech signal comprising a description of a desired change to be made to the reproduced image;
means for interpreting a recognition result output from the speech recognition interface; and
changing means responsive to the interpreting means for changing the colour of one or more parts of the reproduced image in order to affect a manipulation desired by the user;
wherein said description comprises a number of continuously spoken words;
wherein said speech recognition user interface comprises;
a memory for storing a plurality of reference word models, each representative of a word, and for storing a language model which defines sequences of the reference word models which can be matched with the input speech signal, in order to define input speech commands;
matching means for matching the input speech signal with selected sequences of said word models, selected in accordance with the stored language model;
recognition means, responsive to said matching means, for providing a recognition result based upon a likely sequence of reference models that corresponds to an input utterance;
receive means for receiving a new input speech command comprising two or more whole words;
means for generating a word model for each of the words contained within the new input speech command, if they do not already exist; and
means for adapting said language model to incorporate said new input speech command.
1 Assignment
0 Petitions
Accused Products
Abstract
An apparatus for manipulating the colour of an image is provided, having a microphone for providing electrical speech signals representative of a user command, a speech recognition unit for recognizing the input speech signal, a command interpreter for interpreting the recognized speech, a graphics package responsive to the command interpreter and a display for displaying the current image being edited. The apparatus accepts other inputs, for example, from a pointing device.
-
Citations
28 Claims
-
1. An image manipulation apparatus comprising:
-
means for reproducing an image; a speech recognition user interface for allowing a user to input a speech signal comprising a description of a desired change to be made to the reproduced image; means for interpreting a recognition result output from the speech recognition interface; and changing means responsive to the interpreting means for changing the colour of one or more parts of the reproduced image in order to affect a manipulation desired by the user; wherein said description comprises a number of continuously spoken words; wherein said speech recognition user interface comprises; a memory for storing a plurality of reference word models, each representative of a word, and for storing a language model which defines sequences of the reference word models which can be matched with the input speech signal, in order to define input speech commands; matching means for matching the input speech signal with selected sequences of said word models, selected in accordance with the stored language model; recognition means, responsive to said matching means, for providing a recognition result based upon a likely sequence of reference models that corresponds to an input utterance; receive means for receiving a new input speech command comprising two or more whole words; means for generating a word model for each of the words contained within the new input speech command, if they do not already exist; and means for adapting said language model to incorporate said new input speech command. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11)
-
-
12. A method of manipulating an image comprising the steps of:
-
reproducing an image; using a speech recognition user interface for allowing a user to input a speech signal comprising a description of a desired change to be made to the reproduced image; interpreting a recognition result output from the speech user recognition interface; and changing a colour of one or more parts of the reproduced image in response to the interpreting step, in order to effect the colour manipulation desired by a user; wherein said description comprises a number of continuously spoken words; wherein said speech recognition user interface performs the steps of; i) matching the input speech signal with selected sequences of word models, selected in accordance with a stored language model by using a memory storing a plurality of reference word models, each representative of a word, and a language model which defines sequences of the reference word models which can be matched with the input speech signal, in order to define input speech commands; and ii) providing, in response to the matching step, a recognition result based upon a likely sequence of reference models that corresponds of an input utterance; and wherein said language model is adaptable by; (a) receiving a new input speech command comprising two or more whole words; (b) generating a word model for each of the words contained within the new input speech command, if they do not already exist; and (c) adapting the language model to incorporate the new input speech command. - View Dependent Claims (13, 14, 15, 16, 17, 18)
-
-
19. A computer readable medium storing computer executable process steps to allow image manipulation, the process steps comprising the steps of:
-
reproducing an image; using a speech recognition user interface for allowing a user to input a speech signal comprising a description of a desired change to be made to the reproduced image; interpreting a recognition result output from the speech user recognition interface; and changing the colour of one or more parts of the reproduced image in response to said interpreting step, in order to effect a manipulation desired by a user; wherein said description comprises a number of continuously spoken words; wherein said speech recognition user interface performs the step of; matching the input speech signal with selected sequences of word models, selected in accordance with a stored language model by using a memory storing a plurality of reference word models, each representative of a word, and a language model which defines sequences of the reference word models which can be matched with the input speech signal, in order to define input speech commands; and providing, in response to the matching step, a recognition result based upon a likely sequence of reference models that corresponds to an input utterance; and wherein said language model is adaptable by; (a) receiving a new input speech command comprising two or more whole words; (b) generating a word model for each of the words contained within the new input speech command, if they do not already exist; and (c) adapting the language model to incorporate the new input speech command. - View Dependent Claims (20, 21, 22, 23, 24, 25)
-
-
26. An image manipulation apparatus comprising:
-
a speech recognition user interface for allowing a user to input a speech signal of a command comprising a number of continuously spoken words; means for interpreting a recognition result output from the speech recognition interface; and means responsive to the interpreting means for executing a function corresponding to the command; wherein said speech recognition user interface uses a memory for storing a plurality of reference word models, each representative of a word, and for storing a language model which defines sequences of the reference word models which can be matched with the input speech signal, in order to define input speech commands and comprises; matching means for matching the input speech signal with selected sequences of said word models, selected in accordance with the stored language model; recognition means, responsive to said matching means, for providing a recognition result based upon a likely sequence of reference models that corresponds to an input utterance; receive means for receiving a new input speech command comprising two or more whole words; means for generating a word model for each of the words contained within the new input speech command, if they do not already exist; and means for adapting said language model to incorporate said new input speech command.
-
-
27. A method of manipulating an image comprising the steps of:
-
using a speech recognition user interface for allowing a user to input a speech signal of a command comprising a number of continuously spoken words; interpreting a recognition result output from the speech recognition interface; and executing a function corresponding to the command in response to the interpreting step; wherein said speech recognition user interface uses a memory for storing a plurality of reference word models, each representative of a word, and for storing a language model which defines sequences of the reference word models which can be matched with the input speech signal, in order to define input speech commands and matches the input speech signal with selected sequences of said word models, selected in accordance with the stored language model; and
provides in response to said matching step a recognition result based upon a likely sequence of reference models that corresponds to an input utterance; andwherein the method further comprises the steps of; receiving a new input speech command comprising two or more whole words; generating a word model for each of the words contained within the new input speech command, if they do not already exist; and adapting said language model to incorporate said new input speech command.
-
-
28. A computer readable medium storing computer executable process steps to allow image manipulation, the process steps comprising the steps of:
-
using a speech recognition user interface for allowing user to input a speech signal of a command comprising a number of continuously spoken words; interpreting a recognition result output from the speech recognition interface; and executing a function corresponding to the command in response to the interpreting step; wherein said speech recognition user interface uses memory for storing a plurality of reference word models, each representative of a word, and for storing a language model which defines sequences of the reference word models which can be matched with the input speech signal, in order to define input speech commands and matches the input speech signal with selected sequences of said word models, selected in accordance with the stored language model; and
provides, in response to said matching step a recognition result based upon a likely sequence of reference models that corresponds to an input utterance; andwherein the process steps further comprise the steps of; receiving a new input speech command comprising two or more whole words; generating a word model for each of the words contained within the new input speech command, if they do not already exist; and adapting said language model to incorporate said new input speech command.
-
Specification