Methods and arrangements employing sensor-equipped smart phones
Abstract
The present technology concerns improvements to smart phones and related sensor-equipped systems. Some embodiments involve spoken clues, e.g., by which a user can assist a smart phone in identifying what portion of imagery captured by a smart phone camera should be processed, or identifying what type of image processing should be conducted. Some arrangements include the degradation of captured content information in accordance with privacy rules, which may be location-dependent, or based on the unusualness of the captured content, or responsive to later consultation of the stored content information by the user. A great variety of other features and arrangements are also detailed.
12 Claims
1. A method employing a portable user system having a processor, a display, at least one microphone that captures audio, and at least one image sensor for capturing imagery, the method comprising the acts:
capturing imagery with the image sensor of the portable user system, the captured imagery depicting plural physical subjects within a physical environment of a user, and presenting the captured imagery to the user on the display;
the processor selecting a first of said depicted plural physical subjects as being of likely interest to the user, in accordance with stored rule data, and indicating said processor selection of the first depicted subject by a marking presented on the display;
capturing user speech with the microphone;
sending, to a speech recognition module, audio data corresponding to the user speech, and receiving recognized user speech data corresponding thereto;
the processor employing information from the recognized user speech data as a verbal clue to help identify a second physical subject depicted within the captured imagery, different than the processor-selected first depicted subject, that is of actual interest to said user;
the processor causing said marking to move from the depiction of the first subject to the depiction of the second subject;
after receiving the recognized user speech data, performing an image processing operation concerning the second depicted subject; and
presenting on said display, as a graphical overlay on the captured imagery, a graphical indicia output, different than said marking, that depends on a result of said image processing operation.
Dependent claims: 2, 3, 4, 5, 6, 7
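The selection and re-selection acts recited in claim 1 can be illustrated with a minimal sketch. It is not drawn from the specification: the `Subject` type, the "largest subject wins" rule standing in for the stored rule data, and the keyword match standing in for the verbal clue are all hypothetical assumptions chosen only to show the marking moving from a rule-selected first subject to a speech-indicated second subject.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Subject:
    """A depicted subject; label and box are hypothetical detector outputs."""
    label: str
    box: tuple[int, int, int, int]  # (x, y, width, height) in pixels

    @property
    def area(self) -> int:
        return self.box[2] * self.box[3]


def select_by_rule(subjects: list[Subject]) -> Subject:
    """Assumed stand-in for the stored rule data: prefer the largest subject."""
    return max(subjects, key=lambda s: s.area)


def select_by_clue(subjects: list[Subject], recognized: str, current: Subject) -> Subject:
    """Return a different subject whose label occurs in the recognized speech.

    Falls back to the current selection when no other subject matches.
    """
    words = set(recognized.lower().split())
    for s in subjects:
        if s != current and s.label.lower() in words:
            return s
    return current


subjects = [
    Subject("car", (10, 40, 300, 200)),
    Subject("dog", (350, 180, 80, 60)),
    Subject("sign", (200, 5, 60, 30)),
]
first = select_by_rule(subjects)  # marking starts on the rule-selected subject
marked = select_by_clue(subjects, "look at the dog", first)  # marking moves
print(first.label, "->", marked.label)  # car -> dog
```

In this sketch the image processing operation and the graphical overlay are left out; they would run on `marked` only after the recognized speech has been applied, mirroring the order of the recited acts.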
8. A non-transitory computer readable medium containing software instructions operative to cause a user's portable computer system, equipped with a processor, a display, at least one microphone, and at least one image sensor, to perform acts including:
receiving camera imagery from the image sensor, depicting plural physical subjects within a physical environment of the user, and presenting the imagery to the user on the display;
selecting a first of said depicted plural physical subjects as being of likely interest to the user, in accordance with stored rule data, and indicating said selection of the first depicted subject by a marking presented on the display;
capturing user speech with the microphone;
sending, to a speech recognition module, audio data corresponding to the user speech, and receiving recognized user speech data corresponding thereto;
employing information from the recognized user speech data as a verbal clue to help identify a second physical subject depicted within the imagery, different than the first depicted subject, that is of actual interest to said user;
causing said marking to move from the depiction of the first subject to the depiction of the second subject;
after receiving the recognized user speech data, performing an image processing operation concerning the second depicted subject; and
presenting on said display, as a graphical overlay on the imagery, a graphical indicia output, different than said marking, that depends on a result of said image processing operation.
9. A method employing a portable user device having a display, at least one microphone that captures audio, and at least one image sensor for capturing imagery, the method comprising the acts:
(a) capturing imagery with the image sensor, the captured imagery depicting plural physical subjects within an environment of said user, and capturing user speech with the microphone;
(b) sending, to a speech recognition module, audio data corresponding to the user speech, and receiving recognized user speech data corresponding thereto;
(c) performing a verbally-clued computer-implemented cognition process, said cognition process employing information from the recognized user speech data as a verbal clue to help identify a physical subject within the captured imagery that is of interest to said user;
(d) displaying the captured imagery to the user on said display; and
(e) presenting on said display, as a graphical overlay on the captured imagery, a graphical indicia that is determined based on said verbally-clued cognition process;
wherein the presented graphical indicia varies based on said identified physical subject; and
wherein a set of stored rules establishes a priority order by which the cognition process ranks the plural subjects depicted in the captured imagery as being of probable interest to the user, wherein the recognized user speech data causes the cognition process to progress from one subject in said priority order, to a next subject in said priority order.
Dependent claims: 10, 11
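The priority-order limitation of claim 9 can be pictured as a ranked list with a cursor that recognized speech advances. The following is only a sketch under stated assumptions: the `RULE_PRIORITY` table, the `ADVANCE_WORDS` vocabulary, and the `Cognition` class are hypothetical names invented for illustration, and the claim does not say which utterances trigger the progression.

```python
from dataclasses import dataclass

# Hypothetical stored rules: a lower number means higher presumed interest.
RULE_PRIORITY = {"face": 0, "barcode": 1, "text": 2}
# Assumed utterances that reject the currently indicated subject.
ADVANCE_WORDS = {"no", "next", "other"}


@dataclass
class Cognition:
    ranked: list[str]
    index: int = 0

    @classmethod
    def from_subjects(cls, subject_types: list[str]) -> "Cognition":
        # Unknown types sort last; sorted() is stable, so ties keep capture order.
        last = len(RULE_PRIORITY)
        return cls(sorted(subject_types, key=lambda t: RULE_PRIORITY.get(t, last)))

    @property
    def current(self) -> str:
        return self.ranked[self.index]

    def on_speech(self, recognized: str) -> str:
        """Progress to the next subject in priority order when the speech asks for it."""
        wants_next = bool(set(recognized.lower().split()) & ADVANCE_WORDS)
        if wants_next and self.index < len(self.ranked) - 1:
            self.index += 1
        return self.current


c = Cognition.from_subjects(["text", "face", "barcode"])
print(c.current)            # face
print(c.on_speech("no"))    # barcode
print(c.on_speech("next"))  # text
```

The cursor stops at the last ranked subject rather than wrapping around; that is a design choice of this sketch, not something the claim language settles.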
12. A method employing a portable user device having a display, at least one microphone that captures audio, and at least one image sensor for capturing imagery, the method comprising the acts:
(a) capturing imagery with the image sensor, the captured imagery depicting plural physical subjects within an environment of said user, and capturing user speech with the microphone;
(b) sending, to a speech recognition module, audio data corresponding to the user speech, and receiving recognized user speech data corresponding thereto;
(c) performing a verbally-clued computer-implemented cognition process, said cognition process employing information from the recognized user speech data as a verbal clue to help identify a physical subject within the captured imagery that is of interest to said user;
(d) displaying the captured imagery to the user on said display; and
(e) presenting on said display, as a graphical overlay on the captured imagery, a graphical indicia that is determined based on said verbally-clued cognition process;
wherein the presented graphical indicia varies based on said identified physical subject; and
wherein:
the device indicates a first subject of possible interest to the user, by presenting an indicia on the screen indicating said first subject;
in response to first recognized user speech data, the device indicates a second subject of possible interest to the user, by changing said indicia to indicate said second subject instead of said first subject; and
in response to second recognized user speech data, the device indicates a third subject of possible interest to the user, by changing said indicia to indicate said third subject instead of said second subject;
wherein the method emulates conversation, with the user directing, the device responding, and the user further-directing.
Specification