Digital camera with real-time picture identification functionality
First Claim
1. A digital device including a camera subsystem with enhanced picture identification capability for generating digital photographs of target scenes having a target scene image and descriptive text generated in response to image characterization prompting messages comprising:
- an image capturing device for capturing a digital image of a target scene during a target scene capture mode, said target scene capture mode including an image capturing segment and an image characterization data capturing segment;
an audio input device for generating an audio signal indicative of a user'"'"'s description of the target scene entered during said image characterization data capturing segment substantially in real time with capturing a digital image of the target scene;
a processing subsystem for initiating target scene image characterization processing substantially in real time with the image capturing device capturing the image of said target scene during said target scene capture mode;
a storage device for storing information relating to;
a first image characterization prompting message prompting the identification of “
who”
is depicted in the target scene, a second image characterization prompting message prompting the identification of “
what”
the target scene depicts, a third image characterization prompting message prompting the identification of “
where”
the target scene is located, and a fourth image characterization prompting message prompting the identification of “
when”
the digital image of the target scene is being captured;
said processing subsystem being operatively coupled to said storage device and being configured to access a plurality of said image characterization prompting messages and to generate a plurality of prompting messages for presentation to the user to extract from the user a description of the target scene including information corresponding to a plurality of image characterization categories selected from the group consisting of;
a first image characterization category identifying “
who”
is depicted in the target scene, a second image characterization category identifying “
what”
the target scene depicts, a third image characterization category identifying “
where”
the target scene is located, and a fourth image characterization category identifying “
when”
the digital image of the target scene is being captured;
said processing system being operatively coupled to said audio input device and being configured to receive the audio signal indicative of a user'"'"'s description of the target scene for recognizing the user'"'"'s audio description of the target scene specified by a user in response to a prompting message and for generating text characterizing the target scene,said processing subsystem being configured to associate an image file with text identifying the target scene generated in response to image characterization prompting messages; and
an output generating subsystem using text generated in response to at least one prompting message for generating a digital photograph of the target scene including text identifying the target scene falling within a plurality of said image characterization categories selected from a group consisting of a first image characterization category identifying “
who”
is depicted in the target scene, a second image characterization category identifying “
what”
the target scene depicts, a third image characterization category identifying “
where”
the target scene is located, and a fourth image characterization category identifying “
when”
the digital image of the target scene is being captured.
0 Assignments
0 Petitions
Accused Products
Abstract
A unique digital camera electronics and software associates substantially in real-time the image captured with a description of a photograph during the time frame when the photograph is first taken. After a photograph is taken, a user generates an audio description of the photograph including a set of image characterization data. For example, such image characterization data may identify “who” is in the photograph, “what” the photograph depicts (e.g., the Jefferson Memorial), “where” the photograph was taken, and “when” it was taken. Such an audio input is coupled to a speech recognition and processing subsystem in which the decoded voice data is transcribed into textual data that describes the picture and that is associated with its corresponding captured image data file. Such descriptive text is displayed at, for example, a location at a desired border of the photograph.
-
Citations
19 Claims
-
1. A digital device including a camera subsystem with enhanced picture identification capability for generating digital photographs of target scenes having a target scene image and descriptive text generated in response to image characterization prompting messages comprising:
-
an image capturing device for capturing a digital image of a target scene during a target scene capture mode, said target scene capture mode including an image capturing segment and an image characterization data capturing segment; an audio input device for generating an audio signal indicative of a user'"'"'s description of the target scene entered during said image characterization data capturing segment substantially in real time with capturing a digital image of the target scene; a processing subsystem for initiating target scene image characterization processing substantially in real time with the image capturing device capturing the image of said target scene during said target scene capture mode; a storage device for storing information relating to;
a first image characterization prompting message prompting the identification of “
who”
is depicted in the target scene, a second image characterization prompting message prompting the identification of “
what”
the target scene depicts, a third image characterization prompting message prompting the identification of “
where”
the target scene is located, and a fourth image characterization prompting message prompting the identification of “
when”
the digital image of the target scene is being captured;said processing subsystem being operatively coupled to said storage device and being configured to access a plurality of said image characterization prompting messages and to generate a plurality of prompting messages for presentation to the user to extract from the user a description of the target scene including information corresponding to a plurality of image characterization categories selected from the group consisting of;
a first image characterization category identifying “
who”
is depicted in the target scene, a second image characterization category identifying “
what”
the target scene depicts, a third image characterization category identifying “
where”
the target scene is located, and a fourth image characterization category identifying “
when”
the digital image of the target scene is being captured;said processing system being operatively coupled to said audio input device and being configured to receive the audio signal indicative of a user'"'"'s description of the target scene for recognizing the user'"'"'s audio description of the target scene specified by a user in response to a prompting message and for generating text characterizing the target scene, said processing subsystem being configured to associate an image file with text identifying the target scene generated in response to image characterization prompting messages; and an output generating subsystem using text generated in response to at least one prompting message for generating a digital photograph of the target scene including text identifying the target scene falling within a plurality of said image characterization categories selected from a group consisting of a first image characterization category identifying “
who”
is depicted in the target scene, a second image characterization category identifying “
what”
the target scene depicts, a third image characterization category identifying “
where”
the target scene is located, and a fourth image characterization category identifying “
when”
the digital image of the target scene is being captured. - View Dependent Claims (2, 3, 4)
-
-
5. A digital device having a camera subsystem with enhanced picture identification capability for generating digital photographs of target scenes having a target scene image and descriptive text generated in response to image characterization prompting messages comprising:
-
an image capturing device for capturing a digital image of a target scene during a target scene capture mode, said target scene capture mode including an image capturing segment and an image characterization data capturing segment; a storage device for storing information indicative of a plurality of image characterization prompting messages selected from a group consisting of;
an image characterization prompting message prompting the identification of “
who”
is depicted in the target scene an image characterization prompting message prompting the identification of “
what”
the target scene depicts, an image characterization prompting message prompting the identification of “
where”
the target scene is located, and an image characterization prompting message prompting the identification of “
when”
the digital image of the target scene is being captured;an audio input device for generating an audio signal indicative of a user'"'"'s description of the target scene image provided in response to a prompting message; a processor subsystem operatively coupled to said storage device and said audio input device for generating a sequence of prompting messages and for presenting each prompting message to the user to prompt the user to verbally input image characterization data identifying the target scene image;
said processing system being configured to access said storage device and to generate a first prompting message and present said first prompting message to the user to verbally input image characterization data identifying the target scene image, said first prompting message requesting the user to describe the target scene to provide information selected from the group consisting of;
identifying “
who”
is depicted in the target scene, identifying “
what”
the target scene depicts, identifying “
where”
the target scene is located, and identifying “
when”
the digital image of the target scene is being captured;said processing subsystem being configured to capture a user'"'"'s verbal description of the target scene image that is provided by the user in response to said first prompting message in substantially real time with the capturing of the target scene image; said processing subsystem being configured to access said storage device to generate a second prompting message and present said second prompting message to the user to verbally input image characterization data further identifying the target scene image, said second prompting message requesting the user to describe the target scene to provide information selected from the group consisting of”
identifying “
who”
is depicted in the target scene, identifying “
what”
the target scene depicts, identifying “
where”
the target scene is located, and identifying “
when”
the digital image of the target scene is being captured, said second prompting message requesting information that is different from the information identified in said first prompting message;said processing subsystem being operatively coupled to receive the audio signal indicative of a user'"'"'s description of the target and being configured to recognize the user'"'"'s audio description of the target in response to said first prompting message and said second prompting message and for generating a text file identifying a plurality of items characterizing the target scene, said processing subsystem being configured to associate an image file with the text file identifying the target scene; and an output generation subsystem for generating a digital photograph of the target scene having a text data portion including a first text segment including information provided in response to said first prompting message and a second text segment information provided in response to said second prompting message, wherein the text data portion identifies the target scene by including text falling within a plurality of target scene image characterization categories selected from the group consisting of;
identifying “
who”
is depicted in the target scene, identifying “
what”
the target scene depicts, identifying “
where”
the target scene is located, and identifying “
when”
the digital image of the target scene is being captured. - View Dependent Claims (6, 7, 8)
-
-
9. A method of operating a digital device having a camera subsystem and a processing subsystem, said digital device providing enhanced picture identification capability by generating digital photographs of target scenes having a target scene image and descriptive text generated in response to image characterization prompting messages comprising:
-
capturing a digital image of a first target scene image during a target scene capture mode, said target scene capture mode including an image capturing segment and an image characterization data capturing segment; generating, by the processing subsystem, a first prompting message and presenting the first prompting message to a user to verbally input image characterization data identifying the first target scene image during said image characterization data capturing segment of said target scene capture mode, said first prompting message requesting the user to describe the first target scene to provide information falling within a first one of a plurality of image characterization categories selected from the group consisting of; identifying who is depicted in the first target scene, identifying what the first target scene depicts, identifying where the first target scene is located, and identifying when the digital image of the first target scene is being captured; capturing the user'"'"'s verbal description of the first target scene responding to said first prompting message in substantially real time with the capturing of a target scene image; generating, by said processing subsystem, a second prompting message and presenting the second prompting message to the user to prompt the user to verbally input image characterization data during said image characterization data capturing segment of said target scene capture mode to further identify the first target scene image, said second prompting message requesting the user to describe the target scene to provide information conforming to a second image characterization category selected from the group consisting of;
an image characterization category identifying who is depicted in the first target scene, an image characterization category identifying what the first target scene depicts, an image characterization category identifying where the first target scene is located, and an image characterization category identifying when the digital image of the first target scene is being captured, said second prompting message requesting information relating to an image characterization category that is different from the category identified in said first prompting message;capturing the user'"'"'s verbal description of the first target scene image responding to said second prompting message in substantially real time with the capturing of the first target scene image; recognizing, by said processing subsystem, the user'"'"'s description of the target scene in response to said first prompting message and said second prompting message and converting the input image characterization data captured in response to said first prompting message and said second prompting message into text; generating, by said processing subsystem, a text file identifying a plurality of items characterizing the first target scene including data captured in response to said first prompting message and said second prompting message, associating an image file with a text file identifying the first target scene, said text file including a data package having textual information generated in response to the first prompting message and the second prompting message; and generating a digital photograph of the first target scene having text thereon, said text including image characterization data identifying the first target scene and including text generated in response to the first prompting message and the second prompting message. - View Dependent Claims (10, 11, 12)
-
-
13. A method of operating a digital device having a camera subsystem with enhanced picture identification capability by generating digital photographs of target scenes having a target scene image and descriptive text generated in response to image characterization prompting messages, said digital device having a processing subsystem, and a storage device, said method comprising the steps of:
-
capturing a digital image of a first target scene image during a target scene capture mode, said target scene capture mode including an image capturing segment and an image characterization data capturing segment; accessing information from said storage device indicative of a first image characterization prompting message selected from a group consisting of;
an image characterization prompting message prompting the identification of who is depicted in the target scene, an image characterization prompting message prompting the identification of what the target scene depicts, an image characterization prompting message prompting the identification of where the target scene is located, and an image characterization prompting message prompting the identification of when the digital image of the target scene is being captured;presenting to the user, by said processing subsystem, said first prompting message to prompt the user to verbally input image characterization data identifying the first target scene image during said image characterization data capturing segment of said target scene capture mode, said first prompting message requesting the user to describe the first target scene to provide information conforming to a first image characterization category selected from the group consisting of;
a first image characterization category identifying who is depicted in the first target scene, a second image characterization category identifying what the first target scene depicts, a third image characterization category identifying where the target scene is located, and a fourth image characterization category identifying when the digital image of the first target scene is being captured;capturing a user'"'"'s verbal description of the first target scene image that is in response to said first prompting message in substantially real time with the capturing of the first target scene image; accessing from said storage device information indicative of a second image characterization prompting message selected from a group consisting of;
an image characterization prompting message prompting the identification of who is depicted in the first target scene, an image characterization prompting message prompting the identification of what the first target scene depicts, an image characterization prompting message prompting the identification of where the first target scene is located, and an image characterization prompting message prompting the identification of when the digital image of the first target scene is being captured;presenting to the user, by said processing subsystem during said image characterization data capturing segment of said target scene capture mode, said second prompting message to prompt the user to verbally input image characterization data further identifying the first target scene image, said second prompting message requesting the user to describe the first target scene to provide information conforming to a second image characterization category selected from the group consisting of;
an image characterization category identifying who is depicted in the first target scene, an image characterization category identifying what the first target scene depicts, an image characterization category identifying where the first target scene is located, and an image characterization category identifying when the digital image of the first target scene is being captured, said second prompting message requesting information relating to an image characterization category that is different from the category identified in said first prompting message;capturing a user'"'"'s verbal description of the first target scene image that is in response to said second prompting message in substantially real time with the capturing of the first target scene image; converting the input image characterization data captured in response to said first prompting message and said second prompting message into text and generating a text file identifying a plurality of items characterizing the first target scene, associating an image file of the first target scene with a text file identifying the first target scene, said text file including a data package having textual information generated in response to the first prompting message and the second prompting message; and generating a digital photograph of the first target scene having text thereon, said text including image characterization data identifying the first target scene and including text generated in response to the first prompting message and the second prompting message. - View Dependent Claims (14, 15, 16, 17, 18, 19)
-
Specification