Method and system for segmenting and identifying events in images using spoken annotations
First Claim
1. A method for automatically organizing digitized photographic images into events based on spoken annotations, where the events are useful in organizing photographic albums, said method comprising the steps of:
- providing natural-language text based on spoken annotations corresponding to a plurality of frames of photographic images;
extracting predetermined information from the natural-language text that characterizes the annotations of the images;
segmenting the images into events by examining each annotation for the presence of certain categories of information which are indicative of a boundary between events; and
identifying each event by assembling the categories of information into event descriptions,wherein the step of segmenting the images into events comprises the steps of;
assigning a strength value for the certain categories of information which are indicative of a boundary between events;
computing the evidence in favor of and against an event break with regard to a current frame by summing the strength values from the certain categories of information present for the current frame relative to a preceding frame already allocated to a current event; and
allocating the frame to a new event when the summarized strength values in favor of an event break exceed a predetermined threshold, otherwise allocating the frame to the current event.
3 Assignments
0 Petitions
Accused Products
Abstract
A method for automatically organizing digitized photographic images into events based on spoken annotations comprises the steps of: providing natural-language text based on spoken annotations corresponding to at least some of the photographic images; extracting predetermined information from the natural-language text that characterizes the annotations of the images; segmenting the images into events by examining each annotation for the presence of certain categories of information which are indicative of a boundary between events; and identifying each event by assembling the categories of information into event descriptions. The invention further comprises the step of summarizing each event by selecting and arranging the event descriptions in a suitable manner, such as in a photographic album.
-
Citations
23 Claims
-
1. A method for automatically organizing digitized photographic images into events based on spoken annotations, where the events are useful in organizing photographic albums, said method comprising the steps of:
-
providing natural-language text based on spoken annotations corresponding to a plurality of frames of photographic images; extracting predetermined information from the natural-language text that characterizes the annotations of the images; segmenting the images into events by examining each annotation for the presence of certain categories of information which are indicative of a boundary between events; and identifying each event by assembling the categories of information into event descriptions, wherein the step of segmenting the images into events comprises the steps of; assigning a strength value for the certain categories of information which are indicative of a boundary between events; computing the evidence in favor of and against an event break with regard to a current frame by summing the strength values from the certain categories of information present for the current frame relative to a preceding frame already allocated to a current event; and allocating the frame to a new event when the summarized strength values in favor of an event break exceed a predetermined threshold, otherwise allocating the frame to the current event. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13)
-
-
14. A computer program product for automatically organizing digitized photographic images into events based on spoken annotations, where the events are useful in organizing photographic albums, said computer program product comprising a computer readable storage medium having a computer program stored thereon for performing the steps of:
-
providing natural-language text based on spoken annotations corresponding to at least some of the photographic images; extracting predetermined information from the natural-language text that characterizes the annotations of the images; segmenting the images into events by examining each annotation for the presence of certain categories of information which are indicative of a boundary between events; and identifying each event by assembling the categories of information into event descriptions; wherein the step of segmenting the images into events comprises the steps of; assigning a strength value for the certain categories of information which are indicative of a boundary between events; computing the evidence in favor of and against an event break with regard to a current frame by summing the strength values from the certain categories of information present for the current frame relative to a preceding frame already allocated to a current event; and allocating the frame to a new event when the summarized strength values in favor of an event break exceed a predetermined threshold, otherwise allocating the frame to the current event. - View Dependent Claims (15)
-
-
16. A system for automatically organizing digitized photographic images into events based on spoken annotations, where the events are useful in organizing photographic albums, said system comprising:
-
an input for receiving natural-language text based on spoken annotations corresponding to a plurality of frames of photographic images; an event extraction stage for extracting predetermined information from the natural-language text that characterizes the annotations of the images; an event segmentation stage for segmenting the images into events by examining each annotation for the presence of certain categories of information which are indicative of a boundary between events; and an event identification stage for identifying each event by assembling the categories of information into event descriptions, wherein the event segmentation stage comprises a processor that; assigns a strength value for the certain categories of information which are indicative of a boundary between events; computes the evidence in favor of and against an event break with regard to a current frame by summing the strength values from the certain categories of information present for the current frame relative to a preceding frame already allocated to a current event; and allocates the frame to a new event when the summarized strength values in favor of an event break exceed a predetermined threshold, otherwise allocating the frame to the current event. - View Dependent Claims (17, 18, 19, 20, 21)
-
-
22. A method for automatically organizing digitized photographic images into events, said method comprising the steps of:
-
providing spoken annotations corresponding to a plurality of frames of photographic images; segmenting the images into events by examining each annotation for the presence of certain categories of information which are indicative of a boundary between events; and identifying each event by assembling the categories of information into event descriptions, wherein the step of segmenting the images into events comprises the steps of; assigning a strength value for the certain categories of information which are indicative of a boundary between events; computing the evidence in favor of and against an event break with regard to a current frame by summing the strength values from the certain categories of information present for the current frame relative to a preceding frame already allocated to a current event; and allocating the frame to a new event when the summarized strength values in favor of an event break exceed a predetermined threshold, otherwise allocating the frame to the current event.
-
-
23. A system for automatically organizing digitized photographic images into events based on spoken annotations, where the events are useful in organizing photographic albums, said system comprising:
-
an input for receiving spoken annotations corresponding to a plurality of frames of photographic images; an event segmentation stage for segmenting the images into events by examining each annotation for the presence of certain categories of information which are indicative of a boundary between events; and an event identification stage for identifying each event by assembling the categories of information into event descriptions, wherein the event segmentation stage comprises a processor that; assigns a strength value for the certain categories of information which are indicative of a boundary between events; computes the evidence in favor of and against an event break with regard to a current frame by summing the strength values from the certain categories of information present for the current frame relative to a preceding frame already allocated to a current event; and allocates the frame to a new event when the summarized strength values in favor of an event break exceed a predetermined threshold, otherwise allocating the frame to the current event.
-
Specification