System and method for whiteboard and audio capture
First Claim
1. A process for summarizing and indexing audiovisual content, comprising the following process actions:
capturing a sequence of images of content written on a non-electronic white board with a camera;
recording audio signals correlated with the sequence of images; and
analyzing the sequence of images to isolate key frames summarizing key points of said board content;
wherein said analyzing process action comprises the following process actions:
rectifying a view of the whiteboard in every image in the sequence of images;
extracting whiteboard background color;
dividing each image of the sequence of images into cells, each cell representing a cell image;
clustering cell images throughout the sequence of images for each cell over time;
classifying each cell image as a stroke, a foreground object or whiteboard cell; and
extracting key frame images using the classification results.
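The cell-division and classification steps recited above can be illustrated with a minimal sketch. It assumes the image has already been rectified and that the whiteboard background value is known. The function names, the grayscale representation, and the thresholds `tol` and `stroke_max_fraction` are hypothetical choices made for illustration, not details taken from the claim.

```python
def split_into_cells(image, cell_size):
    """Divide a 2-D grayscale image (list of rows) into square cells.

    Returns a dict mapping (cell_row, cell_col) to that cell's pixel values.
    """
    cells = {}
    for r, row in enumerate(image):
        for c, value in enumerate(row):
            cells.setdefault((r // cell_size, c // cell_size), []).append(value)
    return cells


def classify_cell(pixels, background, tol=30, stroke_max_fraction=0.5):
    """Label one cell image as 'whiteboard', 'stroke' or 'foreground'.

    Assumed rule (not from the patent): count the pixels that differ from
    the background by more than `tol`. None means blank board, a minority
    means pen strokes, a majority means an object is blocking the board.
    """
    off_background = sum(1 for p in pixels if abs(p - background) > tol)
    fraction = off_background / len(pixels)
    if fraction == 0:
        return "whiteboard"
    if fraction <= stroke_max_fraction:
        return "stroke"
    return "foreground"


# Tiny 4x4 frame: 255 is board white, 0 is ink, 90 stands in for a sleeve.
frame = [
    [255, 255, 90, 90],
    [255, 255, 90, 90],
    [255, 0, 255, 255],
    [255, 255, 255, 255],
]
cells = split_into_cells(frame, cell_size=2)
labels = {pos: classify_cell(px, background=255) for pos, px in cells.items()}
```

Here the top-right cell is labeled as a foreground object, the bottom-left cell as a stroke, and the remaining two cells as whiteboard.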
Abstract
A system captures both the whiteboard content and the audio signals of a meeting using a digital camera and a microphone. The system can be retrofitted to any existing whiteboard. It computes the time stamps of pen strokes on the whiteboard by analyzing the sequence of captured snapshots, and it automatically produces a set of key frames representing all the content written on the whiteboard before each erasure. The whiteboard content serves as a visual index for browsing the meeting audio efficiently. Beyond capturing the whiteboard content, the system helps users view and manage the captured meeting content efficiently and securely.
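As a rough illustration of the two outputs the abstract mentions, the sketch below derives stroke time stamps and pre-erasure key frames from per-frame cell labels. It assumes each frame has already been reduced to a mapping from cell position to a label, and that cells hidden by a foreground object have been resolved beforehand. The function names and the `erase_fraction` threshold are hypothetical.

```python
def stroke_timestamps(labels_per_frame, times):
    """Map each cell to the capture time at which it first shows a stroke."""
    first_seen = {}
    for labels, t in zip(labels_per_frame, times):
        for cell, label in labels.items():
            if label == "stroke" and cell not in first_seen:
                first_seen[cell] = t
    return first_seen


def key_frame_indices(labels_per_frame, erase_fraction=0.5):
    """Pick the frame just before each erasure, plus the final frame.

    Assumed rule: an erasure is a drop in the number of stroke cells by at
    least `erase_fraction` from one frame to the next.
    """
    counts = [
        sum(1 for label in labels.values() if label == "stroke")
        for labels in labels_per_frame
    ]
    keys = []
    for i in range(1, len(counts)):
        if counts[i - 1] > 0 and counts[i] <= counts[i - 1] * (1 - erase_fraction):
            keys.append(i - 1)
    # The last frame holds whatever was left on the board when capture ended.
    if counts and counts[-1] > 0 and (len(counts) - 1) not in keys:
        keys.append(len(counts) - 1)
    return keys


frames = [
    {(0, 0): "stroke", (0, 1): "whiteboard"},      # something is written
    {(0, 0): "stroke", (0, 1): "stroke"},          # more is written
    {(0, 0): "whiteboard", (0, 1): "whiteboard"},  # board is erased
    {(0, 0): "whiteboard", (0, 1): "stroke"},      # writing resumes
]
times = [0.0, 5.0, 10.0, 15.0]
stamps = stroke_timestamps(frames, times)
keys = key_frame_indices(frames)
```

The time stamps are what would let a stroke in a key frame act as an index into the recorded audio: selecting a cell yields the moment its content first appeared. This simplified version keeps only the first appearance of a stroke in each cell, so content rewritten after an erasure is not re-stamped.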
57 Claims
1. A process for summarizing and indexing audiovisual content, comprising the following process actions:
capturing a sequence of images of content written on a non-electronic white board with a camera;
recording audio signals correlated with the sequence of images; and
analyzing the sequence of images to isolate key frames summarizing key points of said board content;
wherein said analyzing process action comprises the following process actions:
rectifying a view of the whiteboard in every image in the sequence of images;
extracting whiteboard background color;
dividing each image of the sequence of images into cells, each cell representing a cell image;
clustering cell images throughout the sequence of images for each cell over time;
classifying each cell image as a stroke, a foreground object or whiteboard cell; and
extracting key frame images using the classification results.
Dependent claims: 2-37
38. A system for capturing the audio and video content of a meeting comprising:
a capture system that captures a sequence of images of data written on a whiteboard and audio signals corresponding to sounds that occur during a meeting;
an analysis server for analyzing the sequence of images that extracts key data frames written on the whiteboard and correlates the audio signals to the key data frames wherein the analysis server identifies the key data frames by:
rectifying a view of the whiteboard in every image in the sequence of images;
extracting whiteboard background color;
dividing each image of the sequence of images into cells of cell images;
clustering cell images that are similar throughout the sequence of images for each cell over time;
classifying each cell image as a stroke, a foreground object or whiteboard cell; and
extracting key frame images using the classification results; and
a browsing module for viewing the analyzed meeting key data frames and correlated audio.
Dependent claims: 39, 40, 41, 42, 46, 47, 48
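The step of clustering similar cell images for each cell over time can be pictured with the following sketch. It follows a single cell position through the sequence and groups consecutive frames in which the cell looks the same. Summarizing each cell image by its mean intensity, and the tolerance `tol`, are assumptions made for this example; the claim does not specify a similarity measure.

```python
def cluster_cell_over_time(cell_means, tol=10.0):
    """Group consecutive frames in which one cell position looks the same.

    cell_means: per-frame mean intensity of a single cell position.
    Returns a list of (start_frame, end_frame, cluster_mean) tuples.
    """
    if not cell_means:
        return []
    clusters = []
    start = 0
    total = cell_means[0]
    for i in range(1, len(cell_means)):
        running_mean = total / (i - start)
        if abs(cell_means[i] - running_mean) > tol:
            # The cell changed appearance: close the current cluster.
            clusters.append((start, i - 1, running_mean))
            start, total = i, cell_means[i]
        else:
            total += cell_means[i]
    clusters.append((start, len(cell_means) - 1, total / (len(cell_means) - start)))
    return clusters


# One cell over six frames: blank, then written on, then erased.
history = [250, 252, 251, 190, 188, 250]
clusters = cluster_cell_over_time(history)
```

Grouping frames this way means each cell only needs to be classified once per cluster rather than once per frame, and a cluster boundary marks a moment when the cell's content changed.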
43. A system for capturing the audio and video content of a meeting comprising:
a capture system that captures a sequence of images of data written on a whiteboard and audio signals corresponding to sounds that occur during a meeting, wherein one or more users register a user identifier at the capture unit before recording starts;
if at least one user registers at the capture unit, the analysis server generates an access token after event recording and analysis;
the access token and computer memory location of analyzed meeting data is provided to the registered user identifiers; and
the one or more users access the computer memory location of the analyzed event data to review the analyzed meeting data;
an analysis server for analyzing the sequence of images that extracts key data frames written on the whiteboard and correlates the audio signals to the key data frames; and
a browsing module for viewing the analyzed meeting key data frames and correlated audio.
Dependent claims: 44, 45
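The registration and access-token flow in this claim might look roughly like the sketch below. It assumes user identifiers are plain strings and uses Python's standard `secrets` module to produce the token. The function name `issue_access`, the example identifiers, and the storage path are hypothetical.

```python
import secrets


def issue_access(registered_users, storage_location):
    """Give every registered user the token and the location of the results.

    Returns {user_id: (token, storage_location)}. No token is generated
    when nobody registered before recording started.
    """
    if not registered_users:
        return {}
    # One unguessable token for the whole meeting, generated after analysis.
    token = secrets.token_urlsafe(16)
    return {user: (token, storage_location) for user in registered_users}


# Hypothetical identifiers and location, for illustration only.
grants = issue_access(
    ["alice@example.com", "bob@example.com"],
    "/meetings/0001",
)
no_grants = issue_access([], "/meetings/0002")
```

Issuing one token per meeting, rather than one per user, matches the claim's wording that a single access token is provided to the registered user identifiers.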
49. A process for distilling event content, comprising the following process actions:
capturing a sequence of images of content written on a non-electronic white board with a camera;
recording audio signals correlated with the sequence of images; and
analyzing the sequence of images to isolate key frames summarizing key points of said board content by:
rectifying a view of the whiteboard in each image in the sequence of images;
extracting whiteboard background color;
dividing each image of the sequence of images into cells;
clustering cell images throughout the sequence of images for each cell over time;
classifying each cell image as a stroke, a foreground object or whiteboard cell; and
extracting key frame images using the classification results.
Dependent claims: 50, 51, 52, 53, 54
55. A system for distilling the content of a meeting comprising:
a capture system that captures a sequence of data written on a whiteboard, said capture system tracking pen location to infer content written on a whiteboard and recording audio signals correlating to said content written on a whiteboard;
an analysis server for analyzing the sequence of images that extracts key data frames written on the whiteboard and correlates the audio signals to the key data frames, wherein said analysis server performs the following actions:
dividing each region of the whiteboard into cells;
clustering cells that are the same throughout the sequence of data written for each cell over time;
classifying each cell as a stroke or a whiteboard cell; and
extracting key frame images using the classification results.
Dependent claims: 56
57. A computer-readable medium having computer-executable instructions for providing non-linear access to data recorded during a meeting, said computer executable instructions comprising program modules for:
capturing a sequence of images of content written on a non-electronic white board with a camera;
recording audio signals correlated with the sequence of images; and
analyzing the sequence of images to isolate key frames summarizing key points of said board content by:
rectifying a view of the whiteboard in each image in the sequence of images;
extracting whiteboard background color;
dividing each image of the sequence of images into cells;
clustering cell images throughout the sequence of images for each cell over time;
classifying each cell image as a stroke, a foreground object or whiteboard cell; and
extracting key frame images using the classification results.
Specification