System and method for automatically cataloguing data by utilizing speech recognition procedures
Abstract
A system and method for automatically cataloguing data by utilizing speech recognition procedures includes an electronic device that captures audio/video data and corresponding verbal narration. A speech recognition engine coupled to the electronic device automatically performs a speech recognition process upon the audio/video data and verbal narration to generate labels that correspond to respective subject matter locations in the audio/video data. A label manager of the electronic device manages a label mode for generating and storing the foregoing labels. The label manager also controls a label search mode during which a system user utilizes the labels to automatically locate corresponding subject matter locations in the captured audio/video data.
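The label mode and label search mode described in the abstract can be illustrated with a minimal sketch. The names used here (`Label`, `LabelManager`, `add_label`, `search`) and the choice of a time offset in seconds as the "subject matter location" are illustrative assumptions that do not come from the patent text. The speech recognition step is assumed to have already produced the label text.

```python
from dataclasses import dataclass, field


@dataclass(frozen=True)
class Label:
    text: str        # label text, assumed to come from a speech recognizer
    offset_s: float  # assumed location: seconds from the start of the recording


@dataclass
class LabelManager:
    """Hypothetical label manager: stores labels and searches them."""
    labels: list = field(default_factory=list)

    def add_label(self, text, offset_s):
        # "Label mode": store a label together with its location.
        label = Label(text=text.strip(), offset_s=offset_s)
        self.labels.append(label)
        return label

    def search(self, query):
        # "Label search mode": case-insensitive substring match,
        # returned in playback order.
        q = query.casefold()
        hits = [lab for lab in self.labels if q in lab.text.casefold()]
        return sorted(hits, key=lambda lab: lab.offset_s)


manager = LabelManager()
manager.add_label("birthday cake", 12.5)
manager.add_label("opening presents", 95.0)
manager.add_label("Cake cutting", 140.25)

# Offsets a player could seek to for the query "cake".
cake_offsets = [lab.offset_s for lab in manager.search("cake")]
```

A substring match is only one possible search strategy; the abstract does not say how labels are matched against a user's selection.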
47 Claims
10-1. The method of claim 21 wherein said label manager instructs said electronic device to enter a non-real-time label mode for creating and storing said text labels, said electronic device responsively retrieving and playing back said audio/video data and said narration.
21. A method for cataloguing electronic information, comprising:
capturing audio/video data corresponding to a photographic target by utilizing an electronic device, said audio/video data including a narration provided by a narrator;
providing a speech recognition engine that automatically performs a speech recognition process upon said narration to generate text labels that correspond to respective subject matter locations in said audio/video data;
managing a label mode for generating and storing said text labels by utilizing a label manager; and
controlling a label search mode with said label manager, said label search mode utilizing said text labels to locate said respective subject matter locations in said audio/video data.
41. A computer-readable medium comprising program instructions for cataloguing electronic information by:
capturing audio/video data corresponding to a photographic target by utilizing an electronic device, said audio/video data including a narration provided by a narrator;
providing a speech recognition engine that automatically performs a speech recognition process upon said narration to generate text labels that correspond to respective subject matter locations in said audio/video data;
managing a label mode for generating and storing said text labels by utilizing a label manager; and
controlling a label search mode with said label manager, said label search mode utilizing said text labels to locate said respective subject matter locations in said audio/video data.
42. A system for cataloguing electronic information, comprising:
means for capturing audio/video data corresponding to a photographic target, said audio/video data including a narration provided by a narrator;
means for automatically performing a speech recognition process upon said narration to generate text labels that correspond to respective subject matter locations in said audio/video data;
means for managing a label mode to generate and store said text labels; and
means for controlling a label search mode that utilizes said text labels to locate said respective subject matter locations in said audio/video data.
43. A system for cataloguing electronic information, comprising:
an imaging device that captures audio/video data corresponding to selected photographic targets, said audio/video data including a verbal narration provided by a narrator;
a speech recognition engine that automatically performs a speech recognition process upon said narration to generate text labels that are based upon said narration, said text labels corresponding to respective subject matter locations in said audio/video data, said text labels including abbreviated word sequences that identify said selected photographic targets; and
a label manager that manages a label mode during which said text labels are generated by said speech recognition engine, said label manager also storing said text labels during said label mode, said text labels being stored along with meta-information that associates said respective subject matter locations to corresponding ones of said text labels, said label manager also controlling a label search mode for utilizing said text labels to locate specific corresponding ones of said respective subject matter locations from said audio/video data, said label manager providing a label-search user interface upon a display of said imaging device for displaying said text labels and corresponding visual images of said respective subject matter locations from said audio/video data, a system user interactively choosing a selected text label by utilizing said label-search user interface, said imaging device responsively displaying said audio/video data from a selected subject matter location corresponding only to said selected text label.
44. A system for cataloguing electronic information, comprising:
an electronic device that captures said electronic information that includes verbal narration data;
a speech recognition engine that analyzes said electronic information to generate labels that correspond to respective subject matter locations in said electronic information; and
a label manager that utilizes said labels to locate said respective subject matter locations in said electronic information.
45. A system for cataloguing electronic information, comprising:
an electronic device that captures audio/video data corresponding to a photographic target, said audio/video data including a narration provided by a narrator; and
a speech recognition engine that automatically performs a speech recognition process upon said audio/video data to generate labels that correspond to respective subject matter locations in said audio/video data.
46. A system for cataloguing electronic information, comprising:
an electronic device that captures audio/video data corresponding to a photographic target, said audio/video data including a narration provided by a narrator; and
a label manager that controls a label search mode for utilizing labels derived from said narration to locate corresponding respective subject matter locations in said audio/video data.
47. An electronic cataloguing system implemented by:
capturing electronic data which includes a narration provided by a narrator;
performing a speech recognition process upon said electronic data to automatically generate labels that correspond to respective subject matter locations in said electronic data; and
utilizing said labels to locate said respective subject matter locations in said electronic data.
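Claim 43 recites text labels that include "abbreviated word sequences" stored along with meta-information tying each label to its location. The claims do not state how the abbreviation is performed, so the following is only one possible sketch: it drops common filler words from a recognized narration segment and keeps the first few remaining words. The stop-word list, the `max_words` limit, the record field names, and the sample narration are all hypothetical.

```python
# Hypothetical stop-word list; a real system would likely use a larger one.
STOP_WORDS = frozenset(
    {"a", "an", "and", "at", "in", "is", "of", "on", "the", "this", "to"}
)


def abbreviate(narration, max_words=3):
    """Reduce a recognized narration segment to a short label (assumed rule)."""
    words = [w.strip(".,!?;:").lower() for w in narration.split()]
    keep = [w for w in words if w and w not in STOP_WORDS]
    return " ".join(keep[:max_words])


def build_catalog(segments):
    """Pair each abbreviated label with meta-information about its location.

    `segments` is an iterable of (start_seconds, recognized_text) tuples,
    assumed to be the output of a speech recognition pass.
    """
    return [
        {"label": abbreviate(text), "start_s": start_s, "narration": text}
        for start_s, text in segments
    ]


segments = [
    (0.0, "This is the Golden Gate Bridge at sunset."),
    (42.0, "A sea lion on the pier."),
]
catalog = build_catalog(segments)
labels = [entry["label"] for entry in catalog]
```

Keeping the full narration next to the abbreviated label is a design choice made for this sketch, so that a search interface could show more context than the short label alone.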
Specification