Method and apparatus for managing information
First Claim
1. A method for recording, categorizing, organizing, managing and retrieving speech information, said method comprising,a. obtaining a speech stream,b. storing the speech stream in at least a temporary storage,c. extracting multiple, selected features from the speech stream, wherein the multiple features include the speaker'"'"'s identity or location, duration of speech phrases, and pauses in speaking,d. constructing a visual representation of the selected features of the speech stream,e. providing the visual representation to a user,f. categorizing portions of the speech stream, with or without the aid of the representation, by at least one of the following categorization techniques:
- user command and,automatic recognition of speech qualities, including tempo, fundamental pitch, and phonemes, andg. storing, in at least a temporary storage, data structure which represents the categorized portions of the speech stream.
0 Assignments
0 Petitions
Accused Products
Abstract
A method and apparatus for recording, categorizing, organizing, managing and retrieving speech information obtains a speech stream; stores the speech stream in at least a temporary storage; provides a visual representation of portions of the speech stream to the user; categorizes portions of a speech stream, with or without the aid of the visual representation, by user command and/or by automatic recognition of speech qualities; stores, in at least a temporary storage, structure which represents a categorized portions of the speech stream; and selectively retrieves one or more of the categorized portions of the Speech stream.
-
Citations
57 Claims
-
1. A method for recording, categorizing, organizing, managing and retrieving speech information, said method comprising,
a. obtaining a speech stream, b. storing the speech stream in at least a temporary storage, c. extracting multiple, selected features from the speech stream, wherein the multiple features include the speaker'"'"'s identity or location, duration of speech phrases, and pauses in speaking, d. constructing a visual representation of the selected features of the speech stream, e. providing the visual representation to a user, f. categorizing portions of the speech stream, with or without the aid of the representation, by at least one of the following categorization techniques: -
user command and, automatic recognition of speech qualities, including tempo, fundamental pitch, and phonemes, and g. storing, in at least a temporary storage, data structure which represents the categorized portions of the speech stream. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25)
-
-
26. A method for recording, categorizing, organizing, managing and retrieving speech information transmitted by telephone, said method comprising,
a. obtaining a speech stream from a telephone connection, b. storing the speech stream in at least a temporary storage, c. extracting multiple, selected features from the speech stream, wherein the multiple features include the speaker'"'"'s identity or location, duration of speech phrases, and pauses in speaking. d. categorizing portions of the speech stream by user command or by automatic recognition of speech qualities, including tempo, fundamental pitch, and phonemes, and wherein the categorizing portions of the speech stream includes categorizing the speaker by indicating which end of the telephone connection the speech is coming from, e. storing, in at least a temporary storage, data structure which represents the categorized portions of the speech stream, and f. selectively retrieving one or more of the categorized portions of the speech stream.
-
27. A method of recording speech, said method comprising,
capturing the speech, storing the captured speech in a temporary storage, extracting multiple, selected features from the speech stream, wherein the multiple features include the speaker'"'"'s location, duration of speech phrases, and pauses in speaking, representing selected, extracted features of the speech in a visual form to the user, using the visual representation to select portions of the speech for storage and including the step of looking at the visual representation of the captured speech in the temporary storage and selectively categorizing portions of that speech, with the aid of the visual representation, after the speech has been captured in the temporary storage.
-
28. A method for recording and indexing speech information, said method comprising,
obtaining a speech stream, storing the entire speech stream as an unannotated speech stream in a first, separate storage, automatically recognizing qualities of the speech stream, including tempo, fundamental pitch, and phonemes, categorizing portions of the speech stream by user command, and by association with the automatically recognized qualities, storing the categorized portions together with said automatically recognized qualities in a second storage, synchronizing at least a portion of the obtained speech stream with both the stored categorized portions and the stored automatically recognized qualities, and compiling the automatically recognized qualities with the categorized portions as compiled speech information in a manner which permits the compiled speech information to be organized, managed, and selectively retrieved by a user.
-
29. A speech information apparatus for recording, categorizing, organizing, managing and retrieving speech information, said apparatus comprising,
a. speech stream means for obtaining a speech stream, b. first storage means for storing the speech stream in at least a temporary storage, c. extracting means for extracting multiple, selected features from the speech stream, and wherein the multiple features include the speaker'"'"'s identity or location, duration of speech phrases, and pauses in speaking, d. constructing means for constructing a visual representation of the selected features of the speech stream, e. visual representation means for providing the visual representation to a user, f. categorizing means for categorizing portions of the speech stream, with or without the aid of the representation, by at least one of the following categorizing techniques: -
user command and, automatic recognition of speech qualities, including tempo, fundamental pitch, and phonemes, and g. second storage means for storing, in at least a temporary storage, data structure which represents the categorized portions of the speech stream. - View Dependent Claims (30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53)
-
-
54. A speech information apparatus for recording, categorizing, organizing, managing and retrieving speech information transmitted by telephone, said apparatus comprising,
a. a speech stream means for obtaining a speech stream from a telephone call, b. first storage means for storing the speech stream in at least a temporary storage, c. extracting means for extracting multiple, selected features from the speech stream, wherein the multiple features include the speaker'"'"'s identity or location, duration of speech phrases,and pauses in speaking, d. categorizing means for categorizing portions of the speech stream by user command or by automatic recognition of speech qualities, including tempo, fundamental pitch, and phonemes, e. second storage means for storing, in at least a temporary storage, structure which represents the categorized portions of the speech stream, and f. retrieving means for selectively retrieving one or more of the categorized portions of the speech stream, and g. wherein the speech portions are categorized in the categorizing means by speaker by indicating which end of the telephone connection the speech is coming from.
-
55. A speech information apparatus for recording speech, said apparatus comprising,
capture means for capturing the speech, temporary storage means for storing captured speech in a temporary storage, extracting means for extracting multiple, selected features from the speech, wherein the multiple features include the speaker'"'"'s location, duration of speech phrases, and pauses in speaking, visual representation means for representing selected, extracted features of the speech in a visual form to a user, selection means for using the visual representation to select portions of the speech for storage, and including visual means for looking at the captured speech in the temporary store and categorizing means for selectively categorizing portions of that speech, with the aid of the visual representation, after the speech has been captured and stored in the temporary storage means.
-
56. A speech information apparatus for recording and indexing speech information, said apparatus comprising,
speech stream means for obtaining a speech stream, first storage means for storing an entire speech stream as an unannotated speech stream in a first storage, automatic categorizing means for automatically recognizing qualities of the speech stream, including tempo, fundamental pitch, and phonemes, user command means for categorizing portions of the speech stream by user command and by association with the automatically recognized qualities, second storage means separate from the first storage means for storing the categorized portions of the speech stream together with the automatically recognized qualities, synchronizing means for synchronizing at least a portion of the obtained speech stream with the categorized portions and the automatically recognized qualities stored in said second storage, and compiling means for compiling the automatically recognized qualities with the categorized portions as compiled speech information in a manner which permits the compiled speech information to be organized, managed, selectively retrieved by a user.
-
57. A video information apparatus for recording, categorizing, organizing, managing and retrieving video information, said apparatus comprising,
a. stream means for obtaining a video stream, b. first storage means for storing the speech stream in at least a temporary storage, c. extracting means for extracting multiple, selected features from the video stream, d. constructing means for constructing a visual representation of the selected features of the video stream, e. visual means for providing the visual representation to a user, f. categorizing means for categorizing portions of the speech stream by user command or by automatic recognition of visual or audio qualities, and g. second storage means for storing, in at least a temporary storage, structure which represents the categorized portions of the speech stream.
Specification