Language independent and voice operated information management system
First Claim
1. A voice operated portable information management system that is substantially language independent and capable of supporting a substantially unlimited vocabulary, including:
- an input transducer for receiving a user'"'"'s speech;
an output transducer for outputting sound including speech;
a speech processing system coupled to the input and output transducers and including means for;
generating and storing compressed speech data corresponding to a user'"'"'s speech received through the input transducer;
comparing the stored speech data;
re-synthesizing the stored speech data for output as speech through the output transducer;
providing an audible user interface including a speech assistant for providing instructions in the user'"'"'s language;
storing user-specific compressed speech data, including commands, received in response to prompts from the speech assistant for purposes of adapting the system to the user'"'"'s speech;
identifying memo management commands spoken by the user, and storing and organizing compressed speech data as a function of the identified commands;
identifying memo retrieval commands and search utterances spoken by the user, performing a best-match search comparing the search utterances to all or specified parts of the stored speech data, building a most likely candidate list of stored speech data, and outputting the identified stored speech data as a function of the commands; and
identifying memo edit commands spoken by the user, and editing the identified stored speech data as a function of the commands.
2 Assignments
0 Petitions
Accused Products
Abstract
A voice operated portable information management system that is substantially language independent and capable of supporting a substantially unlimited vocabulary. The system includes a microphone, speaker, clock and GPS connected to a speech processing system. The speech processing system: 1) generates and stores compressed speech data corresponding to a user'"'"'s speech received through the microphone, 2) compares the stored speech data, 3) re-synthesizes the stored speech data for output as speech through the speaker, 4) provides an audible user interface including a speech assistant for providing instructions in the user'"'"'s language, 5) stores user-specific compressed speech data, including commands, received in response to prompts from the speech assistant for purposes of adapting the system to the user'"'"'s speech, 6) identifies memo management commands spoken by the user, and stores and organizes compressed speech data as a function of the identified commands, and 7) identifies memo retrieval commands spoken by the user, and retrieves and outputs the stored speech data as a function of the commands. In addition to general purpose voice memo and retrieval functions, application supported by the information management system include time, data and stopwatch functions, temporal and geographic alarms and functions, calendar management, voice calculation, voice maps, mobile telephone tasks and audio content management.
-
Citations
23 Claims
-
1. A voice operated portable information management system that is substantially language independent and capable of supporting a substantially unlimited vocabulary, including:
-
an input transducer for receiving a user'"'"'s speech;
an output transducer for outputting sound including speech;
a speech processing system coupled to the input and output transducers and including means for;
generating and storing compressed speech data corresponding to a user'"'"'s speech received through the input transducer;
comparing the stored speech data;
re-synthesizing the stored speech data for output as speech through the output transducer;
providing an audible user interface including a speech assistant for providing instructions in the user'"'"'s language;
storing user-specific compressed speech data, including commands, received in response to prompts from the speech assistant for purposes of adapting the system to the user'"'"'s speech;
identifying memo management commands spoken by the user, and storing and organizing compressed speech data as a function of the identified commands;
identifying memo retrieval commands and search utterances spoken by the user, performing a best-match search comparing the search utterances to all or specified parts of the stored speech data, building a most likely candidate list of stored speech data, and outputting the identified stored speech data as a function of the commands; and
identifying memo edit commands spoken by the user, and editing the identified stored speech data as a function of the commands. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23)
the system further includes one or more switches actuated by a user to indicate memo management commands and/or memo retrieval commands; and
the speech processing system is coupled to the one or more switches and operates as a function of the actuated switches.
-
-
3. The information management system of claim 1 wherein:
-
the system further includes a clock; and
the speech processing system is coupled to the clock and includes means for;
identifying temporal commands spoken by the user, and storing and organizing compressed speech data and temporal actions, including alarms, as a function of the commands; and
responding to the stored temporal actions.
-
-
4. The information management system of claim 3 wherein:
-
storing temporal actions includes storing time stamps; and
responding to temporal actions includes responding as a function of the stored time stamps.
-
-
5. The information management system of claim 1 wherein:
-
the system further includes a global positioning sensor (GPS); and
the speech processing system is coupled to the GPS and includes means for;
identifying geographic commands spoken by the user, and storing and organizing compressed speech data and geographic actions, including alarms, as a function of the commands; and
responding to the stored geographic actions.
-
-
6. The information management system of claim 1 wherein:
-
storing geographic actions includes storing geographic stamps; and
responding to geographic actions includes responding as a function of the stored geographic stamps.
-
-
7. The information management system of claim 1 and further including a data port coupled to the speech processing system for communicating data and programs.
-
8. The information management system of claim 1 wherein the speech processing system further includes means for:
-
identifying calculation commands spoken by a user, and storing compressed speech data and performing calculations as a function of the data and commands; and
outputting results of the performed calculations in speech.
-
-
9. The information management system of claim 1 wherein the speech processing system further includes means for:
-
identifying calendar management commands spoken my a user, and storing and organizing compressed speech data as a function of the commands; and
identifying calendar retrieval commands spoken by a user, and retrieving and outputting stored speech as a function of the commands.
-
-
10. The information management system of claim 1 wherein the speech processing system further includes means for:
-
identifying telephone management commands spoken by a user, and storing and organizing compressed speech data as a function of the commands; and
identifying telephone retrieval commands spoken by a user, and retrieving and outputting stored speech as a function of the commands.
-
-
11. The information management system of claim 1 wherein the speech processor includes:
-
means for dividing speech into a plurality of frames;
means for extracting features from the frames of speech; and
means for compressing the extracted features.
-
-
12. The information management system of claim 11 wherein the means for extracting features includes means for calculating pitch, spectral envelope and gain.
-
13. The information management system of claim 12 wherein the means for calculating the spectral envelope includes means for calculating the spectral envelope using a warped linear prediction operation.
-
14. The information management system of claim 11 wherein the means for compressing the extracted features includes means for compressing pitch and gain using scalar quantization.
-
15. The information management system of claim 11 wherein the means for compressing the extracted features includes means for compressing the spectral envelope as a function of the user-specific compressed speech data stored during operation in the codebook generation mode.
-
16. The information management system of claim 1 wherein the speech processing system stores the speech assistant instructions in a plurality of languages and provides the prompts in a language selected by a user.
-
17. The information management system of claim 1 wherein the speech data is stored in a tree structure.
-
18. The information management system of claim 17 wherein the speech data is stored in a tree structure determined by user-spoken commands.
-
19. The information management system of claim 1 wherein storing compressed speech data includes storing the time of storage and the location of the system at the time of storage.
-
20. The information management system of claim 1 wherein the speech processing system further includes means for:
-
identifying telephone task commands, including dialing commands; and
responding to the telephone task commands, including dialing requested telephone numbers.
-
-
21. The information management system of claim 1 wherein the speech processing system further includes means for:
-
identifying audio content management commands, including store and play commands; and
responding to identified commands, including storing and playing audio content.
-
-
22. The information management system of claim 1 wherein:
-
the system further includes a global positioning sensor (GPS); and
the speech processing system is coupled to the GPS and includes means for;
storing location stamps associated with stored speech data;
in response to commands and search utterances, performing best-match voice searches to identify stored location stamps;
in response to commands and search utterances, performing best-match voice searches biased to geographic locations, to identify stored speech data having location stamps corresponding to the geographic locations;
identifying geographic commands spoken by a user, and outputting synthesized voice responses in response to the identified geographic commands; and
identifying geographic events as a function of the GPS and outputting synthesized voice responses and/or stored speech data in response to the identified geographic events.
-
-
23. The information management system of claim 1 wherein:
-
the system further includes a clock; and
the speech processing system is coupled to the clock and includes means for;
storing time stamps associated with the stored speech data;
in response to commands and search utterances, performing best-match voice searches to identify stored time stamps;
in response to commands and search utterances, performing best-match voice searches biased to times, to identify stored speech data having time stamps corresponding to the times;
identifying temporal commands spoken by the user and outputting synthesized voice responses in response to the identified temporal commands; and
identifying temporal events as a function of the clock and outputting synthesized voice responses and/or stored speech data in response to the identified temporal events.
-
Specification