Voice command-driven database
First Claim
1. A mobile device news reader, comprising:
- at least one processor;
at least one database coupled to the at least one processor;
a text-to-speech engine embedded in the mobile device and executable by the at least one processor to convert text data stored in the at least one database to a speech-based output;
a speech-to-text engine embedded in the mobile device and executable by the at least one processor to process a voice command input received from a user of the device and generate data in the form of text that corresponds to the received voice command input; and
non-transient memory coupled to the at least one processor that stores program instructions, which, when executed by the at least one processor, cause the at least one processor to;
receive text-based news articles and text-based headlines associated with the news articles over a network from a remote server,store an identification of a single, unique, system-wide, and context-independent voice command input for use to select text-based news articles for output by the speech-to-text engine, to select menu options, and to modify output settings,store a list of the received news articles correlated with a list of the received headlines in the at least one database, andwithout utilizing a connection to the network,retrieve the list of headlines from the at least one database,enter a first list-pause cycle comprising;
a list-pause that comprises;
outputting, via the text-to-speech engine, a headline from the list of received headlines; and
a pause of a predetermined length of time;
repeating the list-pause, each repetition of the list-pause outputting a different next headline from the list of received headlines, until determining that the single, unique, system-wide, and context-independent voice command input has been spoken during the pause of a most recent list-pause; and
responsive to determining that the single, unique, system-wide, and context-independent voice command input has been spoken during the pause of the most recent list-pause, outputting, via the text-to-speech engine, at least a portion of the news article from the sequence of news articles correlated with the headline output during that most recent list-pause;
enter a second list-pause cycle comprising;
a list-pause comprising;
outputting, via the text-to-speech engine, an identifier of a menu option; and
a pause of a predetermined length of time;
repeating the list-pause, each repetition of the list-pause outputting a different next identifier of a menu option, until that the single, unique, system-wide, and context-independent voice command input has been spoken during the pause of a most recent list-pause; and
responsive to determining that the single, unique, system-wide, and context-independent voice command input has been spoken during the pause of the most recent list-pause, navigating to a different menu or mode of the news reader associated with the identifier output during that list-pause; and
enter a third list-pause cycle comprising;
a list-pause comprising;
outputting, via the text-to-speech engine, an identifier of an output setting; and
a pause of a predetermined length of time;
repeating the list-pause, each repetition of the list-pause outputting a different next identifier of an output setting, until determining that the single, unique, system-wide, and context-independent voice command input has been spoken during the pause of a most recent list-pause; and
responsive to determining that the single, unique, system-wide, and context-independent voice command input has been spoken during the pause of the most recent list-pause, modifying an output setting of the text-to-speech engine to match a setting associated with the identifier output during that list-pause.
1 Assignment
0 Petitions
Accused Products
Abstract
A voice command-driven system and computer-implemented method are disclosed for selecting a data item in a list of text-based data items stored in a database using a simple affirmative voice command input without utilizing a connection to a network. The text-based data items in the list are converted to speech using an embedded text-to-speech engine and an audio output of a first converted data item is provided. A listening state is entered into for a predefined pause time to await receipt of the simple affirmative voice command input. If the simple affirmative voice command input is received during the predefined pause time, the first converted data item is selected for processing. If the simple affirmative voice command input is not received during the predefined pause time, an audio output of a next converted data item in the list is provided.
-
Citations
14 Claims
-
1. A mobile device news reader, comprising:
-
at least one processor; at least one database coupled to the at least one processor; a text-to-speech engine embedded in the mobile device and executable by the at least one processor to convert text data stored in the at least one database to a speech-based output; a speech-to-text engine embedded in the mobile device and executable by the at least one processor to process a voice command input received from a user of the device and generate data in the form of text that corresponds to the received voice command input; and non-transient memory coupled to the at least one processor that stores program instructions, which, when executed by the at least one processor, cause the at least one processor to; receive text-based news articles and text-based headlines associated with the news articles over a network from a remote server, store an identification of a single, unique, system-wide, and context-independent voice command input for use to select text-based news articles for output by the speech-to-text engine, to select menu options, and to modify output settings, store a list of the received news articles correlated with a list of the received headlines in the at least one database, and without utilizing a connection to the network, retrieve the list of headlines from the at least one database, enter a first list-pause cycle comprising; a list-pause that comprises;
outputting, via the text-to-speech engine, a headline from the list of received headlines; and
a pause of a predetermined length of time;repeating the list-pause, each repetition of the list-pause outputting a different next headline from the list of received headlines, until determining that the single, unique, system-wide, and context-independent voice command input has been spoken during the pause of a most recent list-pause; and responsive to determining that the single, unique, system-wide, and context-independent voice command input has been spoken during the pause of the most recent list-pause, outputting, via the text-to-speech engine, at least a portion of the news article from the sequence of news articles correlated with the headline output during that most recent list-pause; enter a second list-pause cycle comprising; a list-pause comprising;
outputting, via the text-to-speech engine, an identifier of a menu option; and
a pause of a predetermined length of time;repeating the list-pause, each repetition of the list-pause outputting a different next identifier of a menu option, until that the single, unique, system-wide, and context-independent voice command input has been spoken during the pause of a most recent list-pause; and responsive to determining that the single, unique, system-wide, and context-independent voice command input has been spoken during the pause of the most recent list-pause, navigating to a different menu or mode of the news reader associated with the identifier output during that list-pause; and enter a third list-pause cycle comprising; a list-pause comprising;
outputting, via the text-to-speech engine, an identifier of an output setting; and
a pause of a predetermined length of time;repeating the list-pause, each repetition of the list-pause outputting a different next identifier of an output setting, until determining that the single, unique, system-wide, and context-independent voice command input has been spoken during the pause of a most recent list-pause; and responsive to determining that the single, unique, system-wide, and context-independent voice command input has been spoken during the pause of the most recent list-pause, modifying an output setting of the text-to-speech engine to match a setting associated with the identifier output during that list-pause. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A voice-command driven method for controlling a news reader device, comprising:
-
receiving, using at least one processor, text-based news articles and text-based headlines associated with the news articles over a network from a remote server; storing, using the at least one processor, an identification of a single, unique, system-wide, and context-independent voice command input for use to select text-based news articles for output by the speech-to-text engine, to select menu options, and to modify output settings, storing, using the at least one processor, a list of the received news articles correlated with a list of the received headlines in at least one database coupled to the at least one processor in the device, and without utilizing a connection to the network, retrieving, using the at least one processor, the list of headlines from the at least one database, entering a first list-pause cycle comprising; a list-pause that comprises; outputting, via the text-to-speech engine, a headline from the list of received headlines; and a pause of a predetermined length of time; repeating the list-pause, each repetition of the list-pause outputting a different next headline from the list of received headlines, until determining that the single, unique, system-wide, and context-independent voice command input has been spoken during the pause of a most recent list-pause; and responsive to determining that the single, unique, system-wide, and context-independent voice command input has been spoken during the pause of the most recent list-pause, outputting, via the text-to-speech engine, at least a portion of the news article from the sequence of news articles correlated with the headline output during that most recent list-pause; entering a second list-pause cycle comprising; a list-pause comprising; outputting, via the text-to-speech engine, an identifier of a menu option; and a pause of a predetermined length of time; repeating the list-pause, each repetition of the list-pause outputting a different next identifier of a menu option, until that the single, unique, system-wide, and context-independent voice command input has been spoken during the pause of a most recent list-pause; and responsive to determining that the single, unique, system-wide, and context-independent voice command input has been spoken during the pause of the most recent list-pause, navigating to a different menu or mode of the news reader associated with the identifier output during that list-pause; and enter a third list-pause cycle comprising; a list-pause comprising; outputting, via the text-to-speech engine, an identifier of an output setting; and a pause of a predetermined length of time; repeating the list-pause, each repetition of the list-pause outputting a different next identifier of an output setting, until determining that the single, unique, system-wide, and context-independent voice command input has been spoken during the pause of a most recent list-pause; and responsive to determining that the single, unique, system-wide, and context-independent voice command input has been spoken during the pause of the most recent list-pause, modifying an output setting of the text-to-speech engine to match a setting associated with the identifier output during that list-pause. - View Dependent Claims (9, 10, 11, 12, 13, 14)
-
Specification