Audibly indicating secondary content with spoken text
Abstract
A system and method for navigating secondary content. The system may monitor for gestures input to the system by an input device and may detect an arc gesture. The arc gesture may travel along both a horizontal axis and a vertical axis from a first point to a second point and may be delineated from a horizontal or a vertical motion. The system may identify secondary content corresponding to the arc gesture in response to the arc gesture and output data corresponding to the secondary content. The system may identify supplemental text associated with the secondary content and synthesize supplemental speech corresponding to the supplemental text. The output data may include audio including the synthesized supplemental speech.
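The flow the abstract describes (detect an arc gesture, identify the secondary content, find its supplemental text, output synthesized speech) can be sketched roughly as follows. This is a minimal illustration and not the patented implementation: `SecondaryContent`, `handle_gesture`, the gesture labels, and the `synthesize` callback are all hypothetical names introduced here.

```python
from dataclasses import dataclass
from typing import Callable, Optional


@dataclass(frozen=True)
class SecondaryContent:
    """Hypothetical record for a footnote or similar secondary content."""
    content_id: str
    supplemental_text: str


def handle_gesture(
    gesture: str,
    pending: Optional[SecondaryContent],
    synthesize: Callable[[str], bytes],
) -> Optional[bytes]:
    """Return audio for the supplemental text if an arc gesture selects it.

    `gesture` is an assumed label ("arc", "horizontal", "vertical") produced
    by some upstream gesture classifier; `synthesize` stands in for a
    text-to-speech engine.
    """
    # Only an arc gesture selects secondary content; plain horizontal or
    # vertical motions are left for other navigation handling.
    if gesture != "arc" or pending is None:
        return None
    return synthesize(pending.supplemental_text)


def fake_tts(text: str) -> bytes:
    """Stand-in synthesizer that just encodes the text as bytes."""
    return text.encode("utf-8")
```

With a stand-in synthesizer such as `fake_tts`, calling `handle_gesture("arc", note, fake_tts)` returns the encoded supplemental text, while a `"horizontal"` gesture returns `None`.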
22 Claims
1. A computer-implemented method for navigating secondary content during a text-to-speech process, the method comprising:
- outputting first audio including an audio tone preceded by first synthesized speech and followed by second synthesized speech, the audio tone corresponding to an indicator of a first footnote located in a string of text, the first synthesized speech associated with a portion of the string of text prior to the indicator and the second synthesized speech associated with a portion of the string of text following the indicator;
- detecting first contact on a touch-screen of a computing device within a first period of time following output of the audio tone;
- determining that the first contact corresponds to a predefined first arc gesture, the first contact extending along both a horizontal axis and a vertical axis from a first point to a second point, a difference between a first horizontal coordinate associated with the first point and a second horizontal coordinate associated with the second point exceeding a horizontal threshold in a first direction relative to the first point, and a difference between a first vertical coordinate associated with the first point and a second vertical coordinate associated with a midpoint of the contact exceeding a vertical threshold;
- selecting the first footnote in response to the first arc gesture;
- identifying supplemental text associated with the first footnote; and
- outputting third synthesized speech corresponding to the supplemental text associated with the first footnote.
Dependent claims: 2, 3
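The geometric test in claim 1 can be illustrated with a short sketch. It makes several assumptions that the claim does not fix: the contact arrives as a list of timestamped samples, the "midpoint of the contact" is taken to be the middle sample, and the threshold and time-window values are arbitrary placeholders. `Point` and `is_arc_gesture` are hypothetical names.

```python
from dataclasses import dataclass
from typing import Sequence


@dataclass(frozen=True)
class Point:
    x: float  # horizontal coordinate, in pixels
    y: float  # vertical coordinate, in pixels
    t: float  # timestamp, in seconds


def is_arc_gesture(
    points: Sequence[Point],
    tone_time: float,
    *,
    direction: int = 1,         # +1 for rightward, -1 for leftward (assumed convention)
    h_threshold: float = 100.0,  # placeholder horizontal threshold
    v_threshold: float = 40.0,   # placeholder vertical threshold
    window: float = 2.0,         # placeholder "first period of time", in seconds
) -> bool:
    """Check a touch contact against the arc conditions recited in claim 1."""
    if len(points) < 3:
        return False
    first, last = points[0], points[-1]
    mid = points[len(points) // 2]  # assumption: midpoint = middle sample

    # The contact must begin within the time window following the tone.
    if not (0.0 <= first.t - tone_time <= window):
        return False

    # Horizontal travel from first to second point, in the chosen direction.
    horizontal = (last.x - first.x) * direction
    # Vertical offset of the midpoint relative to the first point.
    vertical = abs(mid.y - first.y)
    return horizontal > h_threshold and vertical > v_threshold
```

Comparing the midpoint's vertical coordinate, rather than the endpoint's, is what separates an arc from a straight horizontal swipe: both cover the same horizontal distance, but only the arc rises or dips in the middle.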
4. A computer-implemented method comprising:
- outputting first audio including an audio tone preceded by first synthesized speech and followed by second synthesized speech, the audio tone associated with an indicator of first secondary content located in a string of text and based on a type of the first secondary content, the first synthesized speech associated with a portion of the string of text prior to the indicator and the second synthesized speech associated with a portion of the string of text following the indicator;
- detecting first contact on a touch-screen of a computing device;
- determining that the first contact corresponds to a first arc gesture, the first contact extending along both a horizontal axis and a vertical axis from a first point to a second point, a difference between a first horizontal coordinate associated with the first point and a second horizontal coordinate associated with the second point exceeding a horizontal threshold in a first direction relative to the first point, and a difference between a first vertical coordinate associated with the first point and a second vertical coordinate associated with a midpoint of the first contact exceeding a vertical threshold;
- selecting the first secondary content that corresponds to the first arc gesture;
- identifying supplemental text associated with the first secondary content; and
- outputting second audio corresponding to the supplemental text in response to the first arc gesture.
Dependent claims: 5, 6, 7, 8, 9, 10, 11, 12, 13, 14
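The first step of claim 4, speech before the indicator, then a tone chosen by content type, then speech after the indicator, can be sketched as a simple segmentation pass. The `[^type:id]` indicator markup, the tone names, and `plan_audio` are assumptions made for illustration; the claim does not specify how indicators are encoded or which tones are used.

```python
import re
from typing import List, Tuple

# Hypothetical mapping from secondary-content type to a tone identifier.
TONE_BY_TYPE = {"footnote": "tone_footnote", "endnote": "tone_endnote"}
DEFAULT_TONE = "tone_default"

# Assumed indicator markup, e.g. "[^footnote:1]".
INDICATOR = re.compile(r"\[\^(?P<type>[a-z]+):(?P<id>\w+)\]")


def plan_audio(text: str) -> List[Tuple[str, str]]:
    """Split text into ("speech", text) and ("tone", tone_name) segments."""
    segments: List[Tuple[str, str]] = []
    pos = 0
    for match in INDICATOR.finditer(text):
        # Text before the indicator becomes synthesized speech.
        before = text[pos:match.start()].strip()
        if before:
            segments.append(("speech", before))
        # The indicator itself becomes a tone selected by content type.
        tone = TONE_BY_TYPE.get(match.group("type"), DEFAULT_TONE)
        segments.append(("tone", tone))
        pos = match.end()
    # Remaining text after the last indicator becomes speech.
    after = text[pos:].strip()
    if after:
        segments.append(("speech", after))
    return segments
```

A downstream player would then synthesize each `"speech"` segment and play each `"tone"` segment in order, so the listener hears the tone at the point where the indicator appears in the text.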
15. A computing device comprising:
- one or more processors; and
- a memory including instructions operable to be executed by the one or more processors to perform a set of actions to configure the device to:
  - output first audio including an audio tone preceded by first synthesized speech and followed by second synthesized speech, the audio tone associated with an indicator of first secondary content located in a string of text and based on a type of the first secondary content, the first synthesized speech associated with a portion of the string of text prior to the indicator and the second synthesized speech associated with a portion of the string of text following the indicator;
  - detect first contact on a touch-screen of a computing device;
  - determine that the first contact corresponds to a first arc gesture, the first contact extending along both a horizontal axis and a vertical axis from a first point to a second point, a difference between a first horizontal coordinate associated with the first point and a second horizontal coordinate associated with the second point exceeding a horizontal threshold in a first direction relative to the first point, and a difference between a first vertical coordinate associated with the first point and a second vertical coordinate associated with a midpoint of the first contact exceeding a vertical threshold;
  - select the first secondary content that corresponds to the first arc gesture;
  - identify supplemental text associated with the first secondary content; and
  - output second audio corresponding to the supplemental text in response to the first arc gesture.
Dependent claims: 16, 17, 18, 19, 20, 21, 22
Specification