Methods and apparatus to present a video program to a visually impaired person
First Claim
Patent Images
1. A method comprising:
- detecting, with a processor, a text portion of a program from a program stream having a video stream and a first audio stream, the text portion of the program not being consumable by a blind person;
obtaining, with the processor, text associated with the text portion of the program;
converting the text to a second audio stream;
generating pronunciation and prosody annotations based on the second audio stream and a language type associated with the program; and
combining the second audio stream with the first audio stream to form a combined audio stream.
1 Assignment
0 Petitions
Accused Products
Abstract
Methods and apparatus to present a video program to a visually impaired person are disclosed. An example method comprises receiving a video stream and an associated audio stream of a video program, detecting a portion of the video program that is not readily consumable by a visually impaired person, obtaining text associated with the portion of the video program, converting the text to a second audio stream, and combining the second audio stream with the associated audio stream.
36 Citations
38 Claims
-
1. A method comprising:
-
detecting, with a processor, a text portion of a program from a program stream having a video stream and a first audio stream, the text portion of the program not being consumable by a blind person; obtaining, with the processor, text associated with the text portion of the program; converting the text to a second audio stream; generating pronunciation and prosody annotations based on the second audio stream and a language type associated with the program; and combining the second audio stream with the first audio stream to form a combined audio stream. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16)
-
-
17. A set-top box comprising:
-
a network interface to receive a video stream and an associated audio stream; a detector to identify a portion of the video stream that is not consumable by a blind person; a text locator to obtain text associated with the portion of the video stream; a text-to-speech module to convert the text to a second audio stream; a pronunciation and prosody analyzer to generate verbal annotations of the second audio stream based on a language type associated with the video stream; and a mixer to combine the second audio stream with the associated audio stream. - View Dependent Claims (18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28)
-
-
29. A tangible machine readable medium storing instructions which, when executed, cause a machine to perform a method comprising:
-
detecting, with a processor, a text portion of a program from a program stream having a video stream and a first audio stream, the text portion of the program not being consumable by a blind person; obtaining, with the processor, text associated with the text portion of the program; converting the text to a second audio stream; generating pronunciation and prosody annotations based on the second audio stream and a language type associated with the program; and combining the second audio stream with the first audio stream to form a combined audio stream. - View Dependent Claims (30, 31, 32, 33, 34, 35, 36, 37, 38)
-
Specification