SPEECH RECOGNITION FOR INTERNET VIDEO SEARCH AND NAVIGATION
Abstract
Speech representing a desired video site or video subject is detected and digitized at a TV remote and then sent to a TV. The TV, or in some embodiments an Internet server communicating with the TV, uses speech recognition principles to recognize the speech, enters a database using the recognized speech as the entering argument, and returns a link to an Internet site hosting the desired video. The link can be displayed on the TV for selection by a user to retrieve the video.
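The lookup step described in the abstract can be illustrated with a minimal sketch. It assumes the speech has already been recognized into text and that the database is a simple inverted index built from caption text. The function names, the ranking rule (count of matching words), and the example.com links are hypothetical and are not taken from the patent.

```python
import re
from collections import defaultdict


def tokenize(text):
    """Lowercase the text and split it into alphanumeric word tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def build_caption_index(captions_by_link):
    """Map each caption word to the set of video links whose captions contain it."""
    index = defaultdict(set)
    for link, caption_text in captions_by_link.items():
        for word in tokenize(caption_text):
            index[word].add(link)
    return index


def find_links(index, recognized_speech):
    """Rank links by how many distinct recognized words occur in their captions."""
    scores = defaultdict(int)
    for word in set(tokenize(recognized_speech)):
        # .get() avoids creating empty entries in the index for unknown words.
        for link in index.get(word, ()):
            scores[link] += 1
    # Highest score first; ties are broken alphabetically for a stable order.
    return sorted(scores, key=lambda link: (-scores[link], link))


# Hypothetical caption data keyed by the link that hosts each video.
CAPTIONS = {
    "https://example.com/videos/cooking": "today we bake sourdough bread at home",
    "https://example.com/videos/news": "the evening news covers local weather",
}

index = build_caption_index(CAPTIONS)
results = find_links(index, "sourdough bread")
```

Here `results` holds the candidate links that a TV could display for selection. A production system would likely weight terms and tolerate recognition errors instead of counting exact word matches.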
32 Citations
19 Claims
1. A system, comprising:
an audio video device (AVD) configured for communicating with the Internet;
at least one computer memory comprising instructions executable by at least one processor for:
receiving speech signals of a viewer of the AVD and representing a viewer-desired video site or video subject;
implementing speech recognition on received speech signals representing a desired video site or video subject to generate recognized speech;
accessing at least one database including at least one index derived from closed captioned text in a televised video program received by the AVD; and
correlating speech with at least one element of the index to return at least one matching index element, the matching index element useful for providing video to the AVD.
Dependent claims: 2, 3, 4, 5

6. A method for returning a desired video, comprising:
accessing closed captioned text received in a video program;
providing signals representing human speech to an audio video device (AVD), the speech being related to the video;
recognizing at least phonemes in the speech; and
using the phonemes as entering argument, accessing a database including an index derived from the closed captioned text received in the video program to retrieve the desired video.
Dependent claims: 7, 8, 9
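As a rough illustration of using phonemes as the entering argument, the sketch below keys a caption-derived index on phoneme sequences instead of spellings. The tiny pronunciation lexicon (ARPAbet-style symbols), the video identifiers, and the function names are all hypothetical. A real system would take its phonemes from the speech recognizer, and the patent does not specify this data layout.

```python
from collections import defaultdict

# Hypothetical pronunciation lexicon: word -> phoneme sequence.
# Homophones such as "weather" and "whether" share a single key.
LEXICON = {
    "weather": ("W", "EH", "DH", "ER"),
    "whether": ("W", "EH", "DH", "ER"),
    "bread": ("B", "R", "EH", "D"),
    "news": ("N", "UW", "Z"),
}


def build_phoneme_index(captions_by_video):
    """Map each caption word's phoneme sequence to the videos containing it."""
    index = defaultdict(set)
    for video_id, caption in captions_by_video.items():
        for word in caption.lower().split():
            phonemes = LEXICON.get(word)
            if phonemes is not None:  # words missing from the lexicon are skipped
                index[phonemes].add(video_id)
    return index


def lookup(index, spoken_phoneme_words):
    """Return videos whose captions contain every spoken phoneme sequence."""
    matches = None
    for phonemes in spoken_phoneme_words:
        videos = index.get(tuple(phonemes), set())
        matches = set(videos) if matches is None else matches & videos
    return sorted(matches or [])


# Hypothetical caption text keyed by video identifier.
CAPTIONS_BY_VIDEO = {
    "video-1": "local weather and news",
    "video-2": "fresh bread",
}

phoneme_index = build_phoneme_index(CAPTIONS_BY_VIDEO)
hits = lookup(phoneme_index, [("W", "EH", "DH", "ER"), ("N", "UW", "Z")])
```

Keying on phonemes means the lookup does not depend on which spelling a recognizer would have chosen for a homophone, which is one plausible reason to match at that level.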
10. A device comprising:
a computer memory that is not a transitory signal and that comprises instructions executable by at least one processor to:
recognize a viewer's digitized speech representing a video and generating recognized speech in response;
accessing a data structure correlating speech representing video to computer storage locations of stored video, the data structure being generated at least in part using metadata received in video presented on an audio video device (AVD) and/or closed caption text received in video presented on the AVD; and
retrieving, from the data structure, at least an identification of at least one video correlated to a match of the recognized speech.
Dependent claims: 11, 12, 13, 14, 15, 16, 17, 18, 19
Specification