Automated closed captioning using temporal data
First Claim
1. A system for increasing accuracy of computer speech recognition comprising:
- a dynamic grammar builder computing device comprising one or more processing units and one or more computer-readable media comprising computer-executable instructions which, when executed by the one or more processing units, cause the dynamic grammar builder computing device to;
obtain social network data occurring within a threshold timespan of a broadcast of media content;
identify named entities from the obtained social network data that are trending within the obtained social network data;
rank the identified named entities based upon the trending; and
build a dynamic grammar comprising at least some of the named entities based upon the ranking; and
a speech recognition computing device comprising one or more processing units and one or more computer-readable media comprising computer-executable instructions which, when executed by the one or more processing units, cause the speech recognition computing device to;
perform speech recognition of spoken words, spoken by the broadcast of the media content, utilizing the dynamic grammar to create closed caption text for the broadcast of the media content.
1 Assignment
0 Petitions
Accused Products
Abstract
One or more systems and/or techniques are provided for automatic closed captioning for media content. In an example, real-time content, occurring within a threshold timespan of a broadcast of media content (e.g., social network posts occurring during and an hour before a live broadcast of an interview), may be accessed. A list of named entities, occurring within the social network data, may be generated (e.g., Interviewer Jon, Interviewee Kathy, Husband Dave, Son Jack, etc.). A ranked list of named entities may be created based upon trending named entities within the list of named entities (e.g., a named entity may be ranked higher based upon a more frequent occurrence within the social network posts). A dynamic grammar (e.g., library, etc.) may be built based upon the ranked list of named entities. Speech recognition may be performed upon the broadcast of media content utilizing the dynamic grammar to create closed caption text.
-
Citations
20 Claims
-
1. A system for increasing accuracy of computer speech recognition comprising:
-
a dynamic grammar builder computing device comprising one or more processing units and one or more computer-readable media comprising computer-executable instructions which, when executed by the one or more processing units, cause the dynamic grammar builder computing device to; obtain social network data occurring within a threshold timespan of a broadcast of media content; identify named entities from the obtained social network data that are trending within the obtained social network data; rank the identified named entities based upon the trending; and build a dynamic grammar comprising at least some of the named entities based upon the ranking; and a speech recognition computing device comprising one or more processing units and one or more computer-readable media comprising computer-executable instructions which, when executed by the one or more processing units, cause the speech recognition computing device to; perform speech recognition of spoken words, spoken by the broadcast of the media content, utilizing the dynamic grammar to create closed caption text for the broadcast of the media content. - View Dependent Claims (2, 3, 4, 5, 6)
-
-
7. A computing device comprising:
-
one or more processing units; and one or more computer-readable media comprising computer-executable instructions which, when executed by the one or more processing units, cause the computing device to; obtain real-time content occurring within a threshold timespan of a broadcast of media content; identify named entities from the obtained real-time content that are trending within the obtained real-time content; ranking the identified named entities based upon the trending; building a dynamic grammar comprising at least some of the named entities based upon the ranking; and utilizing the dynamic grammar for correcting user generated closed captioning for the broadcast of the media content.
-
-
8. A method for increasing accuracy of computer speech recognition comprising:
-
obtaining, by a computing device, social network data occurring within a threshold timespan of a broadcast of media content; identifying, on the computing device, named entities from the obtained social network data that are trending within the obtained social network data; ranking, on the computing device, the identified named entities based upon the trending; building, on the computing device, a dynamic grammar comprising at least some of the named entities based upon the ranking; and performing computer speech recognition of spoken words, spoken by the broadcast of the media content, utilizing the dynamic grammar to create closed caption text for the broadcast of the media content. - View Dependent Claims (9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20)
-
Specification