Speech recognition and summarization
First Claim
Patent Images
1. A computer-implemented method comprising:
- receiving, by a videoconference system that includes an automated speech recognizer and a context builder and from a first computing device that includes a microphone, speech data representing an utterance spoken by a particular participant of a video conference and captured by the microphone of the first computing device;
transcribing, by the automated speech recognizer of the video-conference system, the speech data representing the utterance spoken by the particular participant of the video conference into text in real-time;
determining, by the automated speech recognizer of the videoconference system, a topic of the video conference by analyzing one or more words and/or phrases in the text of the speech data;
annotating, by the automated speech recognizer of the videoconference system, the text of the speech data by;
determining one or more relevant terms in the text of the speech data as being potentially relevant to the determined topic; and
identifying, using the one or more relevant terms in the text, one or more resources associated with the determined topic of the video conference, each identified resource comprising at least one of advertising content, a search result, an event, or a location; and
for each identified resource;
generating, using the context builder of the video conference system, a user interface component for the identified resource; and
outputting, by the context builder of the video conference system, the corresponding user interface component for the identified resource to a second computing device in real-time, the corresponding user interface component when received by the second computing device causing the second computing device to display the corresponding user interface component on a videoconference graphical user interface executing on the second computing device.
2 Assignments
0 Petitions
Accused Products
Abstract
The subject matter of this specification can be embodied in, among other things, a method that includes receiving two or more data sets each representing speech of a corresponding individual attending an internet-based social networking video conference session, decoding the received data sets to produce corresponding text for each individual attending the internet-based social networking video conference, and detecting characteristics of the session from a coalesced transcript produced from the decoded text of the attending individuals for providing context to the internet-based social networking video conference session.
90 Citations
17 Claims
-
1. A computer-implemented method comprising:
-
receiving, by a videoconference system that includes an automated speech recognizer and a context builder and from a first computing device that includes a microphone, speech data representing an utterance spoken by a particular participant of a video conference and captured by the microphone of the first computing device; transcribing, by the automated speech recognizer of the video-conference system, the speech data representing the utterance spoken by the particular participant of the video conference into text in real-time; determining, by the automated speech recognizer of the videoconference system, a topic of the video conference by analyzing one or more words and/or phrases in the text of the speech data; annotating, by the automated speech recognizer of the videoconference system, the text of the speech data by; determining one or more relevant terms in the text of the speech data as being potentially relevant to the determined topic; and identifying, using the one or more relevant terms in the text, one or more resources associated with the determined topic of the video conference, each identified resource comprising at least one of advertising content, a search result, an event, or a location; and for each identified resource; generating, using the context builder of the video conference system, a user interface component for the identified resource; and outputting, by the context builder of the video conference system, the corresponding user interface component for the identified resource to a second computing device in real-time, the corresponding user interface component when received by the second computing device causing the second computing device to display the corresponding user interface component on a videoconference graphical user interface executing on the second computing device. - View Dependent Claims (2, 3, 4, 5, 6)
-
-
7. A system comprising:
one or more computers and one or more storage devices storing instructions that are operable, when executed by the one or more computers, to cause the one or more computers to perform operations comprising; receiving, by a videoconference system that includes an automated speech recognizer and a context builder and from a first computing device that includes a microphone, speech data representing an utterance spoken by a particular participant of a video conference and captured by the microphone of the first computing device; transcribing, by the automated speech recognizer of the video-conference system, the speech data representing the utterance spoken by the particular participant of the video conference into text in real-time; determining, by the automated speech recognizer of the videoconference system, a topic of the video conference by analyzing one or more words and/or phrases in the text of the speech data; annotating, by the automated speech recognizer of the videoconference system, the text of the speech data by; determining one or more relevant terms in the text of the speech data as being potentially relevant to the determined topic; and identifying, using the one or more relevant terms in the text, one or more resources associated with the determined topic of the video conference, each identified resource comprising at least one of advertising content, a search result, an event, or a location; and for each identified resource; generating, using the context builder of the video conference system, a corresponding user interface component for the identified resource; and outputting, by the context builder of the video conference system, the corresponding user interface component for the identified resource to a second computing device in real-time, the corresponding user interface component when received by the second computing device causing the second computing device to display the corresponding user interface component on a videoconference graphical user interface executing on the second computing device. - View Dependent Claims (8, 9, 10, 11, 12)
-
13. A computer-readable storage device storing software comprising instructions executable by one or more computers which, upon such execution, cause the one or more computers to perform operations comprising:
-
receiving, by a videoconference system that includes an automated speech recognizer and a context builder and from a first computing device that includes a microphone, speech data representing an utterance spoken by a particular participant of a video conference and captured by the microphone of the first computing device; transcribing, by the automated speech recognizer of the video-conference system, the speech data representing the utterance spoken by the particular participant of the video conference into text in real-time; determining, by the automated speech recognizer of the videoconference system, a topic of the video conference by analyzing one or more words and/or phrases in the text of the speech data; annotating, by the automated speech recognizer of the videoconference system, the text of the speech data by; determining one or more relevant terms in the text of the speech data as being potentially relevant to the determined topic; and identifying, using the one or more relevant terms in the text, one or more resources associated with the determined topic of the video conference, each identified resource comprising at least one of advertising content, a search result, an event, or a location; and for each identified resource; generating, using the context builder of the video conference system, a corresponding user interface component for the identified resource; and outputting, by the context builder of the video conference system, the corresponding user interface component for the identified resource to a second computing device in real-time, the corresponding user interface component when received by the second computing device causing the second computing device to display the corresponding user interface component on a videoconference graphical user interface executing on the second computing device. - View Dependent Claims (14, 15, 16, 17)
-
Specification