Speech recognition and summarization
First Claim
Patent Images
1. A method comprising:
- in one or more processing devices, executing instructions to perform operations comprising;
receiving two or more data sets, each data set representing speech of a corresponding individual attending a social networking video conference session;
decoding the received data sets to produce corresponding text for each individual attending the social networking video conference session;
detecting one or more topics of the social networking video conference session from a transcript produced from the text for each individual attending the social networking video conference session; and
providing, to one or more of the attending individuals of the social networking video conference session, context relating to the one or more topics of the social networking video conference session detected from the transcript produced from the text for each individual attending the social networking video conference session.
2 Assignments
0 Petitions
Accused Products
Abstract
The subject matter of this specification can be embodied in, among other things, a method that includes receiving two or more data sets each representing speech of a corresponding individual attending an internet-based social networking video conference session, decoding the received data sets to produce corresponding text for each individual attending the internet-based social networking video conference, and detecting characteristics of the session from a coalesced transcript produced from the decoded text of the attending individuals for providing context to the internet-based social networking video conference session.
-
Citations
20 Claims
-
1. A method comprising:
in one or more processing devices, executing instructions to perform operations comprising; receiving two or more data sets, each data set representing speech of a corresponding individual attending a social networking video conference session; decoding the received data sets to produce corresponding text for each individual attending the social networking video conference session; detecting one or more topics of the social networking video conference session from a transcript produced from the text for each individual attending the social networking video conference session; and providing, to one or more of the attending individuals of the social networking video conference session, context relating to the one or more topics of the social networking video conference session detected from the transcript produced from the text for each individual attending the social networking video conference session. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
8. A system comprising:
-
one or more computers and one or more storage devices storing instructions that are operable, when executed by the one or more computers, to cause the one or more computers to perform operations comprising; receiving two or more data sets, each data set representing speech of a corresponding individual attending a social networking video conference session; decoding the received data sets to produce corresponding text for each individual attending the social networking video conference session; detecting one or more topics of the social networking video conference session from a transcript produced from the text for each individual attending the social networking video conference session; and providing, to one or more of the attending individuals of the social networking video conference session, context relating to the one or more topics of the social networking video conference session detected from the transcript produced from the text for each individual attending the social networking video conference session. - View Dependent Claims (9, 10, 11, 12, 13, 14)
-
-
15. One or more non-transitory machine-readable media storing instructions that are executable by one or more computers which, upon such execution, cause the one or more computers to perform operations comprising:
-
receiving two or more data sets, each data set representing speech of a corresponding individual attending a social networking video conference session; decoding the received data sets to produce corresponding text for each individual attending the social networking video conference session; detecting one or more topics of the social networking video conference session from a transcript produced from the text for each individual attending the social networking video conference session; and providing, to one or more of the attending individuals of the social networking video conference session, context relating to the one or more topics of the social networking video conference session detected from the transcript produced from the text for each individual attending the social networking video conference session. - View Dependent Claims (16, 17, 18, 19, 20)
-
Specification