Real-time speech-to-text conversion in an audio conference session
First Claim
1. A computer system for providing real-time resources to participants in an audio conference session, the computer system comprising:
- a conference system for establishing an audio conference session between a plurality of computing devices connected via a communication network; and
a server configured to communicate with the conference system and the plurality of computing devices via the communication network, the server comprising;
a processor and a memory;
a pre-processing engine stored in the memory and executed by the processor, the pre-processing engine comprising logic configured to;
receive an audio stream associated with one or more of the computing devices, the audio stream comprising a speech signal; and
extract the speech signal from the audio stream;
a speech-to-text conversion engine stored in the memory and executed by the processor, the speech-to-text conversion engine comprising logic configured to extract words from the speech signal;
a relevance engine stored in the memory and executed by the processor, the relevance engine comprising an algorithm for outputting a relevant keyword or topic being discussed in the audio conference session based on a plurality of data inputs, the plurality of data inputs comprising the extracted words from the speech-to-text conversion engine, a speaker identity with a corresponding role or category of one or more participants who spoke the extracted words, the algorithm identifying the relevant keyword or topic by calculating and updating a relevance score during the audio conference session and, if the relevance score exceeds a threshold, outputting the relevant keyword or topic, wherein the relevance score is based on a usage density associated with the one or more extracted words; and
a resources engine stored in the memory and executed by the processor, the resources engine operatively coupled to the relevance engine and comprising logic configured to;
receive from the relevance engine the relevant keyword or topic identified by the algorithm based on the speaker identity with the corresponding role or category;
identify a plurality of resources related to the relevant keyword or topic;
display in a graphical user interface and during the audio conference session, the plurality of resources to the one or more computing devices in a conference user interface associated with the audio conference session; and
enable user-selection of one or more of the plurality of resources via the conference user interface.
7 Assignments
0 Petitions
Accused Products
Abstract
Various embodiments of systems, methods, and computer programs are disclosed for providing real-time resources to participants in an audio conference session. One embodiment is a method for providing real-time resources to participants in an audio conference session via a communication network. One such method comprises: a conferencing system establishing an audio conference session between a plurality of computing devices via a communication network, each computing device generating a corresponding audio stream comprising a speech signal; and in real-time during the audio conference session, a server: receiving and processing the audio streams to determine the speech signals; extracting words from the speech signals; analyzing the extracted words to determine a relevant keyword being discussed in the audio conference session; identifying a resource related to the relevant keyword; and providing the resource to one or more of the computing devices.
-
Citations
18 Claims
-
1. A computer system for providing real-time resources to participants in an audio conference session, the computer system comprising:
-
a conference system for establishing an audio conference session between a plurality of computing devices connected via a communication network; and a server configured to communicate with the conference system and the plurality of computing devices via the communication network, the server comprising; a processor and a memory; a pre-processing engine stored in the memory and executed by the processor, the pre-processing engine comprising logic configured to;
receive an audio stream associated with one or more of the computing devices, the audio stream comprising a speech signal; and
extract the speech signal from the audio stream;a speech-to-text conversion engine stored in the memory and executed by the processor, the speech-to-text conversion engine comprising logic configured to extract words from the speech signal; a relevance engine stored in the memory and executed by the processor, the relevance engine comprising an algorithm for outputting a relevant keyword or topic being discussed in the audio conference session based on a plurality of data inputs, the plurality of data inputs comprising the extracted words from the speech-to-text conversion engine, a speaker identity with a corresponding role or category of one or more participants who spoke the extracted words, the algorithm identifying the relevant keyword or topic by calculating and updating a relevance score during the audio conference session and, if the relevance score exceeds a threshold, outputting the relevant keyword or topic, wherein the relevance score is based on a usage density associated with the one or more extracted words; and a resources engine stored in the memory and executed by the processor, the resources engine operatively coupled to the relevance engine and comprising logic configured to; receive from the relevance engine the relevant keyword or topic identified by the algorithm based on the speaker identity with the corresponding role or category; identify a plurality of resources related to the relevant keyword or topic; display in a graphical user interface and during the audio conference session, the plurality of resources to the one or more computing devices in a conference user interface associated with the audio conference session; and
enable user-selection of one or more of the plurality of resources via the conference user interface. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
-
-
9. A method for providing real-time resources to participants in an audio conference session via a communication network, the method comprising:
-
a conferencing system establishing an audio conference session between a plurality of computing devices via a communication network, each computing device generating a corresponding audio stream comprising a speech signal; and in real-time during the audio conference session, a server; receiving and processing the audio streams to determine the speech signals; extracting words from the speech signals; inputting the extracted words to a relevance algorithm; the relevance algorithm outputting a relevant keyword being discussed in the audio conference session based on a speaker identity with a corresponding role or category of one or more participants who spoke the extracted words, the relevant keyword determined by calculating and updating a relevance score during the audio conference session and, if the relevance score exceeds a threshold, outputting the relevant keyword, wherein the relevance score is based on a usage density associated with the one or more extracted words; identifying a plurality of resources related to the relevant keyword output from the relevance algorithm based on the speaker identity with the corresponding role or category of the one or more participants who spoke the extracted words; displaying the plurality of resources to one or more of the computing devices in a conference user interface associated with the audio conference session; and receiving a user selection of one or more of the plurality of resources to present, via the conference user interface to the one or more participants. - View Dependent Claims (10, 11, 12, 13, 14, 15, 16, 17, 18)
-
Specification