Enhancing speech recognition with domain-specific knowledge to detect topic-related content
Abstract
Methods, systems, and computer-readable storage media for providing action items from audio within an enterprise context. In some implementations, actions include determining a context of audio that is to be processed, providing training data to a speech recognition component, the training data being provided based on the context, receiving text from the speech recognition component, processing the text to identify one or more action items by identifying one or more concepts within the text and matching the one or more concepts to respective transitions in an automaton, and providing the one or more action items for display to one or more users.
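The step of matching concepts to transitions in an automaton can be illustrated with a minimal sketch. The concept lexicon, the state names, and the transition table below are all hypothetical and chosen only for illustration; the source does not specify how concepts are identified or how the automaton is structured.

```python
# Hypothetical concept lexicon: surface word -> concept label (assumption).
CONCEPTS = {
    "please": "REQUEST",
    "send": "ACTION",
    "schedule": "ACTION",
    "report": "OBJECT",
    "meeting": "OBJECT",
}

# Hypothetical automaton: (current state, concept) -> next state (assumption).
TRANSITIONS = {
    ("start", "REQUEST"): "requested",
    ("start", "ACTION"): "acting",
    ("requested", "ACTION"): "acting",
    ("acting", "OBJECT"): "accept",
}


def identify_concepts(sentence):
    """Return (word, concept) pairs for the words found in the lexicon."""
    words = [w.strip(".,!?").lower() for w in sentence.split()]
    return [(w, CONCEPTS[w]) for w in words if w in CONCEPTS]


def extract_action_items(transcript):
    """Walk the automaton once per sentence; emit an item on reaching 'accept'."""
    items = []
    for sentence in transcript.split("."):
        state, matched = "start", []
        for word, concept in identify_concepts(sentence):
            next_state = TRANSITIONS.get((state, concept))
            if next_state is None:
                continue  # no transition for this concept from this state; skip it
            state = next_state
            matched.append(word)
            if state == "accept":
                items.append(" ".join(matched))
                state, matched = "start", []
    return items


transcript = (
    "Thanks everyone. Please send the quarterly report by Friday. "
    "The meeting went well. We should schedule a meeting."
)
print(extract_action_items(transcript))
```

In this sketch a sentence yields an action item only when its concepts drive the automaton from the start state to the accepting state, so a sentence that merely mentions a meeting produces nothing.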
17 Claims
1. A computer-implemented method for providing action items from an audio file within an enterprise context, the method being executed using one or more processors and comprising:
determining, by the one or more processors, a context of the audio file that is to be processed based on a user input indicating a training data selection;
providing, by the one or more processors, training data to a speech recognition component, the training data being in a format recognizable by the speech recognition component and being provided based on the context;
receiving, by the one or more processors, a textual transcript corresponding to the audio file from the speech recognition component;
processing, by the one or more processors, the textual transcript to identify one or more action items by identifying one or more concepts within the textual transcript and matching the one or more concepts to respective transitions in an automaton; and
providing the one or more action items for display to one or more users.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15
16. A non-transitory computer-readable storage medium coupled to one or more processors and having instructions stored thereon which, when executed by the one or more processors, cause the one or more processors to perform operations for providing action items from an audio file within an enterprise context, the operations comprising:
determining, by the one or more processors, a context of the audio file that is to be processed based on a user input indicating a training data selection;
providing, by the one or more processors, training data to a speech recognition component, the training data being in a format recognizable by the speech recognition component and being provided based on the context;
receiving, by the one or more processors, a textual transcript corresponding to the audio file from the speech recognition component;
processing, by the one or more processors, the textual transcript to identify one or more action items by identifying one or more concepts within the textual transcript and matching the one or more concepts to respective transitions in an automaton; and
providing the one or more action items for display to one or more users.
17. A system, comprising:
a computing device; and
a computer-readable storage device coupled to the computing device and having instructions stored thereon which, when executed by the computing device, cause the computing device to perform operations for providing action items from an audio file within an enterprise context, the operations comprising:
determining a context of the audio file that is to be processed based on a user input indicating a training data selection;
providing training data to a speech recognition component, the training data being in a format recognizable by the speech recognition component and being provided based on the context;
receiving a textual transcript corresponding to the audio file from the speech recognition component;
processing the textual transcript to identify one or more action items by identifying one or more concepts within the textual transcript and matching the one or more concepts to respective transitions in an automaton; and
providing the one or more action items for display to one or more users.
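The first three operations recited in the claims (determining a context from a user's training data selection, providing formatted training data to the speech recognition component, and receiving a transcript) can be sketched as follows. Everything named here is an assumption made for illustration: the `TRAINING_DATA` table, the one-phrase-per-line format, and the `StubRecognizer` class, which stands in for a real speech recognition component and returns a canned transcript rather than processing audio.

```python
# Hypothetical per-context training data: domain phrases the recognizer
# should expect. The contexts and phrases are illustrative assumptions.
TRAINING_DATA = {
    "sales": ["purchase order", "quotation", "invoice"],
    "hr": ["onboarding", "payroll", "leave request"],
}


def determine_context(user_selection):
    """Map the user's training data selection to a known context."""
    key = user_selection.strip().lower()
    if key not in TRAINING_DATA:
        raise ValueError(f"unknown training data selection: {user_selection!r}")
    return key


def format_training_data(context):
    """Serialize phrases one per line (an assumed recognizer input format)."""
    return "\n".join(TRAINING_DATA[context]) + "\n"


class StubRecognizer:
    """Stand-in for a speech recognition component (not a real engine)."""

    def __init__(self):
        self.vocabulary = []

    def load_training_data(self, text):
        # Keep every non-empty line as one vocabulary phrase.
        self.vocabulary = [line for line in text.splitlines() if line]

    def transcribe(self, audio_path):
        # A real component would decode the audio; this returns fixed text.
        return "Please send the invoice to the customer."


def transcribe_with_context(audio_path, user_selection, recognizer):
    """Select training data by context, load it, then request a transcript."""
    context = determine_context(user_selection)
    recognizer.load_training_data(format_training_data(context))
    return recognizer.transcribe(audio_path)


recognizer = StubRecognizer()
text = transcribe_with_context("call.wav", " Sales ", recognizer)
print(recognizer.vocabulary)
print(text)
```

The point of the sketch is the ordering: the context is resolved before any recognition happens, so the recognizer is primed with domain vocabulary before it produces the transcript.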
Specification