REAL-TIME SPEAKER STATE ANALYTICS PLATFORM
Abstract
Disclosed are machine learning-based technologies that analyze an audio input and provide speaker state predictions in response to the audio input. The speaker state predictions can be selected and customized for each of a variety of different applications.
22 Claims
1. A speech analytics platform implemented in one or more computing devices, for providing speech-derived speaker state data as a service, the platform comprising:
- a speech data processing subsystem embodied in one or more non-transitory machine accessible storage media, the speech data processing subsystem configured to produce speech data corresponding to audio input captured from a human or synthetic speaker, the produced speech data being dynamically segmented for real-time speech-based speaker state determination; and
- a plurality of analytics engines embodied in one or more non-transitory machine accessible storage media, wherein each of the plurality of analytics engines is configured to receive the pre-processed speech data from the speech data processing subsystem and provide as output a speaker state indicator, the plurality of analytics engines comprising:
  - an automatic speech recognition module configured to perform a speech recognition operation on the speech data; and
  - a plurality of algorithms each configured for a different type of speaker state analytics, at least one of the algorithms extracting at least one non-word feature of the speech data and outputting speaker state data relating to the type of speaker state analytics for which the at least one algorithm has been configured.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12
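The arrangement recited in claim 1 (one segment of speech data fanned out to several analytics engines, each returning a speaker state indicator) can be pictured with a minimal Python sketch. Everything named here is hypothetical and not taken from the patent: `SpeechSegment`, `stub_asr`, the two toy engines, and the scaling constants are illustrative assumptions only, and the ASR step is a stub rather than a real recognizer.

```python
import math
from dataclasses import dataclass
from typing import Callable, Dict, List, Sequence


@dataclass
class SpeechSegment:
    """One dynamically segmented chunk of speech data (hypothetical structure)."""
    samples: Sequence[float]  # audio samples in [-1.0, 1.0]
    sample_rate: int          # samples per second


def stub_asr(segment: SpeechSegment) -> List[str]:
    """Placeholder for the automatic speech recognition module.

    A real platform would call a recognizer here; this stub returns fixed words.
    """
    return ["hello", "there"]


def rms_energy(segment: SpeechSegment) -> float:
    """A non-word feature: root-mean-square energy of the segment."""
    if not segment.samples:
        return 0.0
    return math.sqrt(sum(s * s for s in segment.samples) / len(segment.samples))


def arousal_engine(segment: SpeechSegment) -> Dict[str, float]:
    """Toy engine: maps loudness to an 'arousal' indicator in [0, 1] (assumed scaling)."""
    return {"arousal": min(1.0, rms_energy(segment) * 2.0)}


def speaking_rate_engine(segment: SpeechSegment) -> Dict[str, float]:
    """Toy engine: words per second, from ASR output and segment duration."""
    duration = len(segment.samples) / segment.sample_rate
    words = stub_asr(segment)
    return {"words_per_second": len(words) / duration if duration > 0 else 0.0}


# Each engine receives the same segment and returns its own indicator.
ENGINES: List[Callable[[SpeechSegment], Dict[str, float]]] = [
    arousal_engine,
    speaking_rate_engine,
]


def analyze(segment: SpeechSegment) -> Dict[str, float]:
    """Send one segment to every engine and merge the resulting indicators."""
    indicators: Dict[str, float] = {}
    for engine in ENGINES:
        indicators.update(engine(segment))
    return indicators
```

Calling `analyze(SpeechSegment(samples=[0.25] * 16000, sample_rate=16000))` would return one dictionary holding both indicators. Keeping each engine as a plain function of the segment is one simple way to let engines be added or removed independently; the patent itself does not prescribe this interface.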
13. A system configured to provide output comprising speech-derived speaker state information, the system configured to:
- capture a speech signal from a speaker;
- convert the speech signal to a predetermined format configured to facilitate an interaction-time analysis of non-lexical features extracted from the speech signal by dynamically segmenting the speech signal using speech activity detection;
- select a plurality of analytics engines based at least partly on an application specification; and
- operate the selected analytics engines to, in an interaction time:
  - extract the non-lexical features from the speech signal,
  - analyze the non-lexical features, and,
  - based on the analyzing of the non-lexical features, provide as output a plurality of different speaker state indicators.
Dependent claims: 14, 15, 16, 17, 18, 19, 20, 21, 22
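The steps of claim 13 (segment with speech activity detection, select engines from an application specification, then extract and analyze non-lexical features) can be sketched as follows. This is an illustration under stated assumptions, not the patented method: the energy-threshold detector is a simple stand-in for whatever speech activity detection the system uses, and `ENGINE_REGISTRY`, the `"engines"` key of the specification, the frame length, and the threshold are all hypothetical.

```python
from typing import Callable, Dict, List, Optional, Sequence, Tuple

Engine = Callable[[Sequence[float]], float]


def detect_speech_segments(
    samples: Sequence[float],
    frame_len: int = 160,
    energy_threshold: float = 0.01,
) -> List[Tuple[int, int]]:
    """Return (start, end) sample indices of contiguous speech regions.

    A frame counts as speech when its mean squared amplitude exceeds the
    threshold (a simple stand-in for speech activity detection).
    """
    segments: List[Tuple[int, int]] = []
    start: Optional[int] = None
    for i in range(0, len(samples), frame_len):
        frame = samples[i:i + frame_len]
        energy = sum(s * s for s in frame) / len(frame)
        if energy > energy_threshold:
            if start is None:
                start = i  # speech begins at this frame
        elif start is not None:
            segments.append((start, i))  # speech ended at the previous frame
            start = None
    if start is not None:
        segments.append((start, len(samples)))
    return segments


def mean_energy(samples: Sequence[float]) -> float:
    """Non-lexical feature: mean squared amplitude."""
    return sum(s * s for s in samples) / len(samples) if samples else 0.0


def zero_crossing_rate(samples: Sequence[float]) -> float:
    """Non-lexical feature: fraction of adjacent sample pairs that change sign."""
    if len(samples) < 2:
        return 0.0
    crossings = sum((a < 0) != (b < 0) for a, b in zip(samples, samples[1:]))
    return crossings / (len(samples) - 1)


# Hypothetical registry of available analytics engines.
ENGINE_REGISTRY: Dict[str, Engine] = {
    "energy": mean_energy,
    "zero_crossing_rate": zero_crossing_rate,
}


def select_engines(app_spec: Dict[str, List[str]]) -> Dict[str, Engine]:
    """Pick the engines named in the application specification, skipping unknown names."""
    return {n: ENGINE_REGISTRY[n] for n in app_spec.get("engines", []) if n in ENGINE_REGISTRY}


def run(samples: Sequence[float], app_spec: Dict[str, List[str]]) -> List[Dict[str, float]]:
    """Segment the signal, then run every selected engine on each speech segment."""
    engines = select_engines(app_spec)
    return [
        {name: fn(samples[start:end]) for name, fn in engines.items()}
        for start, end in detect_speech_segments(samples)
    ]
```

For example, `run(signal, {"engines": ["energy"]})` would yield one dictionary of indicators per detected speech segment. Driving engine selection from a small specification object keeps the per-application choice separate from the engines themselves.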
Specification