Topic shift detector
Abstract
Aspects detect or recognize shifts in topics in computer implemented speech recognition processes as a function of mapping keywords to non-verbal cues. An initial topic is mapped to one or more keywords extracted from a first spoken query within a user keyword ontology mapping. A query spoken subsequent in time to the first query is identified and distinguished by recognizing one or more non-verbal cues associated with the audio data input that include a time elapsed between the queries, and in some aspects a user's facial expression or motion activity. Aspects determine whether the second spoken query is directed to the initial topic or to a new topic that is different from the initial topic, as a function of mappings of the keyword(s) extracted from the first query to one or more keywords extracted from the second query and to the non-verbal cue(s) within the user ontology mapping.
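The decision described in the abstract can be illustrated with a minimal Python sketch. Everything named here is an assumption made for illustration and is not taken from the patent: the `ONTOLOGY` dictionary stands in for the user keyword ontology mapping, `MAX_GAP_SECONDS` is an arbitrary cutoff for the elapsed-time cue, and the rule in `is_new_topic` (no shared topic, or too long a pause) is only one simple way the keyword mappings and the non-verbal cue might be combined.

```python
from __future__ import annotations

from dataclasses import dataclass

# Hypothetical user keyword ontology: keyword -> topics it maps to.
ONTOLOGY = {
    "pizza": {"restaurants"},
    "restaurant": {"restaurants"},
    "open": {"restaurants", "stores"},
    "weather": {"weather"},
    "rain": {"weather"},
}

# Assumed cutoff for the elapsed-time cue; the source gives no value.
MAX_GAP_SECONDS = 120.0


@dataclass
class Query:
    keywords: list[str]   # keywords extracted from the spoken query
    start_time: float     # seconds from the start of the audio input
    end_time: float


def topics_for(keywords, ontology=ONTOLOGY):
    """Return the union of topics the ontology maps the keywords to."""
    topics = set()
    for keyword in keywords:
        topics |= ontology.get(keyword.lower(), set())
    return topics


def is_new_topic(first, second, ontology=ONTOLOGY, max_gap=MAX_GAP_SECONDS):
    """Illustrative rule: the second query starts a new topic when its
    keywords share no topic with the first query's keywords, or when the
    pause between the two queries exceeds max_gap seconds."""
    shared = topics_for(first.keywords, ontology) & topics_for(second.keywords, ontology)
    gap = second.start_time - first.end_time
    return not shared or gap > max_gap


first = Query(["pizza", "restaurant"], 0.0, 3.0)
follow_up = Query(["open"], 10.0, 12.0)      # shares the "restaurants" topic
unrelated = Query(["weather"], 10.0, 12.0)   # no topic in common
late = Query(["open"], 500.0, 502.0)         # same keywords, long pause
```

In this sketch `is_new_topic(first, follow_up)` evaluates to false, while the unrelated and late queries are both treated as topic shifts. A fuller treatment would weigh further cues such as facial expression or motion activity rather than applying a single hard threshold.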
18 Claims
1. A computer-implemented method for detecting shifts in topics in computer implemented speech recognition processes as a function of mapping keywords to non-verbal cues, the method comprising executing on a computer processor:
identifying an initial topic of a first spoken query within an audio data input from a user that is mapped to at least one keyword extracted from the first spoken query within a user keyword ontology mapping;
identifying a second spoken query within the audio data input that is subsequent in time to the first spoken query and is distinguished from the first query by recognizing at least one non-verbal cue associated with the audio data input, wherein the at least one non-verbal cue comprises a time elapsed between the first spoken query and the second spoken query, and a user's motion activity relative to a programmable device comprising the processor during a time between the first spoken query and the second spoken query;
determining whether the second spoken query is directed to the initial topic or to a new topic that is different from the initial topic, as a function of mappings of the at least one keyword extracted from the first query to at least one keyword extracted from the second query and to the at least one non-verbal cue within the user ontology mapping; and
determining that the second spoken query is directed to the new topic in response to determining that the user's motion activity indicates motion away from progressing toward a geographic reference point of a result that satisfies the first spoken query.
Dependent claims: 2, 3, 4, 5, 6, 7
8. A system, comprising:
a processor;
a computer readable memory in circuit communication with the processor; and
a computer readable storage medium in circuit communication with the processor;
wherein the processor executes program instructions stored on the computer-readable storage medium via the computer readable memory and thereby:
identifies an initial topic of a first spoken query within an audio data input from a user that is mapped to at least one keyword extracted from the first spoken query within a user keyword ontology mapping;
identifies a second spoken query within the audio data input that is subsequent in time to the first spoken query and is distinguished from the first query by recognizing at least one non-verbal cue associated with the audio data input, wherein the at least one non-verbal cue comprises a time elapsed between the first spoken query and the second spoken query, and a user's motion activity relative to a programmable device comprising the processor during a time between the first spoken query and the second spoken query;
determines whether the second spoken query is directed to the initial topic or to a new topic that is different from the initial topic, as a function of mappings of the at least one keyword extracted from the first query to at least one keyword extracted from the second query and to the at least one non-verbal cue within the user ontology mapping; and
determines that the second spoken query is directed to the new topic in response to determining that the user's motion activity indicates motion away from progressing toward a geographic reference point of a result that satisfies the first spoken query.
Dependent claims: 9, 10, 11, 12, 13
14. A computer program product for detecting shifts in topics in computer implemented speech recognition processes as a function of mapping keywords to non-verbal cues, the computer program product comprising:
a computer readable storage medium having computer readable program code embodied therewith, wherein the computer readable storage medium is not a transitory signal per se, the computer readable program code comprising instructions for execution by a processor that cause the processor to:
identify an initial topic of a first spoken query within an audio data input from a user that is mapped to at least one keyword extracted from the first spoken query within a user keyword ontology mapping;
identify a second spoken query within the audio data input that is subsequent in time to the first spoken query and is distinguished from the first query by recognizing at least one non-verbal cue associated with the audio data input, wherein the at least one non-verbal cue comprises a time elapsed between the first spoken query and the second spoken query, and a user's motion activity relative to a programmable device comprising the processor during a time between the first spoken query and the second spoken query;
determine whether the second spoken query is directed to the initial topic or to a new topic that is different from the initial topic, as a function of mappings of the at least one keyword extracted from the first query to at least one keyword extracted from the second query and to the at least one non-verbal cue within the user ontology mapping; and
determine that the second spoken query is directed to the new topic in response to determining that the user's motion activity indicates motion away from progressing toward a geographic reference point of a result that satisfies the first spoken query.
Dependent claims: 15, 16, 17, 18
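The final limitation shared by the independent claims, which treats motion away from a geographic reference point as a sign of a new topic, can be sketched as follows. This is a rough illustration under stated assumptions and not the claimed implementation: the device is assumed to report (latitude, longitude) samples between the two queries, distance is computed with the standard haversine formula, and `moving_away` and its `tolerance_m` parameter are hypothetical names chosen for the example.

```python
import math

EARTH_RADIUS_M = 6_371_000.0  # mean Earth radius in meters


def haversine_m(a, b):
    """Great-circle distance in meters between two (lat, lon) points in degrees."""
    lat1, lon1 = map(math.radians, a)
    lat2, lon2 = map(math.radians, b)
    dlat = lat2 - lat1
    dlon = lon2 - lon1
    h = math.sin(dlat / 2) ** 2 + math.cos(lat1) * math.cos(lat2) * math.sin(dlon / 2) ** 2
    return 2 * EARTH_RADIUS_M * math.asin(math.sqrt(h))


def moving_away(samples, reference, tolerance_m=25.0):
    """Assumed rule: the user is moving away when the last position sample
    is farther from the reference point than the first sample by more than
    tolerance_m meters. Fewer than two samples gives no evidence."""
    if len(samples) < 2:
        return False
    start = haversine_m(samples[0], reference)
    end = haversine_m(samples[-1], reference)
    return end - start > tolerance_m


# Hypothetical location of a result that satisfied the first spoken query.
restaurant = (40.0, -75.0)

# Position samples recorded between the first and second queries.
leaving = [(40.001, -75.0), (40.002, -75.0), (40.005, -75.0)]
approaching = list(reversed(leaving))
```

Here `moving_away(leaving, restaurant)` is true, so under the claimed logic the second query would be treated as directed to a new topic, whereas `moving_away(approaching, restaurant)` is false. Comparing only the first and last samples is a deliberate simplification; a noisy position trace would likely call for smoothing or a trend estimate over all samples.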
Specification