Device for extracting information from a dialog
First Claim
Patent Images
1. A device comprising:
- at least one microphone;
a screen display; and
at least one programmable processor and at least one data storage unit for storing digital data, wherein the at least one programmable processor is in communication with the at least one microphone and the screen display, and wherein the at least one programmable processor is programmed to;
automatically recognize speech input by a first speaker received by the at least one microphone, comprising;
receiving the speech input from the first speaker;
determining a recognized speech result based on the received speech input; and
determining, by the computer-based speech translation system, whether there exists a recognition ambiguity in the recognized speech of the first speaker, wherein the recognition ambiguity indicates more than one possible match for the recognized speech result;
translate, by the computer-based speech translation system, the recognized speech result of the first speaker in the first language into a second language;
determine, by the computer-based speech translation system, whether there exists a translation ambiguity for one or more words in the translation of the recognized speech result of the first speaker in the first language into the second language, wherein the translation ambiguity indicates more than one possible translation of the one or more words;
upon a determination by the computer-based speech translation system that there is either (i) a recognition ambiguity in the recognized speech result of the first speaker or (ii) a translation ambiguity in the translation of the recognized speech result of the first speaker in the first language into the second language, determining a confidence score based on the recognition or translation ambiguity; and
responsive to the confidence score being below a threshold, issuing by the computer-based speech translation system a disambiguation query to the first speaker via a user-interface of the speech translation system, wherein a response to the disambiguation query resolves the recognition or translation ambiguity.
3 Assignments
0 Petitions
Accused Products
Abstract
Computer-implemented systems and methods for extracting information during a human-to-human mono-lingual or multi-lingual dialog between two speakers are disclosed. Information from either the recognized speech (or the translation thereof) by the second speaker and/or the recognized speech by the first speaker (or the translation thereof) is extracted. The extracted information is then entered into an electronic form stored in a data store.
-
Citations
37 Claims
-
1. A device comprising:
-
at least one microphone; a screen display; and at least one programmable processor and at least one data storage unit for storing digital data, wherein the at least one programmable processor is in communication with the at least one microphone and the screen display, and wherein the at least one programmable processor is programmed to; automatically recognize speech input by a first speaker received by the at least one microphone, comprising; receiving the speech input from the first speaker; determining a recognized speech result based on the received speech input; and determining, by the computer-based speech translation system, whether there exists a recognition ambiguity in the recognized speech of the first speaker, wherein the recognition ambiguity indicates more than one possible match for the recognized speech result; translate, by the computer-based speech translation system, the recognized speech result of the first speaker in the first language into a second language; determine, by the computer-based speech translation system, whether there exists a translation ambiguity for one or more words in the translation of the recognized speech result of the first speaker in the first language into the second language, wherein the translation ambiguity indicates more than one possible translation of the one or more words; upon a determination by the computer-based speech translation system that there is either (i) a recognition ambiguity in the recognized speech result of the first speaker or (ii) a translation ambiguity in the translation of the recognized speech result of the first speaker in the first language into the second language, determining a confidence score based on the recognition or translation ambiguity; and responsive to the confidence score being below a threshold, issuing by the computer-based speech translation system a disambiguation query to the first speaker via a user-interface of the speech translation system, wherein a response to the disambiguation query resolves the recognition or translation ambiguity. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14)
-
-
15. A computer-based device comprising:
-
at least one microphone; a screen display; at least one data storage unit for storing digital data; a first automatic speech recognition module for automatically recognizing speech input by a first speaker received by the at least one microphone, wherein automatically recognizing the speech input by the first speaker comprises; receiving the speech input from the first speaker; determining a recognized speech result based on the received speech input; an interactive disambiguation module for a determining whether there exists a recognition ambiguity in the recognized speech result of the first speaker, wherein the recognition ambiguity indicates more than one possible match for the recognized speech result, and for determining whether there exists a translation ambiguity for one or more words in the translation of the recognized speech result of the first speaker in the first language into the second language, wherein the translation ambiguity indicates more than one possible translation of the one or more words; a first machine translation module for translating the recognized speech result of the first speaker in the first language into a second language; and wherein the interactive disambiguation module is further configured for, upon a determination that there is either (i) a recognition ambiguity in the recognized speech result of the first speaker or (ii) a translation ambiguity in the translation of the recognized speech result of the first speaker in the first language into the second language, determining a confidence score based on the recognition or translation ambiguity, and responsive to the confidence score being below a threshold, issuing by the computer-based speech translation system a disambiguation query to the first speaker via a user-interface of the speech translation system, wherein a response to the disambiguation query resolves the recognition or translation ambiguity. - View Dependent Claims (16, 17, 18, 19, 20, 21, 22)
-
-
23. A computer-implemented method comprising:
-
recognizing, by a computer-based speech translation system, speech input by a first speaker in a first language, comprising; receiving the speech input from the first speaker; determining a recognized speech result based on the received speech input; and determining, by the computer-based speech translation system, whether there exists a recognition ambiguity in the recognized speech result of the first speaker, wherein the recognition ambiguity indicates more than one possible match for the recognized speech result; translating, by the computer-based speech translation system, the recognized speech result of the first speaker in the first language into a second language; determining, by the computer-based speech translation system, whether there exists a translation ambiguity for one or more words in the translation of the recognized speech result of the first speaker in the first language into the second language, wherein the translation ambiguity indicates more than one possible translation of the one or more words; upon a determination by the computer-based speech translation system that there is either (i) a recognition ambiguity in the recognized speech result of the first speaker or (ii) a translation ambiguity in the translation of the recognized speech result of the first speaker in the first language into the second language, determining a confidence score based on the recognition or translation ambiguity; and responsive to the confidence score being below a threshold, issuing by the computer-based speech translation system a disambiguation query to the first speaker via a user-interface of the speech translation system, wherein a response to the disambiguation query resolves the recognition or translation ambiguity. - View Dependent Claims (24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37)
-
Specification