Speech to text for assisted form completion
First Claim
Patent Images
1. A method for capturing information from a live conversation between an operator and a customer, comprising:
- designating a context for the live conversation between the operator and the customer wherein the context is a form viewed on a computer screen and actively in use by the operator;
setting, by the operator during the live conversation, a visual cue by physically moving a cursor to an information field in the form;
monitoring the live conversation in response to setting the visual cue, wherein the visual cue triggers the conversion of the live conversation to text;
recognizing at least one portion of the live conversation as a text portion after converting the live conversation to text;
interpreting one or more cues in the live conversation, wherein the one or more cues comprises at least the visual cue;
relating the one or more cues to the information field associated with the context for the live conversation; and
storing information obtained from the text portion of the live conversation into the information field, wherein the information obtained from the text portion comprises at least one word spoken after the one or more cues.
2 Assignments
0 Petitions
Accused Products
Abstract
A method for capturing information from a live conversation between an operator and a customer, involving monitoring the live conversation between the operator and the customer, recognizing at least one portion of the live conversation as a text portion upon converting the live conversation to text, interpreting a cue in the live conversation, relating the cue to an information field associated with a context for the live conversation, and storing information obtained from the text portion into the information field, wherein the information obtained from the text portion includes at least one word spoken after the cue.
78 Citations
25 Claims
-
1. A method for capturing information from a live conversation between an operator and a customer, comprising:
-
designating a context for the live conversation between the operator and the customer wherein the context is a form viewed on a computer screen and actively in use by the operator; setting, by the operator during the live conversation, a visual cue by physically moving a cursor to an information field in the form; monitoring the live conversation in response to setting the visual cue, wherein the visual cue triggers the conversion of the live conversation to text; recognizing at least one portion of the live conversation as a text portion after converting the live conversation to text; interpreting one or more cues in the live conversation, wherein the one or more cues comprises at least the visual cue; relating the one or more cues to the information field associated with the context for the live conversation; and storing information obtained from the text portion of the live conversation into the information field, wherein the information obtained from the text portion comprises at least one word spoken after the one or more cues. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
-
10. A system for capturing information from a live conversation between an operator and a customer, comprising:
-
a context designator executing on a processor and configured to; designate a context for the live conversation between the operator and the customer wherein the context is a form viewed on a computer screen and actively in use by the operator; obtain a visual cue setting from the operator during the live conversation, wherein the operator sets the visual cue by physically moving a cursor to an information field in the form; monitor the live conversation in response to setting the visual cue, wherein the visual cue triggers the conversion of the live conversation to text; a speech recognition engine executing on the processor and configured to recognize the live conversation as a text portion after converting the live conversation to text; and a document completion engine executing on the processor and configured to; interpret one or more cues in the live conversation, wherein the one or more cues comprises at least the visual cue; relate the one or more cues to the information field associated with the context for the live conversation; and store information obtained from the text portion into the information field within the context, wherein the information obtained from the text portion comprises at least one word spoken after the one or more cues. - View Dependent Claims (11, 12, 13, 14, 15, 16, 17, 18)
-
-
19. A computer readable storage medium comprising instructions embodied thereon to perform a method for capturing information from a live conversation between an operator and a customer, comprising:
-
designating a context for the live conversation between the operator and the customer wherein the context is a form viewed on a computer screen and actively in use by the operator; setting, by the operator during the live conversation, a visual cue by physically moving a cursor to an information field in the form; monitoring the live conversation in response to setting the visual cue, wherein the visual cue triggers the conversion of the live conversation to text; recognizing at least one portion of the live conversation as a text portion after converting the live conversation to text; interpreting one or more cues in the live conversation, wherein the one or more cues comprises at least the visual cue; relating the one or more cues to the information field associated with the context for the live conversation; and storing information obtained from the text portion of the live conversation into the information field, wherein the information obtained from the text portion comprises at least one word spoken after the one or more cues. - View Dependent Claims (20, 21, 22)
-
-
23. A computer readable storage medium organized in a library comprising instructions embodied thereon to provide a method for capturing information from a live conversation between an operator and a customer, comprising:
-
designating a context for the live conversation between the operator and the customer wherein the context is a form viewed on a computer screen and actively in use by the operator; setting, by the operator during the live conversation, a visual cue by physically moving a cursor to an information field in the form; monitoring the live conversation in response to setting the visual cue, wherein the visual cue triggers the conversion of the live conversation to text; recognizing at least one portion of the live conversation as a text portion after converting the live conversation to text; interpreting one or more cues in the live conversation, wherein the one or more cues comprises at least the visual cue; relating the one or more cues to the information field associated with the context for the live conversation; and storing information obtained from the text portion of the live conversation into the information field, wherein the information obtained from the text portion comprises at least one word spoken after the one or more cues. - View Dependent Claims (24)
-
-
25. A computer system for capturing information from a live conversation between an operator and a customer, comprising:
-
a processor; a memory; a storage device; and software instruction stored in the memory for enabling the computer system under control of the processor to; designate a context for the live conversation between the operator and the customer wherein the context is a form viewed on a computer screen and actively in use by the operator; set, by the operator during the live conversation, a visual cue by physically moving a cursor to an information field in the form; monitor the live conversation in response to setting the visual cue, wherein the visual cue triggers the conversion of the live conversation to text; recognize at least one portion of the live conversation as a text portion after converting the live conversation to text; interpret one or more cues in the live conversation, wherein the one or more cues comprises at least the visual cue; relate the one or more cues to the information field associated with the context for the live conversation; and store information obtained from the text portion of the live conversation into the information field, wherein the information obtained from the text portion comprises at least one word spoken after the one or more cues.
-
Specification