Library of existing spoken dialog data for use in generating new natural language spoken dialog systems
First Claim
1. A method comprising:
- collecting a plurality of audible utterances, via a plurality of industry specific spoken dialog systems, and during a plurality of collection phases comprising respective defined periods of time, to yield stored audible utterances;
organizing, via a processor, the stored audible utterances into a plurality of datasets having call-type labels, wherein each dataset in the plurality of datasets pertains to a unique industrial sector in a plurality of industrial sectors, and wherein the each dataset for its respective unique industrial sector is independent of other datasets in the plurality of industrial sectors;
generating an annotation guide comprising mnemonic names, textual descriptions, and both a positive example utterance and a negative example utterance for an associated call-type; and
building a natural language spoken dialog system using the plurality of datasets and the annotation guide, wherein datasets having a call-type label associated with the negative example utterance of the annotation guide are not included in the natural language spoken dialog system.
4 Assignments
0 Petitions
Accused Products
Abstract
A machine-readable medium may include a group of reusable components for building a spoken dialog system. The reusable components may include a group of previously collected audible utterances. A machine-implemented method to build a library of reusable components for use in building a natural language spoken dialog system may include storing a dataset in a database. The dataset may include a group of reusable components for building a spoken dialog system. The reusable components may further include a group of previously collected audible utterances. A second method may include storing at least one set of data. Each one of the at least one set of data may include ones of the reusable components associated with audible data collected during a different collection phase.
-
Citations
20 Claims
-
1. A method comprising:
-
collecting a plurality of audible utterances, via a plurality of industry specific spoken dialog systems, and during a plurality of collection phases comprising respective defined periods of time, to yield stored audible utterances; organizing, via a processor, the stored audible utterances into a plurality of datasets having call-type labels, wherein each dataset in the plurality of datasets pertains to a unique industrial sector in a plurality of industrial sectors, and wherein the each dataset for its respective unique industrial sector is independent of other datasets in the plurality of industrial sectors; generating an annotation guide comprising mnemonic names, textual descriptions, and both a positive example utterance and a negative example utterance for an associated call-type; and building a natural language spoken dialog system using the plurality of datasets and the annotation guide, wherein datasets having a call-type label associated with the negative example utterance of the annotation guide are not included in the natural language spoken dialog system. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A system comprising:
-
a processor; and a computer-readable storage medium having instructions stored which, when executed by the processor, result in the processor performing operations comprising; collecting a plurality of audible utterances, via a plurality of industry specific spoken dialog systems, and during a plurality of collection phases comprising respective defined periods of time, to yield stored audible utterances; organizing the stored audible utterances into a plurality of datasets having call-type labels, wherein each dataset in the plurality of datasets pertains to a unique industrial sector in a plurality of industrial sectors, and wherein the each dataset for its unique industrial sector is independent of other datasets in the plurality of industrial sectors; generating an annotation guide comprising mnemonic names, textual descriptions, and both a positive example utterance and negative example utterance for an associated call-type; and building a natural language spoken dialog system using the plurality of datasets and the annotation guide, wherein datasets having a call-type label associated with the negative example utterance of the annotation guide are not included in the natural language spoken dialog system. - View Dependent Claims (9, 10, 11, 12, 13, 14)
-
-
15. A computer-readable storage device having instructions stored which, when executed by a computing device, result in the computing device performing operations comprising:
-
collecting a plurality of audible utterances, via a plurality of industry specific spoken dialog systems, and during a plurality of collection phases comprising respective defined periods of time, to yield stored audible utterances; organizing the stored audible utterances into a plurality of datasets having call-type labels, wherein each dataset in the plurality of datasets pertains to a unique industrial sector in a plurality of industrial sectors, and wherein the each dataset for its respective unique industrial sector is independent of other datasets in the plurality of industrial sectors; generating an annotation guide comprising mnemonic names, textual descriptions, and both a positive example utterance and negative example utterance for an associated call-type; and building a natural language spoken dialog system using the plurality of datasets and the annotation guide, wherein datasets having a call-type label associated with the negative example utterance of the annotation guide are not included in the natural language spoken dialog system. - View Dependent Claims (16, 17, 18, 19, 20)
-
Specification