System and method for robust access and entry to large structured data using voice form-filling
First Claim
Patent Images
1. A method comprising:
- performing, via a processor, a first pass of entries in a database to generate a shortlist of the entries, the first pass comprising;
utilizing a phonotactic grammar to perform speech recognition on received speech and to generate a phoneme lattice, wherein the utilizing ignores inter-field constraints of the phonotactic grammar;
removing silence and filler words from the phoneme lattice to yield a revised phoneme lattice;
normalizing costs in the revised phoneme lattice, to yield a normalized phoneme lattice with a best path having a cost of zero;
identifying N-best lowest cost indices of an index for the database based on the normalized phoneme lattice; and
generating the shortlist of the entries in the database based on the N-best lowest cost indices;
performing a second pass on the shortlist of the entries in the database to obtain a final result, the second pass comprising;
generating a grammar from the shortlist of the entries in the database to yield a generated grammar, having inter-field constraints; and
identifying data from the database for each of the input field based on the received speech, the generated grammar, and the inter-field constraints; and
populating an input field of a form using the final result.
5 Assignments
0 Petitions
Accused Products
Abstract
A method, apparatus and machine-readable medium are provided. A phonotactic grammar is utilized to perform speech recognition on received speech and to generate a phoneme lattice. A document shortlist is generated based on using the phoneme lattice to query an index. A grammar is generated from the document shortlist. Data for each of at least one input field is identified based on the received speech and the generated grammar.
48 Citations
23 Claims
-
1. A method comprising:
-
performing, via a processor, a first pass of entries in a database to generate a shortlist of the entries, the first pass comprising; utilizing a phonotactic grammar to perform speech recognition on received speech and to generate a phoneme lattice, wherein the utilizing ignores inter-field constraints of the phonotactic grammar; removing silence and filler words from the phoneme lattice to yield a revised phoneme lattice; normalizing costs in the revised phoneme lattice, to yield a normalized phoneme lattice with a best path having a cost of zero; identifying N-best lowest cost indices of an index for the database based on the normalized phoneme lattice; and generating the shortlist of the entries in the database based on the N-best lowest cost indices; performing a second pass on the shortlist of the entries in the database to obtain a final result, the second pass comprising; generating a grammar from the shortlist of the entries in the database to yield a generated grammar, having inter-field constraints; and identifying data from the database for each of the input field based on the received speech, the generated grammar, and the inter-field constraints; and populating an input field of a form using the final result. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
-
10. A system comprising:
-
a processor; and a computer-readable storage medium having instructions stored which, when executed by the processor, cause the processor to perform operations comprising; performing a first pass of entries in a database to generate a shortlist of the entries, the first pass comprising; utilizing a phonotactic grammar to perform speech recognition on received speech and to generate a phoneme lattice, wherein the utilizing ignores inter-field constraints of the phonotactic grammar; removing silence and filler words from the phoneme lattice to generate a revised phoneme lattice; normalizing costs in the revised phoneme lattice, to yield a normalized phoneme lattice with a best path having a cost of zero; identifying N-best lowest cost indices of an index for the database based on the normalized phoneme lattice; and generating the shortlist of the entries in the database based on the N-best lowest cost indices; perform a second pass on the shortlist of the entries in the database to obtain a final result, the second pass comprising; generating a grammar from the shortlist of the entries in the database to yield a generated grammar, having inter-field constraints; and identifying data from the database for each of the input field based on the received speech, the generated grammar, and the inter-field constraints; and populating an input field of a form using the final result. - View Dependent Claims (11, 12, 13, 14, 15, 16, 17)
-
-
18. A computer-readable storage device having instructions stored which, when executed by a computing device, cause the computing device to perform operations comprising:
-
instructions for performing, via a processor, a first pass of entries in a database to generate a shortlist of the entries, the first pass comprising; utilizing a phonotactic grammar to perform speech recognition on received speech and to generate a phoneme lattice, wherein the utilizing ignores inter-field constraints of the phonotactic grammar; removing silence and filler words from the phoneme lattice to yield a revised phoneme lattice; normalizing costs in the revised phoneme lattice, to yield a normalized phoneme lattice with a best path having a cost of zero; identifying N-best lowest cost indices of an index for the database based on the normalized phoneme lattice; and generating the shortlist of the entries in the database based on the N-best lowest cost indices; and instructions for performing a second pass on the shortlist of the entries in the database to obtain a final result, the second pass comprising; generating a grammar from the shortlist of the entries in the database to yield a generated grammar, having inter-field constraints; and identifying data from the database for each of the input field based on the received speech, the generated grammar, and the inter-field constraints; and
populating an input field of a form using the final result. - View Dependent Claims (19, 20, 21, 22, 23)
-
Specification