Consolidating Speech Recognition Results
First Claim
1. A computer-implemented method for generating a consolidated list of speech recognition results, comprising:
- at a processor, receiving a list of candidate interpretations of spoken input;
at the processor, forming a grid of tokens from the received list, the grid being organized into a plurality of rows and a plurality of columns;
at the processor, splitting the grid into a set of column groups based on timing information, each column group comprising a plurality of token groups, each token group comprising at least one token;
at the processor, responsive to detecting duplicated token groups in the grid, removing the duplicated token groups to generate a consolidated grid; and
at an output device, outputting the candidate interpretations based on the consolidated grid.
1 Assignment
0 Petitions
Accused Products
Abstract
Candidate interpretations resulting from application of speech recognition algorithms to spoken input are presented in a consolidated manner that reduces redundancy. A list of candidate interpretations is generated, and each candidate interpretation is subdivided into time-based portions, forming a grid. Those time-based portions that duplicate portions from other candidate interpretations are removed from the grid. A user interface is provided that presents the user with an opportunity to select among the candidate interpretations; the user interface is configured to present these alternatives without duplicate elements.
323 Citations
60 Claims
-
1. A computer-implemented method for generating a consolidated list of speech recognition results, comprising:
-
at a processor, receiving a list of candidate interpretations of spoken input; at the processor, forming a grid of tokens from the received list, the grid being organized into a plurality of rows and a plurality of columns; at the processor, splitting the grid into a set of column groups based on timing information, each column group comprising a plurality of token groups, each token group comprising at least one token; at the processor, responsive to detecting duplicated token groups in the grid, removing the duplicated token groups to generate a consolidated grid; and at an output device, outputting the candidate interpretations based on the consolidated grid. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22)
-
-
23. A computer-implemented method for selecting among entries in a consolidated list of candidate interpretations of speech input, the method comprising:
-
receiving, at a processor, a consolidated list of candidate interpretations of speech input, the consolidated list comprising at least one column group having a single token group and at least one column group having a plurality of token groups; for each column group having a single token group, displaying the single token group on an output device; and for each column group having a plurality of token groups, displaying, on the output device, a first one of the token groups in a visually distinctive manner as compared with the display of a column group comprising a single token group. - View Dependent Claims (24, 25, 26, 27, 28, 29)
-
-
30. A computer-implemented method for generating a consolidated list of speech recognition results, comprising:
-
at a processor running at a server, obtaining a list of candidate interpretations of spoken input; at the processor, forming a grid of tokens from the received list, the grid being organized into a plurality of rows and a plurality of columns; at the processor, splitting the grid into a set of column groups based on timing information, each column group comprising a plurality of token groups, each token group comprising at least one token; at the processor, responsive to detecting duplicated token groups in the grid, removing the duplicated token groups to form a consolidated list of candidates; and transmitting a representation of the consolidated list of candidates from the server to a client. - View Dependent Claims (31, 32)
-
-
33. A computer program product for generating a consolidated list of speech recognition results, comprising:
-
a nontransitory computer-readable storage medium; and computer program code, encoded on the medium, configured to cause at least one processor to perform the steps of; receiving a list of candidate interpretations of spoken input; forming a grid of tokens from the received list, the grid being organized into a plurality of rows and a plurality of columns; splitting the grid into a set of column groups based on timing information, each column group comprising a plurality of token groups, each token group comprising at least one token; responsive to detecting duplicated token groups in the grid, removing the duplicated token groups to generate a consolidated grid; and causing an output device to output the candidate interpretations based on the consolidated grid. - View Dependent Claims (34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46)
-
-
47. A system for generating a consolidated list of speech recognition results, comprising:
-
a processor, configured to; receive a list of candidate interpretations of spoken input; form a grid of tokens from the received list, the grid being organized into a plurality of rows and a plurality of columns; split the grid into a set of column groups based on timing information, each column group comprising a plurality of token groups, each token group comprising at least one token; responsive to detecting duplicated token groups in the grid, remove the duplicated token groups to generate a consolidated grid; and an output device, communicatively coupled to the processor and configured to output the candidate interpretations based on the consolidated grid. - View Dependent Claims (48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60)
-
Specification