Simultaneous speech processing apparatus and method
First Claim
1. A simultaneous speech processing apparatus, comprising:
- a storage to store, as processing piece information, the processing piece character string and time information of a speech signal corresponding to a speech section in which the processing piece character string is uttered, in association with each other; and
a processor programmed to;
acquire a speech signal;
generate a decided character string and at least one candidate character string, the decided character string being a character string corresponding to a speech section in which part of the speech signal having undergone speech recognition processing is converted into a character string, the at least one candidate character string being a character string corresponding to a speech section in which a character string as a conversion result is variable during speech recognition processing in a speech section succeeding the decided character string;
detect a first character string as a processing piece character string if the first character string included in the decided character string exists commonly in one or more combined character strings on dividing the one or more combined character strings by a boundary, the one or more combined character strings being obtained by connecting the decided character string and the at least one candidate character string, the boundary indicating a morphological position serving as a start position of a processing piece in natural language processing;
output the processing piece character string; and
if a first processing piece information as new processing piece information is added in the storage and a second processing piece information exists, connect processing piece character strings included in the second processing piece information and the first processing piece information in time series to generate a reprocessing piece character string, and update the processing piece information stored in the storage with the reprocessing piece character string and time information corresponding to the reprocessing piece character string, the second processing piece information preceding the first processing piece information and corresponding to a speech section in which the second processing piece information is uttered continuously within a time falling within a threshold value.
4 Assignments
0 Petitions
Accused Products
Abstract
According to one embodiment, a simultaneous speech processing apparatus includes an acquisition unit, a speech recognition unit, a detection unit and an output unit. The acquisition unit acquires a speech signal. The speech recognition unit generates a decided character string and at least one candidate character string. The detection unit detects a first character string as a processing piece character string if the first character string included in the decided character string exists commonly in one or more combined character strings on dividing the one or more combined character strings by a boundary indicating a morphological position serving as a start position of a processing piece in natural language processing. The output unit outputs the processing piece character string.
-
Citations
17 Claims
-
1. A simultaneous speech processing apparatus, comprising:
-
a storage to store, as processing piece information, the processing piece character string and time information of a speech signal corresponding to a speech section in which the processing piece character string is uttered, in association with each other; and a processor programmed to; acquire a speech signal; generate a decided character string and at least one candidate character string, the decided character string being a character string corresponding to a speech section in which part of the speech signal having undergone speech recognition processing is converted into a character string, the at least one candidate character string being a character string corresponding to a speech section in which a character string as a conversion result is variable during speech recognition processing in a speech section succeeding the decided character string; detect a first character string as a processing piece character string if the first character string included in the decided character string exists commonly in one or more combined character strings on dividing the one or more combined character strings by a boundary, the one or more combined character strings being obtained by connecting the decided character string and the at least one candidate character string, the boundary indicating a morphological position serving as a start position of a processing piece in natural language processing; output the processing piece character string; and if a first processing piece information as new processing piece information is added in the storage and a second processing piece information exists, connect processing piece character strings included in the second processing piece information and the first processing piece information in time series to generate a reprocessing piece character string, and update the processing piece information stored in the storage with the reprocessing piece character string and time information corresponding to the reprocessing piece character string, the second processing piece information preceding the first processing piece information and corresponding to a speech section in which the second processing piece information is uttered continuously within a time falling within a threshold value. - View Dependent Claims (2, 3, 4, 5, 6)
-
-
7. A simultaneous speech processing method, comprising:
-
storing, as processing piece information, the processing piece character string and time information of a speech signal corresponding to a speech section in which the processing piece character string is uttered, in association with each other in the storage; acquiring a speech signal; generating a decided character string and at least one candidate character string, the decided character string being a character string corresponding to a speech section in which part of the speech signal having undergone speech recognition processing is converted into a character string, the at least one candidate character string being a character string corresponding to a speech section in which a character string as a conversion result is variable during speech recognition processing in a speech section succeeding the decided character string; detecting a first character string as a processing piece character string if the first character string included in the decided character string exists commonly in one or more combined character strings on dividing the one or more combined character strings by a boundary, the one or more combined character strings being obtained by connecting the decided character string and the at least one candidate character string, the boundary indicating a morphological position serving as a start position of a processing piece in natural language processing; outputting the processing piece character string; and connecting, if first processing piece information as new processing piece information is added in the storage and second processing piece information exists, processing piece character strings included in the second processing piece information and the first processing piece information in time series to generate a reprocessing piece character string, and updating the processing piece information stored in the storage with the reprocessing piece character string and time information corresponding to the reprocessing piece character string, the second processing piece information preceding the first processing piece information and corresponding to a speech section in which the second processing piece information is uttered continuously within a time falling within a threshold value. - View Dependent Claims (8, 9, 10, 11, 12)
-
-
13. A non-transitory computer readable medium including computer executable instructions, wherein the instructions, when executed by a processor, cause the processor to perform a method comprising:
-
storing, as processing piece information, the processing piece character string and time information of a speech signal corresponding to a speech section in which the processing piece character string is uttered, in association with each other in the storage; acquiring a speech signal; generating a decided character string and at least one candidate character string, the decided character string being a character string corresponding to a speech section in which part of the speech signal having undergone speech recognition processing is converted into a character string, the at least one candidate character string being a character string corresponding to a speech section in which a character string as a conversion result is variable during speech recognition processing in a speech section succeeding the decided character string; detecting a first character string as a processing piece character string if the first character string included in the decided character string exists commonly in one or more combined character strings on dividing the one or more combined character strings by a boundary, the one or more combined character strings being obtained by connecting the decided character string and the at least one candidate character string, the boundary indicating a morphological position serving as a start position of a processing piece in natural language processing; outputting the processing piece character string; and connecting, if first processing piece information as new processing piece information is added in the storage and second processing piece information exists, processing piece character strings included in the second processing piece information and the first processing piece information in time series to generate a reprocessing piece character string, and updating the processing piece information stored in the storage with the reprocessing piece character string and time information corresponding to the reprocessing piece character string, the second processing piece information preceding the first processing piece information and corresponding to a speech section in which the second processing piece information is uttered continuously within a time falling within a threshold value. - View Dependent Claims (14, 15, 16, 17)
-
Specification