SYSTEMS AND METHODS FOR AUTOMATIC REPAIR OF SPEECH RECOGNITION ENGINE OUTPUT
First Claim
1. A processor implemented method (200) comprising:
- receiving a text output from a general purpose automatic speech recognition (GP-ASR) engine (202);
identifying in real-time, a current environment of a speaker associated with the text output based on one or more environment variables associated thereof (204); and
automatically and selectively correcting the output of the GP-ASR by replacing one or more erroneously recognized terms in the text output with one or more best-fit terms based on at least a fitness function and a pre-determined threshold associated with the identified current environment (206).
2 Assignments
0 Petitions
Accused Products
Abstract
Text output of speech recognition engines tend to be erroneous when spoken data has domain specific terms. The present disclosure facilitates automatic correction of errors in speech to text conversion using abstractions of evolutionary development and artificial development. The words in a speech recognition engine text output are treated as a set of injured genes in a biological cell that need repair which are then repaired and form genotypes that are then repaired to phenotypes through a series of repair steps based on a matching, mapping and linguistic repair through a fitness criteria. A basic genetic level repair involves phonetic MATCHING function together with a FITNESS function to select the best among the matching genes. A second genetic level repair involves a contextual MAPPING function for repairing remaining ‘injured’ genes of the speech recognition engine output. Finally, a genotype to phenotype repair involves using linguistic rules and semantic rules of the domain.
4 Citations
19 Claims
-
1. A processor implemented method (200) comprising:
-
receiving a text output from a general purpose automatic speech recognition (GP-ASR) engine (202); identifying in real-time, a current environment of a speaker associated with the text output based on one or more environment variables associated thereof (204); and automatically and selectively correcting the output of the GP-ASR by replacing one or more erroneously recognized terms in the text output with one or more best-fit terms based on at least a fitness function and a pre-determined threshold associated with the identified current environment (206). - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. A system (100) comprising:
-
one or more data storage devices (102) operatively coupled to one or more hardware processors (104) and configured to store instructions configured for execution by the one or more hardware processors to; receive a text output from a general purpose automatic speech recognition (GP-ASR) engine; identify in real-time, a current environment of a speaker associated with the text output based on one or more environment variables associated thereof;
the identified current environment being classified as at least one of a broad classification and a granular classification; andautomatically and selectively correct the output of the GP-ASR by replacing one or more erroneously recognized terms in the text output with one or more best-fit terms based on at least a fitness function and a pre-determined threshold associated with the identified current environment. - View Dependent Claims (12, 13, 14, 15, 16, 17, 18)
-
-
19. A computer program product comprising a non-transitory computer readable medium having a computer readable program embodied therein, wherein the computer readable program, when executed on a computing device, causes the computing device to:
-
receive a text output from a general purpose automatic speech recognition (GP-ASR) engine; identify in real-time, a current environment of a speaker associated with the text output based on one or more environment variables associated thereof; and automatically and selectively correct the output of the GP-ASR by replacing one or more erroneously recognized terms in the text output with one or more best-fit terms based on at least a fitness function and a pre-determined threshold associated with the identified current environment.
-
Specification