GENERATING ACOUSTIC MODELS
First Claim
Patent Images
1. A computer-implemented method comprising:
- receiving, at a computer system, a request to generate or modify a target acoustic model for a target language;
accessing, by the computer system, a source acoustic model for a source language, wherein the source acoustic model includes information that maps acoustic features of the source language to phonemes in a transformed feature space;
aligning, using the source acoustic model in the transformed feature space, untransformed voice data in the target language with phonemes in a corresponding textual transcript to obtain aligned voice data, wherein the untransformed voice data is in an untransformed feature space;
transforming the aligned voice data according to a particular transform operation using the source acoustic model to obtain transformed voice data;
adapting the source acoustic model to the target language using the untransformed voice data in the target language to obtain an adapted acoustic model; and
training, by the computer system, a target acoustic model for the target language using the transformed voice data and the adapted acoustic model; and
providing the target acoustic model in association with the target language.
2 Assignments
0 Petitions
Accused Products
Abstract
This document describes methods, systems, techniques, and computer program products for generating and/or modifying acoustic models. Acoustic models and/or transformations for a target language/dialect can be generated and/or modified using acoustic models and/or transformations from a source language/dialect.
29 Citations
20 Claims
-
1. A computer-implemented method comprising:
-
receiving, at a computer system, a request to generate or modify a target acoustic model for a target language; accessing, by the computer system, a source acoustic model for a source language, wherein the source acoustic model includes information that maps acoustic features of the source language to phonemes in a transformed feature space; aligning, using the source acoustic model in the transformed feature space, untransformed voice data in the target language with phonemes in a corresponding textual transcript to obtain aligned voice data, wherein the untransformed voice data is in an untransformed feature space; transforming the aligned voice data according to a particular transform operation using the source acoustic model to obtain transformed voice data; adapting the source acoustic model to the target language using the untransformed voice data in the target language to obtain an adapted acoustic model; and training, by the computer system, a target acoustic model for the target language using the transformed voice data and the adapted acoustic model; and providing the target acoustic model in association with the target language. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
-
-
9. A system comprising:
-
a computer system; an interface of the computer system to receive a request to generate or modify a target acoustic model for a target language; an acoustic model repository of the computer system to provide access to a source acoustic model for a source language, wherein the source acoustic model includes information that maps acoustic features of the source language to phonemes in a transformed feature space; an alignment component of the computer system to use the source acoustic model in the transformed feature space to align untransformed voice data in the target language with phonemes in a corresponding textual transcript to obtain aligned voice data, wherein the untransformed voice data is in an untransformed feature space; and a target model generator of the computer system to i) transform the aligned voice data according to a particular transform operation using the source acoustic model to obtain transformed voice data, ii) adapt the source acoustic model to the target language using the untransformed voice data in the target language to obtain an adapted acoustic model; and
iii) train a target acoustic model for the target language using the transformed voice data and the adapted acoustic model;wherein the interface is further configured to provide access to the target acoustic model. - View Dependent Claims (10, 11, 12, 13, 14, 15, 16)
-
-
17. A system comprising:
-
a computer system; an interface of the computer system to receive a request to generate or modify a target acoustic model for a target language; an acoustic model repository of the computer system to provide access to a source acoustic model for a source language, wherein the source acoustic model includes information that maps acoustic features of the source language to phonemes in a transformed feature space; an alignment component of the computer system to use the source acoustic model in the transformed feature space to align untransformed voice data in the target language with phonemes in a corresponding textual transcript to obtain aligned voice data, wherein the untransformed voice data is in an untransformed feature space; and means for generating a target acoustic model for a target language from using the source acoustic model in the transformed feature space and the aligned voice data in the untransformed feature space; wherein the interface is further configured to provide access to the target acoustic model. - View Dependent Claims (18, 19, 20)
-
Specification