SYSTEM AND METHOD FOR RESTRICTING LARGE LANGUAGE MODELS
First Claim
1. A computer-readable storage medium storing instructions which, when executed by a computing device, cause the computing device to generate a masked language model, the instructions comprising:
- receiving a language model having a vocabulary of words;
identifying an adaptation subset of related words in the language model; and
adding a bit mask to each word in the large vocabulary of words based on the adaptation subset, thereby generating a masked language model.
3 Assignments
0 Petitions
Accused Products
Abstract
Disclosed herein are systems, methods, and computer-readable storage media for performing speech recognition based on a masked language model. A system configured to practice the method receives a masked language model including a plurality of words, wherein a bit mask identifies whether each of the plurality of words is allowed or disallowed with regard to an adaptation subset, receives input speech, generates a speech recognition lattice based on the received input speech using the masked language model, removes from the generated lattice words identified as disallowed by the bit mask for the adaptation subset, and recognizes the received speech based on the lattice. Alternatively during the generation step, the system can only add words indicated as allowed by the bit mask. The bit mask can be separate from or incorporated as part of the masked language model. The system can dynamically update the adaptation subset and bit mask.
-
Citations
20 Claims
-
1. A computer-readable storage medium storing instructions which, when executed by a computing device, cause the computing device to generate a masked language model, the instructions comprising:
-
receiving a language model having a vocabulary of words; identifying an adaptation subset of related words in the language model; and adding a bit mask to each word in the large vocabulary of words based on the adaptation subset, thereby generating a masked language model. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A system for recognizing speech using a masked language model, the system comprising:
-
a processor; a module controlling the processor to receive a masked language model including a plurality of words, wherein a bit mask identifies whether each of the plurality of words is allowed or disallowed with regard to an adaptation subset; a module controlling the processor to receive input speech; a module controlling the processor to generate a speech recognition lattice based on the received input speech using the masked language model; a module controlling the processor to remove from the generated lattice words identified as disallowed by the bit mask for the adaptation subset; and a module controlling the processor to recognize the received speech based on the lattice. - View Dependent Claims (9, 10, 11, 12, 13, 14)
-
-
15. A method for generating a masked language model, the method causing a computing device to perform steps comprising:
-
receiving a language model having a vocabulary of words; identifying an adaptation subset of related words in the language model; and generating a masked language model by adding a bit mask to the language model based on the adaptation subset. - View Dependent Claims (16, 17, 18, 19, 20)
-
Specification